Create a Bulk Extract Job

Extract content from a list of URLs asynchronously.

Body Params
string
required

Your Diffbot token.

string
required

Job name. This should be a unique identifier and will be used to modify your bulk job and retrieve its output.

string
required

Space-delimited list of URLs to process. If you are on the Startup plan, jobs must contain at least 50 URLs.

string
required

The full URL of the Extract (or Custom) API to use for each URL. For instance, to process each URL via the Article API, supply https://api.diffbot.com/v3/article. You may also include API parameters, e.g. https://api.diffbot.com/v3/article?fields=meta,tags.
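The required parameters can be assembled into a form-encoded POST body along the following lines. This is a minimal sketch only: the field names `token`, `name`, `urls`, and `apiUrl` are assumptions (the listing gives types and descriptions but not names), and the token and job name are placeholders. Nothing is sent over the network here.

```python
from urllib.parse import urlencode, parse_qs

def build_bulk_job_body(token, name, urls, api_url):
    """Return a form-encoded body for a bulk job request.

    The field names (token, name, urls, apiUrl) are assumed for
    illustration; confirm them against the live reference.
    """
    return urlencode({
        "token": token,
        "name": name,
        "urls": " ".join(urls),  # space-delimited list of URLs
        "apiUrl": api_url,       # Extract API URL, optionally with parameters
    })

body = build_bulk_job_body(
    token="YOUR_DIFFBOT_TOKEN",  # placeholder
    name="example-bulk-job",     # hypothetical job name
    urls=["https://example.com/a", "https://example.com/b"],
    api_url="https://api.diffbot.com/v3/article?fields=meta,tags",
)
print(body)
```

Because the API URL carries its own query string, it has to be percent-encoded as a value, which `urlencode` does here.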

string

Set custom headers to be used when processing each URL. To set several headers, send multiple customHeaders values in your POST body, each with the header key and value delimited by a colon and URL-encoded.
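One way to produce several `customHeaders` values, each in colon-delimited and URL-encoded form, is sketched below. The header names and values are hypothetical, and the exact `Key:Value` layout (no space after the colon) is an assumption based on the description above.

```python
from urllib.parse import urlencode, parse_qsl

def encode_custom_headers(headers):
    """Encode each header as its own customHeaders value in 'Key:Value' form."""
    pairs = [("customHeaders", f"{key}:{value}") for key, value in headers.items()]
    # urlencode percent-encodes the colon and other reserved characters.
    return urlencode(pairs)

encoded = encode_custom_headers({
    "Accept-Language": "en-US",
    "X-Example": "hello world",  # hypothetical header
})
print(encoded)
# customHeaders=Accept-Language%3Aen-US&customHeaders=X-Example%3Ahello+world
```

Passing a list of pairs, rather than a dict, is what allows the same key to appear more than once in the encoded body.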

string

Send a message to this email address when the bulk job completes.

string

Pass a URL to notify when the bulk job completes. That URL will receive a POST request with a JSON body.
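A receiver for the completion notification might look like the sketch below, which uses only the Python standard library. The shape of the JSON payload is not described here, so the handler just parses and stores whatever arrives; `NotifyHandler`, `parse_notification`, and `received` are hypothetical names.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

received = []  # parsed payloads, kept in memory for illustration only

def parse_notification(raw_body):
    """Parse the POSTed bytes as JSON; raises ValueError on invalid input."""
    return json.loads(raw_body)

class NotifyHandler(BaseHTTPRequestHandler):
    def do_POST(self):
        length = int(self.headers.get("Content-Length", 0))
        raw_body = self.rfile.read(length)
        try:
            received.append(parse_notification(raw_body))
        except ValueError:
            # Body was not valid JSON.
            self.send_response(400)
            self.end_headers()
            return
        self.send_response(200)
        self.end_headers()

    def log_message(self, format, *args):
        pass  # silence the default per-request logging

def make_server(port=0):
    """Bind to localhost; port 0 lets the OS pick a free port."""
    return HTTPServer(("127.0.0.1", port), NotifyHandler)

# To run: make_server(8000).serve_forever()
```

In practice the URL you register would need to be publicly reachable; the localhost binding above is for local experimentation only.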

string

Pass obeyRobots=0 to ignore a site's robots.txt instructions.

float

Specify the interval, in days, at which to repeat this job, as a floating-point value (e.g. repeat=7.0). By default, bulk jobs are not repeated.
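Since `repeat` is expressed in days as a floating-point value, an interval held as a `timedelta` can be converted as in this small sketch. The helper name `repeat_value` is hypothetical, and whether fractional days are accepted by the service is an assumption drawn from the floating-point type.

```python
from datetime import timedelta

def repeat_value(interval):
    """Convert a timedelta into a floating-point number of days."""
    days = interval / timedelta(days=1)  # timedelta / timedelta yields a float
    if days <= 0:
        raise ValueError("repeat interval must be positive")
    return days

print(f"repeat={repeat_value(timedelta(days=7))}")    # repeat=7.0
print(f"repeat={repeat_value(timedelta(hours=12))}")  # repeat=0.5
```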

string

Enter ||-separated strings to limit pages processed to those whose HTML contains any of the content strings. If a page does not contain at least one of the strings, it will be ignored.
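The filtering rule described above can be illustrated client-side with the sketch below. It is not the service's implementation: details such as case sensitivity and whitespace handling are not specified here, so plain substring matching is an assumption, as is treating an empty pattern as "no filter".

```python
def page_matches(html, pattern):
    """Return True if the HTML contains at least one of the ||-separated strings."""
    needles = [s for s in pattern.split("||") if s]
    if not needles:
        return True  # assumption: an empty pattern filters nothing out
    return any(needle in html for needle in needles)

pattern = 'class="article-body"||id="post"'
print(page_matches('<div class="article-body">text</div>', pattern))  # True
print(page_matches("<p>no markers here</p>", pattern))                # False
```

A check like this can be handy for previewing which pages a given pattern would keep before submitting a large job.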
