Skip to content

Re-extract historical results with the current schema

POST
/api/v1/extraction/backfill
curl --request POST \
--url https://dashboard.justcrawl.io/api/v1/extraction/backfill \
--header 'Authorization: Bearer <token>' \
--header 'Content-Type: application/json' \
--data '{ "domain": "example", "pageType": "example", "resultIds": [ "2489E9AD-2EE2-8E00-8EC9-32D5F69181C0" ], "limit": 20 }'

Synchronous backfill capped at 20 results per call. Use resultIds to target specific rows, or omit to process the most recent results for the domain + page type.

Media type application/json
object
domain
required
string
pageType
required
string
resultIds
Array<string>
limit
integer
default: 20 <= 50

Backfill summary

Media type application/json
object
processed
integer
skipped
integer
failed
integer
results
Array<object>
object
id
string
status
string
newFields
integer
Example generated
{
"processed": 1,
"skipped": 1,
"failed": 1,
"results": [
{
"id": "example",
"status": "example",
"newFields": 1
}
]
}

Validation error or batch too large

No schema for domain + pageType