List extraction history for one URL item

GET /api/v1/extraction/urls/{urlItemId}/extractions

curl --request GET \
--url 'https://dashboard.justcrawl.io/api/v1/extraction/urls/2489E9AD-2EE2-8E00-8EC9-32D5F69181C0/extractions?page=1&pageSize=25' \
--header 'Authorization: Bearer <token>'
urlItemId (path, required): string, format: uuid
page (query): integer, default: 1, minimum: 1
pageSize (query): integer, default: 25, maximum: 100
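
The parameter constraints above can be checked on the client before a request is sent. The following is a minimal sketch, not an official client: `BASE_URL` and `build_extractions_url` are hypothetical names, the host is copied from the curl example, and the lower bound of 1 for `pageSize` is an assumption, since only the upper bound of 100 is documented.

```python
import uuid
from urllib.parse import urlencode

BASE_URL = "https://dashboard.justcrawl.io"  # host taken from the curl example


def build_extractions_url(url_item_id: str, page: int = 1, page_size: int = 25) -> str:
    """Return the request URL after checking the documented parameter constraints."""
    # Format check only: uuid.UUID raises ValueError for a malformed UUID string.
    uuid.UUID(url_item_id)
    if page < 1:
        raise ValueError("page must be >= 1")
    # Only the maximum of 100 is documented; the minimum of 1 is an assumption.
    if not 1 <= page_size <= 100:
        raise ValueError("pageSize must be between 1 and 100")
    query = urlencode({"page": page, "pageSize": page_size})
    return f"{BASE_URL}/api/v1/extraction/urls/{url_item_id}/extractions?{query}"
```

Called with only the UUID from the curl example, the function returns the same URL that the curl command requests.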

Paginated extraction history for the URL item.

Media type: application/json

object
  items: Array<object>
    id: string, format: uuid
    orgId: string, format: uuid
    jobId: string, format: uuid
    domain: string
    schemaId: string, nullable
    dataType: string
    normalized: object, additional properties: any
    rawBlobKey: string, nullable. Internal storage key for the raw HTML blob.
    qualityCompleteness: number
    qualityValidation: number
    qualityComposite: number
    latencyMs: integer, nullable
    deliveredAt: string, format: date-time, nullable
    createdAt: string, format: date-time
  total: integer
  url: string, nullable
  domain: string, nullable
  indexingPending: integer. Count of this URL’s jobs sitting in extraction_done awaiting the bulk-writer flush (Phase 1 Lane D).

Example generated
{
  "items": [
    {
      "id": "2489E9AD-2EE2-8E00-8EC9-32D5F69181C0",
      "orgId": "2489E9AD-2EE2-8E00-8EC9-32D5F69181C0",
      "jobId": "2489E9AD-2EE2-8E00-8EC9-32D5F69181C0",
      "domain": "example",
      "schemaId": "example",
      "dataType": "example",
      "normalized": {},
      "rawBlobKey": "example",
      "qualityCompleteness": 1,
      "qualityValidation": 1,
      "qualityComposite": 1,
      "latencyMs": 1,
      "deliveredAt": "2026-04-15T12:00:00Z",
      "createdAt": "2026-04-15T12:00:00Z"
    }
  ],
  "total": 1,
  "url": "example",
  "domain": "example",
  "indexingPending": 1
}
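
Because the response is paginated, reading the full history means requesting successive pages. The sketch below shows one way to do that, assuming `total` is the number of records across all pages; this page gives only its type, so that reading is a guess. `iter_extractions` and the `fetch_page` callable are hypothetical names: `fetch_page(page, page_size)` stands for whatever code performs the GET request and returns the decoded JSON body.

```python
from typing import Any, Callable, Dict, Iterator

Page = Dict[str, Any]


def iter_extractions(
    fetch_page: Callable[[int, int], Page], page_size: int = 25
) -> Iterator[Dict[str, Any]]:
    """Yield every extraction record, requesting pages until `total` is reached."""
    page = 1
    seen = 0
    while True:
        body = fetch_page(page, page_size)
        items = body.get("items", [])
        for item in items:
            yield item
        seen += len(items)
        # Stop once `total` records have been seen, or on an empty page,
        # which guards against `total` changing between requests.
        if not items or seen >= body.get("total", 0):
            break
        page += 1
```

Passing the fetcher in as a callable keeps the paging logic independent of the HTTP library and lets it be exercised with canned responses.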

Missing or invalid authentication token

Media type: application/json
object
  error: string
Example generated
{
  "error": "example"
}

Insufficient permissions

Media type: application/json
object
  error: string
Example generated
{
  "error": "example"
}

Resource not found

Media type: application/json
object
  error: string
Example generated
{
  "error": "example"
}
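
All three error responses share the same body shape, so a client can handle them with one helper. The sketch below assumes the descriptions correspond to HTTP 401, 403 and 404, which are the conventional codes for them; this page does not state the status codes. `ERROR_MEANINGS` and `describe_error` are hypothetical names.

```python
import json

# Assumed mapping: the status codes are not stated on this page.
ERROR_MEANINGS = {
    401: "Missing or invalid authentication token",
    403: "Insufficient permissions",
    404: "Resource not found",
}


def describe_error(status: int, raw_body: bytes) -> str:
    """Combine the assumed status meaning with the `error` string from the body."""
    meaning = ERROR_MEANINGS.get(status, "Unexpected error")
    try:
        detail = json.loads(raw_body).get("error")
    except (ValueError, AttributeError):
        detail = None  # body was not a JSON object
    return f"{status} {meaning}: {detail}" if detail else f"{status} {meaning}"
```

With `urllib.request`, a failed call raises `urllib.error.HTTPError`; its `code` attribute and `read()` result can be passed directly to `describe_error`.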