Get latest schema for a domain + page type
GET
/api/v1/extraction/schemas/{domain}/{pageType}
const url = 'https://dashboard.justcrawl.io/api/v1/extraction/schemas/example/product';const options = {method: 'GET', headers: {Authorization: 'Bearer <token>'}};
try { const response = await fetch(url, options); const data = await response.json(); console.log(data);} catch (error) { console.error(error);}curl --request GET \ --url https://dashboard.justcrawl.io/api/v1/extraction/schemas/example/product \ --header 'Authorization: Bearer <token>'Authorizations
Section titled “Authorizations ”Parameters
Section titled “ Parameters ”Path Parameters
Section titled “Path Parameters ” domain
required
string
pageType
required
string
Responses
Section titled “ Responses ”Latest extraction schema
Media type application/json
object
id
string
domain
string
pageType
string
version
integer
attributes
Array<object>
object
name
required
string
xpaths
required
Array<string>
type
required
string
description
string
antiPatterns
Array<string>
semanticType
string
validation
object
minLength
integer
maxLength
integer
min
number
max
number
pattern
string
confidence
number
discoveredBy
string
sampleCount
integer
lastValidatedAt
string format: date-time
expiresAt
string format: date-time
createdAt
string format: date-time
Example
{ "pageType": "product", "attributes": [ { "type": "text", "semanticType": "brand_name" } ], "discoveredBy": "llm"}Resource not found
Media type application/json
object
error
string
Example generated
{ "error": "example"}