GET
/
api
/
dataset
/
crawl_options
/
{dataset_id}
curl --request GET \
  --url https://api.trieve.ai/api/dataset/crawl_options/{dataset_id} \
  --header 'Authorization: <api-key>' \
  --header 'TR-Dataset: <tr-dataset>'
{
  "crawl_options": {
    "exclude_paths": [
      "https://example.com/exclude"
    ],
    "exclude_tags": [
      "#ad",
      "#footer"
    ],
    "include_paths": [
      "https://example.com/include"
    ],
    "include_tags": [
      "h1",
      "p",
      "a",
      ".main-content"
    ],
    "interval": "daily",
    "limit": 1000,
    "site_url": "https://example.com"
  }
}

Authorizations

Authorization
string
header
required

Headers

TR-Dataset
string
required

The dataset id or tracking_id to use for the request. We assume you intend to use an id if the value is a valid uuid.

Path Parameters

dataset_id
string
required

The id of the dataset you want to retrieve.

Response

200
application/json
Crawl options retrieved successfully
crawl_options
object

Options for setting up the crawl which will populate the dataset.