# Bright Data AI integration on Definable

> Bright Data provides the world's #1 web data platform with Web Unlocker for bypassing anti-bot systems, SERP API for search engine data, and pre-made scrapers for popular websites.

## What this connects

Definable connects agents to Bright Data's web data platform: Web Unlocker for getting past anti-bot systems, the SERP API for search engine data, and pre-made scrapers for popular websites. Together these let a workflow collect web data at scale.

Vendor: https://brightdata.com/

## Tools available

**10** tools available:

- `BRIGHTDATA_CRAWL_API` — Trigger Site Crawl — Tool to trigger an asynchronous site crawl job to extract content across multiple pages or entire domains. Returns a snapshot_id required by BRIGHTDATA_GET_SNAPSHOT_STATUS (poll until complete) and BRIGHTDATA_GET_SNAPSHOT_RESULTS (call only after completion; querying early yields empty or partial data). Use when you need to start a crawl for a given dataset and list of URLs. Large crawls can produce very large payloads — fetch results incrementally.
- `BRIGHTDATA_DATASET_LIST` — Browse Available Scrapers — Tool to list all available pre-made scrapers (datasets) from Bright Data's marketplace. Use when you need to browse available data sources for structured scraping.
- `BRIGHTDATA_FILTER_DATASET` — Filter Dataset — Tool to apply custom filter criteria to a marketplace dataset (BETA). Use after selecting a dataset to generate a filtered snapshot.
- `BRIGHTDATA_GET_LIST_OF_AVAILABLE_CITIES` — Get Available Cities — Tool to get available static network cities for a given country. Use when you need to configure static proxy endpoints after selecting a country.
- `BRIGHTDATA_GET_LIST_OF_AVAILABLE_COUNTRIES` — Get Available Countries — Tool to list available countries and their ISO 3166-1 alpha-2 codes. Use when you need to configure zones with valid country codes before provisioning proxies.
- `BRIGHTDATA_GET_SNAPSHOT_RESULTS` — Download Scraped Data — Tool to retrieve the scraped data from a completed crawl job by snapshot ID. Only call after confirming the job is complete via BRIGHTDATA_GET_SNAPSHOT_STATUS — querying before completion yields empty or partial data. Use after triggering a crawl or filtering a dataset to download the collected data.
- `BRIGHTDATA_GET_SNAPSHOT_STATUS` — Check Crawl Status — Tool to check the processing status of a crawl job using snapshot ID. Call before attempting to download results to ensure data collection is complete.
- `BRIGHTDATA_LIST_WEB_UNLOCKER_ZONES` — List Unlocker Zones — Tool to list your configured Web Unlocker zones and proxy endpoints. Use to view available zones for web scraping and bot protection bypass.
- `BRIGHTDATA_SERP_SEARCH` — SERP Search — Tool to perform SERP (Search Engine Results Page) searches across different search engines using Bright Data's SERP Scrape API. Use when you need to retrieve search results, trending topics, or competitive analysis data. This action submits an asynchronous request and returns a response ID for tracking.
- `BRIGHTDATA_WEB_UNLOCKER` — Web Unlocker — Tool to bypass bot detection, captcha, and other anti-scraping measures to extract content from websites. Use when you need to scrape websites that block automated access or require JavaScript rendering. Some responses may still contain CAPTCHA challenge pages or incomplete HTML; inspect the response before retrying, and avoid aggressive retry loops.
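
The three crawl tools in the list form a trigger, poll, fetch sequence. The sketch below shows one way that sequence could be wired together. It is illustrative only: `call_tool` is a hypothetical callable standing in for however your workflow invokes a tool, the argument names (`dataset_id`, `urls`) and the `status` field with its `ready` and `failed` values are assumptions, and only the tool names and `snapshot_id` come from the descriptions above.

```python
import time
from typing import Any, Callable, Dict, List

# Hypothetical signature: call_tool(tool_name, arguments) -> response dict.
ToolCaller = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def crawl_and_fetch(
    call_tool: ToolCaller,
    dataset_id: str,
    urls: List[str],
    poll_seconds: float = 5.0,
    max_polls: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Trigger a crawl, poll until it finishes, then download the results."""
    # Argument names here are assumptions made for illustration.
    started = call_tool(
        "BRIGHTDATA_CRAWL_API", {"dataset_id": dataset_id, "urls": urls}
    )
    snapshot_id = started["snapshot_id"]

    for _ in range(max_polls):
        status = call_tool(
            "BRIGHTDATA_GET_SNAPSHOT_STATUS", {"snapshot_id": snapshot_id}
        )
        state = status.get("status")  # assumed field name and values
        if state == "ready":
            # Fetch only after the job reports completion, to avoid
            # empty or partial data.
            return call_tool(
                "BRIGHTDATA_GET_SNAPSHOT_RESULTS", {"snapshot_id": snapshot_id}
            )
        if state == "failed":
            raise RuntimeError(f"crawl {snapshot_id} failed")
        sleep(poll_seconds)

    raise TimeoutError(
        f"crawl {snapshot_id} not complete after {max_polls} polls"
    )
```

Capping the number of polls keeps a stuck job from blocking the workflow indefinitely, and passing `sleep` as a parameter makes the loop easy to exercise in tests without real delays.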

## Auth

Auth schemes: `API_KEY`.
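
An `API_KEY` scheme generally means a single secret attached to every request. As a rough sketch of handling such a key safely in a script, the helper below reads it from the environment and fails fast when it is missing. The variable name `BRIGHTDATA_API_KEY` and the helper itself are hypothetical, and sending the key as a bearer token is an assumption rather than something this page specifies.

```python
import os
from typing import Dict, Mapping


def brightdata_auth_headers(env: Mapping[str, str] = os.environ) -> Dict[str, str]:
    """Build request headers from an API key stored in the environment."""
    # BRIGHTDATA_API_KEY is a hypothetical variable name chosen for this sketch.
    api_key = env.get("BRIGHTDATA_API_KEY", "").strip()
    if not api_key:
        raise RuntimeError("BRIGHTDATA_API_KEY is not set")
    # Assumption: the key is sent as a bearer token.
    return {"Authorization": f"Bearer {api_key}"}
```

Keeping the key out of source code and workflow definitions makes it easier to rotate without touching anything else.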

## How agents use Bright Data

Inside a Definable workflow, Bright Data is one of the tools the **Distributor specialist** can call. Example coordination patterns:

- **Researcher → Bright Data** — the Researcher (GPT-5.5) pulls context from Bright Data (search results, unlocked page content, crawl or dataset snapshots), synthesises findings, and briefs the rest of the team.
- **Writer → Distributor → Bright Data** — the Writer (Claude Opus 4.7) drafts copy in brand voice, the Verifier passes it, then the Distributor writes the result into Bright Data (create record, post message, draft email).
- **Designer / Engineer → Distributor → Bright Data** — the Designer ships an asset or the Engineer ships a code change, the Distributor delivers it via Bright Data (attach file, open PR comment, post status).

The Verifier checks every Bright Data call. On rate limit, schema drift, or auth refresh it self-heals and retries — the workflow completes without manual intervention.
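
To make the retry behaviour concrete, here is a minimal sketch of a bounded retry with exponential backoff on rate limits. It is not the Verifier's actual implementation: `RateLimited` is a hypothetical exception that a tool wrapper might raise, and the attempt count and delays are arbitrary defaults. The hard cap on attempts matches the advice in the tool list to avoid aggressive retry loops.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimited(Exception):
    """Hypothetical error a tool wrapper might raise on a rate-limit response."""


def retry_with_backoff(
    action: Callable[[], T],
    max_attempts: int = 4,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Run action, retrying on RateLimited with exponentially growing delays."""
    for attempt in range(max_attempts):
        try:
            return action()
        except RateLimited:
            if attempt == max_attempts - 1:
                raise  # give up after the final attempt
            # Delay doubles each time: base_delay, 2x, 4x, ...
            sleep(base_delay * (2 ** attempt))
    raise ValueError("max_attempts must be at least 1")
```

Any other exception propagates immediately, so errors that a retry cannot fix (for example a rejected API key) surface right away instead of being retried.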

## Categories

- ai web scraping — https://definable.ai/apps/category/ai-web-scraping/
- developer tools — https://definable.ai/apps/category/developer-tools/

## Related

- HTML page: https://definable.ai/apps/brightdata/
- Same category (ai web scraping): https://definable.ai/apps/category/ai-web-scraping/
- All integrations: https://definable.ai/apps/
- Workflow (multi-agent loop): https://definable.ai/workflow/
- Apps llms.txt index: https://definable.ai/llms-apps.txt
