# Scrapingant AI integration on Definable

> ScrapingAnt is a web scraping API service that enables data extraction from websites through headless Chrome browsers, rotating proxies, CAPTCHA/Cloudflare bypass, LLM-ready markdown output, and AI-powered structured data extraction.

## What this connects

ScrapingAnt is a web scraping API service that enables data extraction from websites through headless Chrome browsers, rotating proxies, CAPTCHA/Cloudflare bypass, LLM-ready markdown output, and AI-powered structured data extraction.

Vendor: https://scrapingant.com

## Tools available

**9** tools available. First 9:

- `SCRAPINGANT_EXTRACT_CONTENT_AS_MARKDOWN` — Extract Content as Markdown — This tool extracts content from a given URL and converts it into Markdown format. It is particularly useful for preparing text for Language Learning Models (LLMs) and Retrieval-Augmented Generation (RAG) systems. It supports GET, POST, PUT, and DELETE methods.
- `SCRAPINGANT_EXTRACT_DATA_WITH_AI` — Extract Data with AI — This tool allows you to extract structured data from a web page using ScrapingAnt's AI-powered extraction capabilities. You provide a URL and an AI query (prompt) describing what data you want to extract, and the tool returns the extracted data in a structured format. It supports additional parameters for browser rendering, proxies, and cookies to handle dynamic content and localization.
- `SCRAPINGANT_GET_API_CREDITS_USAGE` — Get API Credits Usage — This tool retrieves the current API credit usage status for the authenticated ScrapingAnt account. It enables users to monitor their consumption of API credits, check their current usage against the subscription limits, and manage their API credits effectively.
- `SCRAPINGANT_GET_V1_USAGE` — Get V1 Usage (Deprecated) — [DEPRECATED - Use v2] Tool to get the current subscription status and API credits usage information. This is the legacy v1 endpoint which is no longer actively maintained.
- `SCRAPINGANT_SCRAPE_WEB_PAGE` — Scrape Web Page — This tool scrapes a web page using the ScrapingAnt API. It fetches the HTML content of the specified URL. Users can customize the scraping behavior by enabling a headless browser, using proxies, waiting for specific elements, executing JavaScript, passing cookies, and blocking certain resources.
- `SCRAPINGANT_SCRAPE_WEBPAGE_POST` — Scrape Webpage via POST — Tool to perform a POST request through ScrapingAnt's proxy to scrape a webpage. Use when you need to scrape pages that require POST method, such as form submissions or APIs that only accept POST requests. Data is forwarded transparently to the target web page.
- `SCRAPINGANT_SCRAPE_WEBPAGE_PUT` — Scrape Webpage with PUT — Tool to perform a PUT request through ScrapingAnt's proxy to scrape a webpage that requires PUT method. Use when the target webpage requires PUT method for data submission. Data is forwarded transparently to the target web page.
- `SCRAPINGANT_SCRAPE_WEBPAGE_V1_POST` — Scrape Webpage (v1 POST - Deprecated) — [DEPRECATED - Use v2] Tool to scrape a webpage using POST method with ScrapingAnt's v1 API. Returns JSON with content, cookies, and status_code. This is the legacy v1 endpoint which is no longer actively maintained. Use the v2 endpoints for new implementations.
- `SCRAPINGANT_SCRAPE_WITH_EXTENDED_JSON_OUTPUT` — Scrape with Extended JSON Output — Scrapes a web page and returns comprehensive data including HTML content, plain text, cookies, HTTP headers, XHR/Fetch requests, and iframe content. This tool uses ScrapingAnt's extended endpoint which provides much richer data than standard scraping: - Full HTML and extracted plain text content - All cookies and HTTP response headers from the target page - Captured XHR/Fetch API requests made by the page (useful for finding hidden APIs) - Content from embedded iframes Best used when you need more than just the HTML - such as analyzing cookies, headers, or JavaScript API calls made by a page. For simple HTML scraping, consider using the basic scrape tool instead for lower API credit usage.

## Auth

Auth schemes: `API_KEY`.

## How agents use Scrapingant

Inside a Definable workflow, Scrapingant is one of the tools the **Distributor specialist** can call. Example coordination patterns:

- **Researcher → Scrapingant** — the Researcher (GPT-5.5) pulls context from Scrapingant (records, threads, documents), synthesises findings, and briefs the rest of the team.
- **Writer → Distributor → Scrapingant** — the Writer (Claude Opus 4.7) drafts copy in brand voice, the Verifier passes it, then the Distributor writes the result into Scrapingant (create record, post message, draft email).
- **Designer / Engineer → Distributor → Scrapingant** — the Designer ships an asset or the Engineer ships a code change, the Distributor delivers it via Scrapingant (attach file, open PR comment, post status).

The Verifier checks every Scrapingant call. On rate limit, schema drift, or auth refresh it self-heals and retries — the workflow completes without manual intervention.

## Categories

- ai web scraping — https://definable.ai/apps/category/ai-web-scraping/
- developer tools — https://definable.ai/apps/category/developer-tools/

## Related

- HTML page: https://definable.ai/apps/scrapingant/
- Same category (ai web scraping): https://definable.ai/apps/category/ai-web-scraping/
- All integrations: https://definable.ai/apps/
- Workflow (multi-agent loop): https://definable.ai/workflow/
- Apps llms.txt index: https://definable.ai/llms-apps.txt
