Scrapegraph Ai

27 tools available

ScrapeGraphAI is an AI-powered web scraping API that enables developers to extract structured data from any website using natural language prompts. Website https://scrapegraphai.com

Connect Scrapegraph Ai to Definable to automate repetitive tasks, sync data with the rest of your stack, trigger on real-world events. Personas call Scrapegraph Ai's 27 tools directly from chat or scheduled flows. You can wire it into any persona to read, write, and react when an event fires.

Common workflows with Scrapegraph Ai

01

Trigger on event

  1. Listen for a new event
  2. Parse and validate the payload
  3. Take an action with the linked tool
02

Daily summary

  1. Pull yesterday's activity
  2. Summarise with the model of your choice
  3. Send the digest via email or Slack
03

Cross-tool sync

  1. Read records from one app
  2. Transform with a persona
  3. Write the result back via this integration

Scrapegraph Ai actions 27

Every Scrapegraph Ai action below is a callable tool any Definable persona can invoke.

Convert Webpage to Markdown (V2)

Tool to convert any webpage into clean, well-formatted Markdown with full parameter control. Use when you need advanced options like stealth mode, custom headers, or webhook notifications. Supports all Markdownify API parameters.

Generate Schema

Generate or modify a JSON schema based on a search query for structured data extraction. Use when you need a schema template for scraping specific data fields.

Get Agentic Scraper History

Retrieve paginated history of agentic scraper jobs. Use to view past scraping requests, their status, and results.

Get Crawler History

Retrieve the history of crawler jobs for your account. Returns paginated list of past crawler requests with their status, results, and metadata.

Get Credits

Retrieve remaining and used credits for your ScrapeGraphAI account. Useful for checking credit availability before bulk scraping operations to avoid mid-run failures.

Get Endpoint Suggestions

Tool to get AI-powered suggestions for creating scraping endpoints. Use when you need to identify what data can be extracted from a website and how to structure the scraping logic.

Get Live Session URL

Tool to get a URL for a live browser session. Use when you need to interact with a webpage in real-time through a controlled browser environment.

Get Markdownify History

Tool to retrieve the history of markdownify webpage-to-Markdown conversion jobs. Use when you need to view past markdownify requests and their statuses.

Get Scrape History

Retrieve the history of scrape jobs from your ScrapeGraphAI account. Use this to check the status of past scrapes, view results, and track credit usage.

Get Searchscraper History

Get the history of searchscraper jobs with pagination support. Use this to retrieve past searchscraper requests, their status, and results.

Get Sitemap History

Tool to retrieve the history of sitemap extraction jobs. Use when you need to view past sitemap extraction requests, their status, and results.

Get Smartscraper History

Tool to retrieve the history of smartscraper jobs. Use when you need to view past scraping requests and their results.

Get Usage Timeline

Tool to retrieve usage timeline statistics for your ScrapeGraphAI account. Use when you need to visualize or analyze service usage patterns over time.

Get Webhook Logs

Tool to retrieve webhook delivery logs for a crawler job. Use when you need to check the status and history of webhook notifications sent for a specific crawler execution.

List Scheduled Jobs

Retrieve a paginated list of all scheduled scraping jobs for your account. Use this action to view and manage your scheduled jobs, including their configuration, cron schedules, and active status. Supports filtering by service type and active status.

Markdownify Status

Check the status and retrieve results of a Markdownify webpage-to-Markdown conversion job. Use this action to poll for the status of an async Markdownify request started via SCRAPEGRAPH_AI_MARKDOWNIFY. Note: The ScrapeGraph AI API typically returns completed results synchronously, so this status endpoint is primarily useful for long-running conversions of large or complex webpages.

Save Endpoint Configuration

Tool to save custom scraping endpoint configurations to ScrapeGraphAI. Use when you need to create reusable scraping endpoints with specific parameters and extraction logic.

Search Scraper

Perform AI-powered web searches with structured, parsed results. Some sites block scrapers and return empty bodies; treat these as unrecoverable for that URL. JS-rendered pages may yield incomplete content.

Check SearchScraper Status

Check the status and results of an asynchronous SearchScraper job.

SmartCrawler Status

Check the status and retrieve results of a SmartCrawler web crawling job. Use this action to poll for completion and get the extracted content from a previously started SmartCrawler job. Returns the job status, crawled URLs, page content in markdown/HTML format, and LLM extraction results (if enabled). Implement a polling timeout (e.g., max retries or elapsed time cap) to avoid indefinite loops when waiting for long-running jobs.

Start Smart Scraper

Start AI-powered web scraping with natural language extraction prompts. When `wait` is false (default), returns a `request_id`; poll for results using SCRAPEGRAPH_AI_SMART_SCRAPER_STATUS. Check `error` and `job_status` fields in the response before using extracted data.

SmartScraper Status

Check the status and retrieve results of a SmartScraper web scraping job. Use this action to poll for completion after starting a SmartScraper job with wait=false. The request_id is returned by the Start SmartScraper action. Typical workflow: 1. Start a scraping job with SCRAPEGRAPH_AI_SMART_SCRAPER_START (wait=false) 2. Use the returned request_id to check status with this action 3. Poll until status is 'completed' or 'failed' 4. When completed, the 'result' field contains the extracted data. When completed, also check the 'error' field before consuming 'result', as 'failed' status populates 'error' instead of 'result'.

Start Smart Crawler (Async)

Tool to start a multi-page web crawl using SmartCrawler for AI-powered data extraction. Use when you need to extract structured data from multiple pages of a website. Returns immediately with a task_id - use the status check action to monitor progress and retrieve results.

Submit Feedback

Submit feedback and ratings for completed ScrapeGraphAI requests.

Submit Product Feedback

Submit product feedback for ScrapeGraphAI. Use to provide ratings, comments, suggestions, and other feedback about the product itself.

Convert JSON to TOON Format

Tool to convert JSON data to TOON (Token-Oriented Object Notation) format. Use when you need to reduce token usage for LLM processing while maintaining data structure.

Validate API Key

Validate your ScrapeGraphAI API key to ensure it is active and authorized. Use this action to check API key validity before making other API calls.

Frequently asked questions

What can I automate with Scrapegraph Ai on Definable?

Anything Scrapegraph Ai exposes through its API. Common ai web scraping workflows on Definable include automate repetitive tasks, sync data with the rest of your stack, trigger on real-world events. Personas can call any of the 27 Scrapegraph Ai tools directly, then chain the result into another integration without you writing code.

How does Scrapegraph Ai authentication work?

Scrapegraph Ai uses API_KEY on Definable. You connect once from the integrations page, scoped to the permissions you choose, and from then on any persona that has the integration enabled can act on your behalf. Tokens are encrypted at rest and rotated automatically.

Is the Scrapegraph Ai integration included in my Definable plan?

Yes — every Definable plan, including Starter, includes access to all 27 Scrapegraph Ai tools. You only need a separate Scrapegraph Ai subscription if Scrapegraph Ai itself charges per seat or per API call.

Is using Scrapegraph Ai through Definable secure?

Every call from a persona to Scrapegraph Ai is logged with the user, persona, prompt, and response. Tokens never leave Definable's secrets vault, scopes are configurable per persona, and you can revoke access at any time from the integration page.

How do I get started with Scrapegraph Ai on Definable?

Sign up for Definable, open the integrations page, find Scrapegraph Ai, and connect via OAuth or API key. You can immediately attach Scrapegraph Ai to any persona and start running workflows. The free Starter plan includes 5,000 credits/month.

What Scrapegraph Ai actions does Definable expose?

Definable exposes all 27 Scrapegraph Ai actions as callable tools — including `Convert Webpage to Markdown (V2)`, `Generate Schema`, `Get Agentic Scraper History`, plus 24 more. Each tool gets a typed parameter schema so personas know exactly how to call it.

Ready to automate with Scrapegraph Ai?

Wire it up in minutes. No coding required.

← All integrations