Website Crawling for AI Knowledge Base | Definable AI
Real-time website crawling for your Knowledge Base. Auto-ingest pages, monitor changes, and keep your AI context up to date with the latest content from any public website.
Features
- Recursive crawling follows internal links to capture entire sites
- Scheduled re-crawling to detect and ingest content changes
- Respects robots.txt and rate limiting for responsible crawling
- JavaScript rendering support for dynamic single-page apps
- Selective page filtering with URL patterns and depth limits
- Automatic content extraction strips navigation, ads, and boilerplate