Skip to content
GitHub Universe 2025
Explore 100+ talks, demos, and workshops at Universe 2025. Choose your favorites.
#

crawl4ai

Here are 58 public repositories matching this topic...

The Web Metadata Extraction Toolkit is designed to streamline the process of extracting, cleaning, and analyzing metadata from websites. Utilizing advanced AI models and custom extraction strategies, this toolkit helps users efficiently gather data like titles, descriptions, and keywords, which are crucial for SEO and content strategy.

  • Updated Jul 8, 2024
  • Python

A feature-rich web application for automated news scraping and summarization. It allows users to enter article URLs, fetches the full content, and generates concise summaries. The system supports both local inference with custom models and remote deployment via FastAPI or Streamlit interfaces.

  • Updated May 20, 2025
  • Python

🕷️ A lightweight Model Context Protocol (MCP) server that exposes Crawl4AI web scraping and crawling capabilities as tools for AI agents. Similar to Firecrawl's API but self-hosted and free. Perfect for integrating web scraping into your AI workflows with OpenAI Agents SDK, Cursor, Claude Code, and other MCP-compatible tools.

  • Updated Aug 16, 2025
  • Python

AI Scraper : scrap and extract data from website in any format (CSV, JSON, HTML...) using Selenium or Crawl4ai, and using Ollama or Sambanova API, and using Streamlit for UI as chatbot

  • Updated May 22, 2025
  • Python

AI Web Crawler is a powerful, AI-powered web crawler that extracts product information from e-commerce websites and downloads associated PDF documents. Built with modern Python technologies and featuring intelligent pagination handling, duplicate detection, and advanced PDF processing.

  • Updated Sep 20, 2025
  • Python

Improve this page

Add a description, image, and links to the crawl4ai topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the crawl4ai topic, visit your repo's landing page and select "manage topics."

Learn more