Skip to content
@apify

Apify

We're making the web more programmable.

Apify Banner

Apify is the largest ecosystem where developers build, deploy, and publish data extraction and web automation tools. We call them Actors.

Learn About Apify 🧑‍🎓

  • Find hundreds of ready-made Actors for your web scraping or automation project on Apify Store.
  • Learn everything about web scraping and automation with our free courses that will turn you into an expert scraping developer.
  • Publish your web scrapers as paid Actors on the Apify platform, attract people who need these solutions, and get regular passive income.
  • View our livestreams and video content at the Apify YouTube channel.
  • Learn more through tutorials and thought leadership content about web scraping on Apify Blog and Crawlee Blog.

We are hiring! 🕸️

Check out the open positions at Apify and help us make the web more programmable.

Pinned Loading

  1. crawlee-python Public

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Wo…

    Python 5.4k 365

  2. crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, an…

    TypeScript 17.3k 777

  3. apify-cli Public

    Apify command-line interface helps you create, develop, build and run Apify actors, and manage the Apify cloud platform.

    TypeScript 136 21

  4. proxy-chain Public

    Node.js implementation of a proxy server (think Squid) with support for SSL, authentication and upstream proxy chaining.

    JavaScript 898 146

  5. got-scraping Public

    HTTP client made for scraping based on got.

    TypeScript 640 50

  6. fingerprint-suite Public

    Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.

    TypeScript 1.3k 126

Repositories

Showing 10 of 156 repositories
  • actor-mastra-mcp-agent Public

    🤖 AI agent using mastra.ai with Apify MCP Server. 🚀 Runs queries via OpenAI models, taps Apify Actors for web data, and outputs to datasets. 🛠️

    4 Apache-2.0 1 2 1 Updated Mar 26, 2025
  • actor-inspector-agent Public

    Actor Inspector Agent is an Apify Actor designed to evaluate and rate other Apify Actors based on criteria such as documentation quality, input clarity, code standards, functionality, performance, and uniqueness.

    Python 4 Apache-2.0 0 0 1 Updated Mar 26, 2025
  • apify-docs Public

    This project is the home of Apify's documentation.

    API Blueprint 33 Apache-2.0 88 80 (2 issues need help) 14 Updated Mar 26, 2025
  • rag-web-browser Public

    RAG Web Browser is an Apify Actor to feed your LLM applications and RAG pipelines with up-to-date text content scraped from the web.

    TypeScript 35 Apache-2.0 7 9 3 Updated Mar 26, 2025
  • apify-sdk-python Public

    The Apify SDK for Python is the official library for creating Apify Actors in Python. It provides useful features like actor lifecycle management, local storage emulation, and actor event handling.

    Python 128 Apache-2.0 11 11 2 Updated Mar 26, 2025
  • fingerprint-suite Public

    Browser fingerprinting tools for anonymizing your scrapers. Developed by Apify.

    TypeScript 1,274 Apache-2.0 126 18 4 Updated Mar 26, 2025
  • crawlee Public

    Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

    TypeScript 17,256 Apache-2.0 777 140 (1 issue needs help) 20 Updated Mar 26, 2025
  • crawlee-python Public

    Crawlee—A web scraping and browser automation library for Python to build reliable crawlers. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with BeautifulSoup, Playwright, and raw HTTP. Both headful and headless mode. With proxy rotation.

    Python 5,446 Apache-2.0 365 77 8 Updated Mar 26, 2025
  • apify-client-python Public

    Apify API client for Python

    Python 59 Apache-2.0 12 9 0 Updated Mar 26, 2025
  • actor-whitepaper-web Public

    Documentation site for the Actor Programming Model – a fresh take on serverless microapps. Built with Astro.

    MDX 2 MIT 0 4 11 Updated Mar 26, 2025