Agenty/scrapingai

scrapingai

Extract data from websites automatically with AI or build web scraping agents for bulk URL scraping.

Installation

Install it via npm:

npm i scrapingai

Highlights

  • Built-in residential proxies and captcha handling
  • Smart ad blocker and popup blocker for better performance
  • Automatic acceptance of cookie consent to close cookie banners
  • Compatible with Puppeteer and Playwright for browser automation and testing
  • Background jobs for bulk URL scraping with automatic retry and error handling

Usage

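Get your API key from Agenty, then create a client. The snippets in this section assume that the Agenty class has already been imported from the package and that API_KEY holds your key.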

const agenty = new Agenty(API_KEY);
const data = await agenty.browser.extract("https://example.com");
console.log(data);
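
Network calls can fail, so it may help to wrap them. The following is a minimal sketch, not part of the library: `safeExtract` is a hypothetical helper, and `fakeClient` is a stand-in that only mimics the `browser.extract` method shown above so the example runs without an API key. The shape of the real response is not documented here, so the stub's return value is an assumption.

```javascript
// Hypothetical helper (not part of the library): turns a rejected call into
// a plain result object so the caller can branch on `ok`.
async function safeExtract(client, url) {
  try {
    const data = await client.browser.extract(url);
    return { ok: true, url, data };
  } catch (err) {
    const message = err instanceof Error ? err.message : String(err);
    return { ok: false, url, error: message };
  }
}

// Stand-in client for illustration only. In real use, pass the object
// created with `new Agenty(API_KEY)` instead.
const fakeClient = {
  browser: {
    async extract(url) {
      if (!url.startsWith("https://")) throw new Error("unsupported URL");
      return { url, title: "Example" }; // assumed response shape
    },
  },
};

async function main() {
  console.log(await safeExtract(fakeClient, "https://example.com"));
  console.log(await safeExtract(fakeClient, "ftp://example.com"));
}
main();
```

Because the helper only relies on `client.browser.extract`, the same pattern should apply to the other `browser` methods in this README by swapping the method name.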

Extract

To auto-extract products, job listings, SEO metadata, schema JSON, and similar structured data from a given URL:

const data = await agenty.browser.extract("https://example.com");
console.log(data);
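
For a handful of URLs, the same call can be repeated client-side. This README does not show the API for the library's background jobs, so the sketch below does not use it; it simply calls `browser.extract` in small batches with `Promise.allSettled`. The names `extractMany`, `batchSize`, and `stubClient` are hypothetical, and the stub's response shape is an assumption.

```javascript
// Hypothetical helper (not part of the library): extracts several URLs in
// batches, preserving input order and recording failures per URL.
async function extractMany(client, urls, batchSize = 3) {
  const results = [];
  for (let i = 0; i < urls.length; i += batchSize) {
    const batch = urls.slice(i, i + batchSize);
    // allSettled waits for every call in the batch, even if some reject.
    const settled = await Promise.allSettled(
      batch.map((url) => client.browser.extract(url))
    );
    settled.forEach((outcome, j) => {
      if (outcome.status === "fulfilled") {
        results.push({ url: batch[j], data: outcome.value });
      } else {
        results.push({ url: batch[j], error: String(outcome.reason) });
      }
    });
  }
  return results;
}

// Stand-in client for illustration only; it rejects any URL containing "bad".
const stubClient = {
  browser: {
    async extract(url) {
      if (url.includes("bad")) throw new Error("request failed");
      return { url }; // assumed response shape
    },
  },
};

extractMany(stubClient, [
  "https://example.com/a",
  "https://example.com/bad",
  "https://example.com/b",
  "https://example.com/c",
]).then((results) => console.log(results));
```

A small batch size keeps the number of simultaneous requests low; a suitable value depends on your plan's rate limits, which are not covered here.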

Scrape

To extract data from a page using a given CSS selector or a custom jQuery function:

const data = await agenty.browser.scrape("https://example.com");
console.log(data);

Screenshot

To capture a screenshot of a given URL:

const data = await agenty.browser.screenshot("https://example.com");
console.log(data);

PDF

To convert a webpage into a PDF:

const data = await agenty.browser.pdf("https://example.com");
console.log(data);

Content

To get the HTML content of a URL:

const data = await agenty.browser.content("https://example.com");
console.log(data);

License

scrapingai is a project by Agenty, released under the MIT License.

About

Build AI-powered web scraping agents with Agenty to auto-extract data from websites, capture screenshots, generate PDFs from URLs, and crawl the web.
