
Scrapfly SDK

npm install scrapfly-sdk

TypeScript/NodeJS SDK for the Scrapfly.io web scraping API, which allows you to:

  • Scrape the web without being blocked.
  • Use headless browsers to access JavaScript-powered page data.
  • Scale up web scraping.
  • ... and much more!

For web scraping guides, see our blog and the #scrapeguide tag for how to scrape specific targets.

Quick Intro

  1. Register a Scrapfly account for free
  2. Get your API Key on scrapfly.io/dashboard
  3. Start scraping: 🚀
import { ScrapflyClient, ScrapeConfig } from 'scrapfly-sdk';

const key = 'YOUR SCRAPFLY KEY';
const client = new ScrapflyClient({ key });
const apiResponse = await client.scrape(
    new ScrapeConfig({
        url: 'https://web-scraping.dev/product/1',
        // optional parameters:
        // enable javascript rendering
        render_js: true,
        // set proxy country
        country: 'us',
        // enable anti-scraping protection bypass
        asp: true,
        // set residential proxies
        proxy_pool: 'public_residential_pool',
        // etc.
    }),
);
console.log(apiResponse.result.content); // html content
// Parse HTML directly with SDK (through cheerio)
console.log(apiResponse.result.selector('h3').text());
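
Because client.scrape returns a promise, several pages can be fetched concurrently with plain Promise.all. Below is a minimal sketch building on the example above; the product URLs and the h3 selector are illustrative:

import { ScrapflyClient, ScrapeConfig } from 'scrapfly-sdk';

const client = new ScrapflyClient({ key: 'YOUR SCRAPFLY KEY' });

// scrape several product pages in parallel (illustrative URLs)
const urls = [
    'https://web-scraping.dev/product/1',
    'https://web-scraping.dev/product/2',
    'https://web-scraping.dev/product/3',
];
const results = await Promise.all(
    urls.map((url) => client.scrape(new ScrapeConfig({ url }))),
);
for (const response of results) {
    // extract the product title with the built-in cheerio selector
    console.log(response.result.selector('h3').text());
}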

For more examples, see the /examples directory.
For more on the Scrapfly API, see our getting started documentation. For Python, see the Scrapfly Python SDK.

Debugging

To enable debug logs, set Scrapfly's log level to "DEBUG":

import { log } from 'scrapfly-sdk';

log.setLevel('DEBUG');

Additionally, set debug=true in ScrapeConfig to access debug information in the Scrapfly web dashboard:

import { ScrapeConfig } from 'scrapfly-sdk';

new ScrapeConfig({
    url: 'https://web-scraping.dev/product/1',
    debug: true,
    // ^ enable debug information - this will show extra details on web dashboard
});
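
Putting both together, a minimal runnable sketch (using the same example URL as above):

import { ScrapflyClient, ScrapeConfig, log } from 'scrapfly-sdk';

log.setLevel('DEBUG'); // verbose SDK logs in the console

const client = new ScrapflyClient({ key: 'YOUR SCRAPFLY KEY' });
const apiResponse = await client.scrape(
    new ScrapeConfig({
        url: 'https://web-scraping.dev/product/1',
        debug: true, // attach debug details to the web dashboard entry
    }),
);
console.log(apiResponse.result.content);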

Development

Install and set up the environment:

$ npm install

Build and test:

$ npm run build
$ npm run tests