Skip to content

jakwuh/node-scrapper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

node-scrapper

Non-blocking node.js scrapper.
Inspired by scrappy.

Usage

import {crawl} from 'node-scrapper';

function* parse(browser, spider) {
    let source = yield browser.getText('#my-element');
    let pictureUrl = yield browser.getAttribute('#my-second-element', 'href');
    spider.get(pictureUrl, parsePicture);
}

function* parsePicture(browser, spider) {
    let imageUrl = yield browser.getAttribute('img.logo', 'src');
    saveToDB(imageUrl);
}

crawl(spider => {
    spider.get('http://example.com', parse);
});

About

Non-blocking node.js scrapper

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages