Skip to content

DoraMatching/crawler-tools

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Crawler Tools

A Tool crawl all articles on forums in Vietnam such as Viblo, Kipalog, DayNhauHoc,... written in Javascript programing language using Puppeteer - Chromium Web Driver.

Install Yarn and Sequelize-cli

[sudo] npm install yarn sequelize-cli -g

Clone project

git clone https://github.com/DoraMatching/crawler-tools.git

Install dependencies

cd crawler-tools
yarn install

Migration

yarn db:migrate

OR - remove old DB and re-migrate

yarn db:new # this command will remove developement.db file & migrate DB !

Start crawling data

yarn start