Indonesia Index News Crawler, including 10 online media

harryandriyan/warta-scrap

warta-scrap

Indonesia Index News Crawler, covering 10 online media outlets.

Online Media List:

Installation:

Open a terminal and clone this repo:

git clone https://github.com/harryandriyan/warta-scrap

Go to the project folder:

cd warta-scrap

Set up a virtualenv:

virtualenv venv

Activate the virtualenv:

. venv/bin/activate

Install the requirements:

pip install -r requirements.txt
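
To confirm the installation steps worked, a small check like the following may help. It is only a sketch: the helper names `in_virtualenv` and `has_module` are hypothetical, and it assumes that `requirements.txt` installs Scrapy (the usage command relies on the `scrapy` CLI).

```python
import importlib.util
import sys


def in_virtualenv():
    """Return True if the interpreter appears to run inside a virtual environment."""
    # Older virtualenv releases set sys.real_prefix; newer ones (and venv)
    # make sys.prefix differ from sys.base_prefix.
    base = getattr(sys, "base_prefix", sys.prefix)
    return hasattr(sys, "real_prefix") or sys.prefix != base


def has_module(name):
    """Return True if a top-level module with this name can be found."""
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    # Assumption: Scrapy is among the packages listed in requirements.txt.
    print("virtualenv active:", in_virtualenv())
    print("scrapy importable:", has_module("scrapy"))
```

Run it with the virtualenv activated; if either line reports `False`, repeat the activation or install step.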

How to use

Change into the folder of the specific crawler project, for example:

cd republika

Run the crawl command, for example:

scrapy crawl republika -o sampleResult.json -t json
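
Once the crawl finishes, the exported file can be inspected with a short script. The sketch below assumes the file is a single JSON array of items, which is what Scrapy's JSON exporter normally produces on a fresh output file. The item field names are not documented here, so the script only counts whatever fields it finds. `load_items` and `field_counts` are hypothetical helper names.

```python
import json
from collections import Counter
from pathlib import Path


def load_items(path):
    """Load the list of scraped items from a JSON export."""
    with Path(path).open(encoding="utf-8") as f:
        items = json.load(f)
    if not isinstance(items, list):
        raise ValueError("expected a JSON array of items")
    return items


def field_counts(items):
    """Count how many items contain each field name."""
    counts = Counter()
    for item in items:
        counts.update(item.keys())
    return counts


if __name__ == "__main__":
    # File name taken from the crawl command above.
    path = Path("sampleResult.json")
    if path.exists():
        items = load_items(path)
        print(f"{len(items)} items")
        for field, n in field_counts(items).most_common():
            print(f"{field}: {n}")
    else:
        print(f"{path} not found; run the crawl first")
```

If the same output file is reused across several runs, the appended result may no longer be valid JSON, so it is safer to delete the file or pick a new name before each crawl.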