This the link to already extracted data corpus from daraz.com.np
https://github.com/nOOBIE-nOOBIE/Daraz-online-shopping-data-corpus/
Operating System: Linux
Language: python3
libraries: Scrapy
This scraper was purely built for research purpose.
Please note that you might need to make some changes to the scraper if in future the interface of the daraz.com.np is changed.( the scraping is totally based on CSS)
- clone this repository to your computer
- Launch terminal
- Navigate to the folder with file scrapy.cfg
- Enter this code
scrapy crawl darazall -o data.csv
This is the sample code
scrapy crawl darazall -o data.csv
- *It will simply store all the scraped data into the data.csv file
- Product
- Category
- Brand
- Price
- Seller_name
- Average_Product_rating
- Buyer_comments
- Buyer_comment_title
- Buyer_product_review