Skip to content

This Scraper will extract all the product information from daraz.com.np. This Project is shared to assist Researchers in their projects.

License

Notifications You must be signed in to change notification settings

amitupreti/Web-scraper-for-ecommerce-website-daraz.com.np

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 

Repository files navigation

Web scraper for ecommerce website daraz.com.np

If this violates any copyright or raises any legal issues. Please inform me. It will be removed.

This the link to already extracted data corpus from daraz.com.np https://github.com/nOOBIE-nOOBIE/Daraz-online-shopping-data-corpus/

Notes:

 Operating System: Linux
 Language:         python3
 libraries:        Scrapy 

 This scraper was purely  built for research purpose. 

you may need to edit the code at daraz/daraz/spiders/darazall.py. If the website changes in future

Please note that you might need to make some changes to the scraper if in future the interface of the daraz.com.np is changed.( the scraping is totally based on CSS)

Guides to use the scrapper

  1. clone this repository to your computer
  2. Launch terminal
  3. Navigate to the folder with file scrapy.cfg
  4. Enter this code scrapy crawl darazall -o data.csv

This is the sample code

scrapy crawl darazall -o data.csv

Explanation of code :

  • *It will simply store all the scraped data into the data.csv file

Following data will be extracted

  • Product
  • Category
  • Brand
  • Price
  • Seller_name
  • Average_Product_rating
  • Buyer_comments
  • Buyer_comment_title
  • Buyer_product_review

About

This Scraper will extract all the product information from daraz.com.np. This Project is shared to assist Researchers in their projects.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages