This is a crawler that uses Scrapy to crawl bol.com. It can be installed as a Python package from PyPI.
The Crawler instance has two methods you can use, crawl_products and crawl_category. See the example below.
from bol_crawler.crawler import Crawler
crawler = Crawler()
# to crawl products
products = crawler.crawl_products(
    [
        'https://www.bol.com/nl/p/lg-34gl750-b-ultragear-gaming-monitor/9200000115819731',
    ]
)
# to crawl a category
products = crawler.crawl_category(
    [
        # The second value is how many times to paginate; 0 crawls only the first page.
        'https://www.bol.com/nl/l/gaming-toetsenborden/N/18214/', 0
    ]
)
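
When you have a mixed list of bol.com URLs, it can help to sort them before deciding which of the two methods to call. The sketch below is not part of bol_crawler; classify_bol_url and split_urls are hypothetical helper names. It assumes that product pages carry a 'p' path segment right after the language code and category listings carry an 'l' segment, which is only inferred from the two example URLs above and may not hold for every page on the site.

```python
from urllib.parse import urlparse


def classify_bol_url(url):
    """Return 'product', 'category', or 'unknown' for a bol.com URL.

    Assumption: the second path segment is 'p' for product pages and
    'l' for category listings, as in the example URLs in this README.
    """
    segments = [s for s in urlparse(url).path.split('/') if s]
    marker = segments[1] if len(segments) > 1 else ''
    if marker == 'p':
        return 'product'
    if marker == 'l':
        return 'category'
    return 'unknown'


def split_urls(urls):
    """Group URLs into (product_urls, category_urls); unknown URLs are skipped."""
    product_urls, category_urls = [], []
    for url in urls:
        kind = classify_bol_url(url)
        if kind == 'product':
            product_urls.append(url)
        elif kind == 'category':
            category_urls.append(url)
    return product_urls, category_urls


product_urls, category_urls = split_urls([
    'https://www.bol.com/nl/p/lg-34gl750-b-ultragear-gaming-monitor/9200000115819731',
    'https://www.bol.com/nl/l/gaming-toetsenborden/N/18214/',
])
```

The resulting product_urls list has the same shape as the argument passed to crawl_products above. Each category URL would still need to be paired with a pagination count, following the crawl_category call shown above.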