Rozetka Scraper

This small scraper fetches product data (title, price, old price, discount, url) from a Rozetka category page and saves the results to products.json.

Usage

Install dependencies:
- requests
- beautifulsoup4
- lxml
- (optional) fake-useragent
Example: pip install requests beautifulsoup4 lxml fake-useragent
Run the script: python script.py

Output

products.json: JSON file with structure: { "products": [ { "name": "...", "price": "...", "old_price": "...", "discount": "...", "url": "..." }, ... ] }

Notes

The scraper uses a User-Agent header. If fake_useragent is not available, a stable fallback is used.
The script includes basic error handling:
- Network errors return an empty product list and log an error.
- Parsing errors return an empty product list and log an error.
- File write errors are logged.
To adapt selectors or classes, update the find_all calls in rozetka_scraper.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.gitignore		.gitignore
README.md		README.md
products.json		products.json
script.py		script.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Rozetka Scraper

About

Uh oh!

Releases

Packages

Languages

psy-ger/Scraping_site_Rozetka_using_Python

Folders and files

Latest commit

History

Repository files navigation

Rozetka Scraper

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages