Simple Amazon Author Scraper

Install the following packages:

selenium: To automate scraping. You can download using pip through command line as:
```
pip install selenium
```
webdriver-manager: Install the chrome driver inplace so no need to download explicitly. You can download it through command line as:
```
 pip install webdriver_manager
```
pandas: For file manipulation (saving data to csv). You can download using:
```
 pip install pandas
```
word2number: Convert words to numbers. You can download using pip through command line as:
```
pip install word2number
```
Currently this script works on Chrome browser.

File structure:
--AuthorProfileConfigConfig.py: Contains user-defined functions to retrieve data.
--DriverSetup.py: Defines and initiate webdriver object of selenium.
--main.py: Run this file to scrape data for author profile.

--ProductMain.py: Run this file to scrape data for all the subprodcuts related to each author.

To run:

run main.py. Data will be scraped from main_product folder containing all the main product data.
Data will be stored in reviewers folder.

run ProductMain.py. Data will be scraped from reviewers folder containing all the author profile data. Data will be store in reviews folder. For example: \data_scraping_v2\

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
AuthorProfileConfig.py		AuthorProfileConfig.py
DriverSetup.py		DriverSetup.py
ProductMain.py		ProductMain.py
README.md		README.md
Requirements.txt		Requirements.txt
main.py		main.py
readme.txt		readme.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Simple Amazon Author Scraper

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

Guide-Analytics/simple_amazon_author_scraper

Folders and files

Latest commit

History

Repository files navigation

Simple Amazon Author Scraper

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages