Skip to content

Specific LinkedIn scraper with Selenium and Pandas

License

Notifications You must be signed in to change notification settings

huenique/li-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Li Scraper

A LinkedIn scraper written to scrape specific parts of a user's profile and organization.

Logic Sequence

  • Takes the name, surname, company from a pandas dataframe row (.csv file) and
  • Finds the LinkedIn profile with these three fields (Google search for public profiles).
  • From the profile, we take: URL, Title, Current Role, Current Company, Company Location.
  • Take contact information; Website, Twitter, Email.
  • From the company the person is currently under: Industry, Organization URL, Company size and Specialties.
  • Finally, we add the scraped information to the dataframe columns.

Setup

From the package's root directory, run:

pip install -e .

Usage

from src.settings import Settings

email = 'myemail@email.com'
password = 'mypassword'
file = 'dataframe.csv'
driver_path = r'C:\Program Files\webdrivers\msedgedriver.exe'
scraper = Settings.start_scraper(email, password, file, driver_path)

About

Specific LinkedIn scraper with Selenium and Pandas

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages