Skip to content

devrohaan/kick-off-web-scraping-python-selenium-beautifulsoup

Repository files navigation

Wisdomic Panda Wisdomic Panda

Hold the Vision, Trust the Process.

Beginner's guide to Web Scraping using Beautifulsoap, Selenium and python!

... a technique used for extracting data from web/websites.

All Minds Meet’ 2018.

☕ Ingredients:

  • python
  • selenium
  • PhantomJS
  • beautifulsoap
  • requests
  • pandas
  • tabulate
  • Spyder IDE
  • Ubuntu 16.4 LTS

🚧 Table of Contents:

  1. Setup your local environment: Cookbook

❗ I run on Mac OS/Ubuntu so you might have to slightly modify the code to make it work in your env.

  1. Go through this for quick insights: Handbook

  2. Get hands on: Kick-off

  3. Examples:

    4.1 Glassdoor_jobs

    4.2 Pablo_quotes

    4.3 Premier_League_score_table

    4.4 Bhagavad Gita Lessons

    4.5 Akbar_Birbal_Stories

Hey Buddy!

This repository explains the rationale for web scraping in python. I have implemented few basic examples using selenium, have a dekko at it! This repo covers approximately 1% of the entire python web scraping. My motive is to get you familiar with the tools that python provides if you forsee your career as a Data Engineer. If you have any suggestions for more commands that should be on this page, let me know or consider submitting a pull request so others can benefit from your work. Thank you very much for reaching out! Please follow if you find it handy and hit ⭐ to get more kick-off repo updates.

📧 Drop In!! Seriously, it'd be great to discuss Technology.

Take risks in your life, If you win, you can lead! If you loose, you can guide! - Swami Vivekananda