Web Scraping with Beautiful Soup — A Use Case

You can find the Towards Data Science article at https://towardsdatascience.com/web-scraping-with-beautiful-soup-a-use-case-fc1c60c8005d

Intro

In this notebook, I will give a brief introduction to obtaining data from a webpage, i.e., web scraping, using Python and libraries such as Requests to get the data and Beautiful Soup to parse it. Web scraping becomes necessary when a website does not have an API, or one that suits your needs.

As an example, I use a webpage that has a consistent HTML structure, but this approach can be generalized. While there are some frameworks, such as Scrapy, that can provide such service, I decided to this as a learning experience.

The Use Case

A not-for-profit organization wants to reach out to the Community Foundations of Canada (CFC) sites across the nation. They asked me to find each contact person and their mailing address, and put all the information in a special format in a spreadsheet.

Doing this task manually, by copy-pasting each required field into the spreadsheet, would mean doing this 195 (foundations) * 11 (fields) = 2145 times! So my next thought was to automate the procedure by scraping the CFC website.

Here is the code used to scrape their website, get the requested information, and write it in the format requested into a CSV file.

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
data		data
.gitignore		.gitignore
LICENSE		LICENSE
README.MD		README.MD
scrapingDSMeetUp_v2.pdf		scrapingDSMeetUp_v2.pdf
scrapingMailingAddresses-shortExplained.ipynb		scrapingMailingAddresses-shortExplained.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web Scraping with Beautiful Soup — A Use Case

Intro

The Use Case

About

Releases

Packages

Languages

License

brodriguezmilla/WebScrapingCFCBS4

Folders and files

Latest commit

History

Repository files navigation

Web Scraping with Beautiful Soup — A Use Case

Intro

The Use Case

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages