Skip to content

I scrapped all David Bowie's Lyrics, and count the unique words per song.

Notifications You must be signed in to change notification settings

mbbau/Web-scrapper-of-bowie-lyrics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Web scrapper of bowie lyrics

This repository contains my final project for the "Python for Everybody Specialization" from coursera. Here you will find the code I wrote to scrappe all David Bowie's Lyrics using BeautifulSoup, and count the unique words per song, creating a data base with sqllite in the process to store all the data gathered.

I have divided the process into two different scripts, in order to make it more efficient.

1 - The first one, named "Initial Scrape", takes all the song names of the page www.lyrics.com and the links to their lyrics and place them in a Data Base called dbowie.sqlite.

2 - The second one, named lyrics extractor, goes link by link collecting the lyrics for all the songs of the data base, and counting the words used by David Bowie in his songs, to give the main 10 words.

There is a version called proyecto that goes through all the process.

The 10 most used words were:

[('down', 1114), ('little', 1126), ("don't", 1245), ('with', 1251), ('dance', 1302), ('like', 1335), ('that', 1477), ("it's", 1698), ('love', 1795), ('your', 2515)]

wordcloud Bowie

About

I scrapped all David Bowie's Lyrics, and count the unique words per song.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages