Skip to content

A repository for the Data Management and Data Visualization course project.

Notifications You must be signed in to change notification settings

malborroni/Just-Repeat-Hit

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

a Data Management and Data Visualization Project

Overview   |   Visualizations   |   References   |   Data   |   Word Cloud   |   Presentation   |   About us  

☍   Overview

The musical world of recent years seems to be characterized by increasingly repetitive, obvious and banal hits. This project, therefore, aims to measure the repetitiveness of the songs, analyzing them from a textual point of view, with the aim of assessing the existence or otherwise of certain trends.
The approach used exploits the calculation of an indicator, called the "Repetitiveness Index", which corresponds to the complement of the relationship between the unique words within a text, ie those words that are present once and only once, and the total words of the same.
Greater attention was paid to the study of the time course of the average Repetitiveness Index and to the analysis of the same index, focusing mainly on an aggregation based on the musical genre.

Main goals:

  • to identify the existence of a trend in the index;
  • to identify which musical genres were more or less repetitive, also trying to understand how the lyrics of the songs could influence these results.

☍   Visualizations

Through the use of the Tableau software, some infographics have been proposed that allow you to obtain a more detailed insight on the results, with the aim of understanding them better.


To access the interactive version of the infographics, click on the images above or on the following link, which leads to a page of Tableau Public:

☍   References

[1] Guy Harrison (2015), Next Generation Databa-ses: NoSQL and Big Data, Apress, Berkely (CA), USA, 1st ed

☍   Data

The data necessary for the development of what is described in the Overview has been obtained through the implementation of some scripts in Python language; these were mainly used to make specific requests through the Spotify and Genius APIs, which allowed to obtain, respectively, a list of artists with all the characteristics of the lyrics of the songs for each artist (lyrics) with adjoining audiometric peculiarities or, more simply, information relating to the artist or relating to each song.

☍   Word Cloud

In order to explore the words used in the songs of the analysed artists, one Word Cloud for each artist was created.
Three examples can be seen below:

☍   Presentation

Our slides presentation is available in the Slides folder.
Here we show only the cover:

☍   About us

⊜   Alessandro Borroni

  • Current Studies: Data Science M.Sc. Student at Università degli Studi di Milano-Bicocca (UniMiB);
  • Background: Bachelor degree in Business Economics at Università degli Studi di Milano-Bicocca (UniMiB).

⊜   Mirko Giugliano

  • Current Studies: Data Science M.Sc. Student at Università degli Studi di Milano-Bicocca (UniMiB);
  • Background: Bachelor degree in Marketing, Business Communication and Global Markets at Università degli Studi di Milano-Bicocca (UniMiB).

⊜   Angela Prade

  • Current Studies: Data Science M.Sc. Student at Università degli Studi di Milano-Bicocca (UniMiB);
  • Background: Bachelor degree in Information Technology at Università di Trento.

Releases

No releases published

Packages

No packages published