This repository contains part of the code and datasets that were used for the medium article The Emoji developers use most.
I used Python and the Gitter API to get the messages from the freeCodeCamp main chat room. Python libraries like multiprocessing
and emoji
were used to transform the data. Part of the transformations also required data available online, for which I made customized scrapers also with Python libraries (requests
, urllib
, BeautifulSoup4
). To analyze the data I used plain Python and some pandas
. Explorative visualizations were made using matplotlib
while the interactive ones where made in D3.js
.
d3.js visualizations can be found at:
Regarding the raw datasets used for this project they are now available on the freeCodeCamp’s Kaggle account.
The motivation of this project adheres to the mission of the freeCodeCamp’s Open Data Initiative. A big thanks to the people in the freeCodeCamp DataScience room and specially to mstellaluna in helping with this project.