-
-
Notifications
You must be signed in to change notification settings - Fork 74
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Better genre detection and track recommendation #ML #data #83
Comments
Examples that could be applied to Openwhyd:
#deeplearning |
This is a rather exciting feature to add! |
Hi @Marinlemaignan ! I'd be happy to replace plTags.js when we have a fully-functional solution that is better than the current one, while maintaining:
One way we could transition gently to a new system:
What do you think? |
/cc @florentpietot |
I have experimented extensively with discog's API. It's very complete, extremely promising but ... the number of request is of 60 requests .. per minute. https://www.discogs.com/developers/#page:home,header:home-rate-limiting There is no way to go around this. A partnership would be the only solution and I doubt that they would be attracted by a partnership that does not bring them anything. What we could do is identify albums and point to their products/sellings. They would not split in such a big showcase as openwhyd. A solution is to host their database. There is docker images to download their monthly dump and index it in mongodb. But even then a few other problems arise :
A solution that I studied would be scraping ... but they wouldn't like it and what a dirty solution. I'm not saying it's impossible, just that it's not a bulletproof approach. |
Thanks for sharing our ideas and notes with us, @SkinyMonkey ! |
WIP:
Florent Piétot is currently analysing Openwhyd's data set, and thinking of ways to leverage it (e.g. use clustering and/or machine learning techniques for better genre detection and music recommendation). |
During a "Hackergarten" meetup in Paris, Mihangy, Damien and I wrote a python script that turns 👉 2c095d2 The goal was to provide a starting point for the development of a music recommendation algorithm based on Openwhyd's playback logs, while preserving the privacy of its users. (i.e. data anonymisation) Next steps:
|
The data science cheatsheets provided on this repo may help :-) https://github.com/FavioVazquez/ds-cheatsheets |
This also may help: https://github.com/trekhleb/homemade-machine-learning (examples of machine learning techniques in Python, based on Andrew Ng's MOOC) |
During Hackergarten meetup, Sébastien Treguer suggested the following next step:
|
For reference: Mihangy is experimenting with Jupyter Notebooks and SurpriseLib. He opened a google group to discuss data analysis tasks on openwhyd's data using those tools. Aidan O'Donnell and Patrick Allain also showed interest in these initiatives, during this week's Hackergarten. |
For reference, I published a 700MB history/playlog file in https://github.com/openwhyd/openwhyd-data At some point, it may be worth picking a license and publishing the data on open data listings like awesomedata/awesome-public-datasets. Suggestions are welcome! |
This list of best practices could help: https://github.com/microsoft/recommenders |
Music genre detection and genre-based streams were removed in #399. => Closing. |
For music lovers, discovering new music is essential.
Spotify is well known for the quality of their "Discover Weekly" playlist, containing a personalised selection of tracks based on your listening history.
On Openwhyd, current ways to discover music are:
The first way is purely relying on humans and luck.
The second way relies on a list of 16 genres (a quite limited and vague selection of genres), in which popular tracks are classified, based on the names of the playlists that hold them. This kinda works but it's far from perfect. For example, we had to create a hard-coded rule to prevent Daft Punk songs from being recognised as Punk Rock music!
In order to discover new music by discovering relevant people to follow, we had also experienced showing a measure of profile similarity, but it was only based on the number of artists that were added by both users.
=> Anyone interested in exploring new ways to discover music on Openwhyd?
The text was updated successfully, but these errors were encountered: