For decades, female rappers have faced barriers in the music industry, including limited radio play, industry bias and underrepresentation in major playlists. In 2021, women accounted for 23.3% of artists on the Billboard Hot 100 Year-End Chart, compared to 76.7% men. In 2024, their percentage rose to 37.7%, but still showing a male majority. Even though the city of origin is crucial in rap music, shaping everything from sound and style to social commentary and the trajectory of artists’ careers, there are no official city-by-city streaming statistics for female rappers published by major music platforms or industry organizations. So which cities produce successful female rap artists? Is there a connection between a city's representation and its artists' streaming numbers or hit rates? And most of all, which US cities are most likely to produce and sustain female rap talent?
• Billboard Hot 100 Songs: Provides historical chart performance of songs, allowing to analyze how female rappers have evolved over time in mainstream rankings.
• Spotify Top 10k Streamed Songs: Helps in identifying the most streamed songs, enabling a comparison of the streaming success of female rappers versus male rappers.
• Spotify Dataset (Popular Hip Hop Artists and Tracks): Provides data about artists and songs streaming, allowing to analyze the popularity of female rap artists compared to their male counterparts.
• Tidal Playlist of Female Rappers: Features curated tracks from influential female rappers, offering a qualitative and quantitative look into their music styles and favorite themes.
• Complex Article: Best Rap Cities Ranking: Helps contextualize the geographic influence on the success of female rappers by mapping their origins to top ranked hip hop cities.
Used a mix of manual review and AI tagging to add missing gender and city info. Normalized cities (Harlem and Brooklyn → NYC) and removed non-rap artists like Beyoncé and Rihanna.
Explored distributions, top cities, and artist frequency across platforms using histograms and bar charts.
Grouped artists by city and platform and labeled artists as "hit = 1" if they had 5+ mentions. Applied one-hot encoding, scaling, and handled missing values.
Trained a Random Forest Classifier to predict hits from city and platform features.
Created visual summaries such as:
-
Top female rappers by total mentions
-
Leading cities for female rapper success
-
Confusion Matrix for model results
-
Duration of female rap tracks
- Nicki Minaj (NYC) had the highest total mentions across all platforms.
- NYC, Houston, and LA were the top cities for female rapper success.
- Billboard and Spotify Top 10k were most strongly correlated with hit status.
- TIDAL and Complex offered qualitative insights but less predictive value.
- Model results show promise, though further validation on larger samples is recommended.
- Scout where it counts: Focus A&R efforts on NYC, Houston, LA, and Atlanta.
- Watch early platform momentum: Use Spotify 10k and Billboard as early breakout signals.
- Use predictive tools: Combine city and platform data for scalable artist discovery.
- Add features like lyrics, mood, collaborations, and tempo.
- Explore international scenes (e.g., the UK, France, Brazil, South Africa).
- Incorporate TikTok and social media virality metrics.
- Use advanced ML models (e.g., XGBoost) for better prediction.
- Build a dashboard for label teams to explore artist metadata and predictions.
Author: Éloïse Bouton
LinkedIn: https://www.linkedin.com/in/eloisebouton/
This project is part of the Springboard Data Science Career Track – Capstone Project.
