Skip to content

Robin6551/YouTube-API-Data-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 

Repository files navigation

📊 YouTube Content Strategy Deep Dive: A Data-Driven Analysis This project utilizes the YouTube Data API to analyze over 1,300 videos from high-performing educational and technical YouTube channels. The goal is to uncover the fundamental correlation patterns between key metrics (Views, Likes, Comments, and calculated Engagement Rate) to inform effective content strategies.

📝 Project Summary: Methodology and Outcomes What I Did (Methodology) The core of this project involved a rigorous, data-driven approach to understand YouTube content performance:

Data Acquisition: I used the YouTube Data API v3 to systematically collect data from the uploads of over five highly influential channels in the tech and data science space (including freeCodeCamp.org, Android Developers, and Programming with Mosh).

Dataset Construction: A comprehensive dataset of 1,369 videos was assembled, capturing raw metrics like view count, like count, and comment count for each video.

Feature Engineering: The Engagement Rate was calculated ( Views Likes ​ ) to normalize performance across videos of varying popularity, providing a clearer metric of content quality and audience satisfaction.

Statistical Analysis: A correlation matrix was computed on the key metrics (Views, Likes, Comments, Engagement Rate) to quantify the strength and direction of the relationships between them.

What is the Outcome (Key Strategic Insights) The analysis resulted in three critical, actionable insights for YouTube content creators, summarized below:

Metric 1

Metric 2

Correlation (r)

Strategic Insight

Likes

Comments

∼0.97

Conversation is King: The near-perfect correlation strongly suggests that genuine audience satisfaction (Likes) is the primary driver of community discussion (Comments). Prioritize content quality and value to foster conversation.

Views

Likes

∼0.53

Conversion Challenge: A moderate link shows that getting a click (View) is only half the battle. Creators must focus on high-quality delivery and strong calls to action to successfully convert viewers into satisfied engagers (Likes).

Views

Engagement Rate

∼−0.23

The Viral Drop-off: The slight negative correlation suggests that as videos reach a mass audience (going 'viral'), the normalized Engagement Rate often declines. Optimization should prioritize deep engagement over sheer reach.

🚀 Getting Started

  1. Prerequisites Python 3.8+

A Google Cloud Project with the YouTube Data API v3 enabled.

Your API Key.

  1. Setup and Installation Clone the repository and set up a virtual environment:

cd YouTube-Data-Analysis python -m venv venv source venv/bin/activate # On Windows: venv\Scripts\activate pip install -r requirements.txt

  1. API Key Configuration Create a file named .env in the project root directory and add your YouTube Data API key:

.env file

YOUTUBE_API_KEY="YOUR_PRIVATE_API_KEY_HERE"

Note: The .gitignore file ensures this sensitive key is never uploaded to GitHub.

  1. Running the Analysis Execute the main script. It will fetch data, perform the analysis, print the correlation matrix, and save a visualization.

python youtube_analyzer.py

⚙️ Project Structure File/Folder

Description

youtube_analyzer.py

Main script for API interaction, data fetching, processing, and statistical analysis.

YouTube Data Analysis .ipynb

Original Jupyter Notebook for exploratory data analysis and visualization (as provided).

requirements.txt

Lists required Python libraries.

.gitignore

Ensures API keys and generated data are ignored by Git.

data/

(Future) Directory to store raw and processed CSV data.

plots/

(Future) Directory to store generated correlation heatmaps and visualizations.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published