Skip to content

Latest commit

 

History

History
23 lines (19 loc) · 1 KB

README.md

File metadata and controls

23 lines (19 loc) · 1 KB

YouTube-Transcript-NLTK

Overview

This project leverages the YouTube Data API and Natural Language Processing (NLP) techniques to gain insights into YouTube video recommendations based on user watch history. By analyzing video transcripts and identifying keywords, it sheds light on the algorithm driving video suggestions.

Key Features

  • OAuth 2.0 Integration: Securely authenticate with YouTube API for data access.
  • Transcript Analysis: Extract and analyze video transcripts using NLP.
  • Keyword Extraction: Identify keywords from transcripts to understand video content.
  • Visualization: Visualize insights through charts and word clouds.
  • Export Data: Save results in JSON and Excel formats for further analysis.

Getting Started

  • Clone the repository.
  • Set up OAuth 2.0 credentials on Google Cloud Console.
  • Run the data pipeline using Kedro.
  • Analyze results, visualize insights, and gain a better understanding of YouTube recommendations.

Dependencies

  • Python 3.x
  • Kedro
  • YouTube Data API
  • NLTK