Project on Natural Language Processing (NLP) with YouTube API using Python, OAuth 2.0, and automation through Kedro

natgluons/YouTube-Transcript-NLTK

YouTube-Transcript-NLTK

Overview

This project uses the YouTube Data API and Natural Language Processing (NLP) techniques to study how YouTube video recommendations relate to a user's watch history. It analyzes video transcripts and extracts keywords from them, giving a view of the topics that appear to drive video suggestions.

Key Features

  • OAuth 2.0 Integration: Securely authenticate with the YouTube Data API for data access.
  • Transcript Analysis: Extract and analyze video transcripts using NLP.
  • Keyword Extraction: Identify keywords from transcripts to understand video content.
  • Visualization: Visualize insights through charts and word clouds.
  • Export Data: Save results in JSON and Excel formats for further analysis.
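
The keyword extraction and JSON export features above can be sketched as follows. This is a minimal illustration, not the project's actual code: it counts word frequencies with the standard library rather than NLTK, and the function names (extract_keywords, keywords_to_json) and the short stopword list are assumptions made for the example. In an NLTK-based pipeline, nltk.corpus.stopwords.words("english") would be a fuller replacement for the hard-coded set.

```python
import json
import re
from collections import Counter
from typing import List, Tuple

# Small illustrative stopword list (an assumption for this sketch).
# NLTK's stopwords corpus would be a more complete alternative.
STOPWORDS = {
    "a", "an", "and", "are", "for", "in", "is", "it", "of", "on",
    "that", "the", "this", "to", "we", "with", "you",
}


def extract_keywords(transcript: str, top_n: int = 5) -> List[Tuple[str, int]]:
    """Return the top_n most frequent non-stopword tokens with their counts."""
    # Lowercase the text and keep runs of letters/apostrophes as tokens.
    tokens = re.findall(r"[a-z']+", transcript.lower())
    # Drop stopwords and very short tokens.
    words = [t for t in tokens if t not in STOPWORDS and len(t) > 2]
    return Counter(words).most_common(top_n)


def keywords_to_json(keywords: List[Tuple[str, int]]) -> str:
    """Serialize (keyword, count) pairs as a JSON array of objects."""
    return json.dumps(
        [{"keyword": word, "count": count} for word, count in keywords],
        indent=2,
    )


if __name__ == "__main__":
    sample = "data data data python python kedro"
    print(keywords_to_json(extract_keywords(sample, top_n=3)))
```

Raw frequency counting is the simplest possible scoring; a weighting scheme such as TF-IDF across several transcripts would likely separate topic words from filler more reliably.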

Getting Started

  • Clone the repository.
  • Set up OAuth 2.0 credentials on Google Cloud Console.
  • Run the data pipeline using Kedro (typically with the kedro run command from the project root).
  • Analyze results, visualize insights, and gain a better understanding of YouTube recommendations.
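
As a rough sketch of the OAuth 2.0 step above, the snippet below builds an authenticated YouTube Data API client from a credentials file downloaded from Google Cloud Console. It assumes the google-auth-oauthlib and google-api-python-client packages are installed; the file name client_secret.json, the read-only scope, and the helper name build_youtube_client are assumptions for illustration and may differ from what the project uses.

```python
# Read-only scope for the YouTube Data API (assumed sufficient for this sketch).
SCOPES = ["https://www.googleapis.com/auth/youtube.readonly"]


def build_youtube_client(client_secrets_path: str = "client_secret.json"):
    """Run the installed-app OAuth 2.0 flow and return a YouTube API client."""
    # Imported lazily so this module can be loaded without the Google
    # client libraries installed.
    from google_auth_oauthlib.flow import InstalledAppFlow
    from googleapiclient.discovery import build

    # Opens a browser window for user consent and listens on a free local port.
    flow = InstalledAppFlow.from_client_secrets_file(
        client_secrets_path, scopes=SCOPES
    )
    credentials = flow.run_local_server(port=0)

    # Build a client for version 3 of the YouTube Data API.
    return build("youtube", "v3", credentials=credentials)
```

A client created this way could then be queried, for example with youtube.videos().list(part="snippet", id="VIDEO_ID").execute(), where VIDEO_ID is a placeholder for a real video ID.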

Dependencies

  • Python 3.x
  • Kedro
  • YouTube Data API
  • NLTK
