- The problem statement is to create a Streamlit application that allows users to access and analyze data from YouTube Channels:
-
Using _Google API we retrieve all the relevant data for a YouTube channel ID.
| Channel name | Subscribers | Total video count | Playlist ID | Video ID | Likes| Comments of each video |
-
Store the data in a MongoDB database.
-
Ability to collect data form the YouTube channels and store them in the data lake by clicking a button.
-
python -m streamlit run app.py
or streamlit run app.py
- provide your api key for the youtube
- provide your sql user, database name, password.
- provide your mongodb database name.
- Clone the Repository:
git clone git@github.com:AJIN-B/Youtube-data-Harvesting.git
cd Youtube-data-Harvesting
- Set Up a Virtual Environment (Optional but Recommended):
# For macOS and Linux:
python3 -m venv venv
# For Windows:
python -m venv venv
- Activate the Virtual Environment:
# For macOS and Linux:
source venv/bin/activate
# For Windows:
.\venv\Scripts\activate
- Install Required Dependencies:
pip install -r requirements.txt
- Set up the Environment Variables:
# add the following Keys
API_KEY="Your Youtube API KEY"
HOST="Your HOST ID"
USER="Your USER ID"
PASSWORD="Your PASSWORD"
PORT="Your PORT"
DATABASE_NAME="Your DATABASE NAME"
- Run:
python -m streamlit run app.py
or
streamlit run app.py
After running the command, Streamlit will provide a local URL (usually http://localhost:8501/
) which you can open in your web browser to access application.
=======