YouTube Summary with Whisper API, ChatGPT API + Eleven Labs Voice
- To let users consume content in the forms they want, not necessarily the forms it was uploaded in
- To reclaim time spent watching long-form content: summarisation helps you decide whether the full video is worth your time
- Proof of concept for a larger suite of personalised content-summarisation tools
- Learning more about use cases for AI and the ease of developing helpful micro-tools with ChatGPT
I'm building a platform for devs to share their projects - come join!
Note: the Eleven Labs voice summary is optional (it can be removed or commented out)
- You can choose from different OpenAI models in models.py - gpt-3.5-turbo is the default
- The current prompts are just experiments; you are encouraged to try variations to find something that works for you
- systembot.txt sets the system message for the GPT functions - leaving it blank has worked well so far
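As a rough sketch of how the model choice and the systembot.txt system message fit together, the request payload for a chat call can be assembled like this (the function name and logic here are illustrative, not the repo's actual code):

```python
from pathlib import Path

def build_chat_request(user_text, model="gpt-3.5-turbo", system_file="systembot.txt"):
    """Assemble the messages payload for a chat-completion call.

    Illustrative sketch: a blank (or missing) systembot.txt simply means
    no system message is sent, matching the note above.
    """
    messages = []
    system_message = ""
    if Path(system_file).exists():
        system_message = Path(system_file).read_text().strip()
    if system_message:  # blank file -> skip the system role entirely
        messages.append({"role": "system", "content": system_message})
    messages.append({"role": "user", "content": user_text})
    return {"model": model, "messages": messages}
```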
- Open the notebook in Colab - you will likely need to create a new notebook and upload it to your Drive
- Browse Colab-hosted files from the folder icon in the left pane
- Upload all necessary local files (you can copy the contents of the 'copy-my-contents' folder)
- Set your keys in the openai and elevenlabs txt files - note these files are not safe to share; feel free to secure your Colab API keys in other ways
- Set the intended YouTube URL in "URL.txt"
- Run the Colab sections
- Copy .env.template and rename it to .env
- Set your OpenAI key in .env
- (optional) Set your Eleven Labs API key in .env - for voiced transcripts
- Set the desired YouTube URL in "URL.txt"
- Edit the local folder paths in podsummariser.py as needed
- You may need to install dependencies first:
pip install -r requirements.txt
- Run
python podsummariser.py
Note: All summarised files are placed in a './summary/' subfolder
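For reference, the .env file is plain KEY=value lines; most setups read it with the python-dotenv package, but a minimal stdlib parser shows what the script expects (this sketch is for illustration only):

```python
from pathlib import Path

def load_env(path=".env"):
    """Parse KEY=value lines from a .env file, ignoring comments and blanks.

    Minimal stdlib stand-in for python-dotenv, for illustration only.
    """
    env = {}
    for line in Path(path).read_text().splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        env[key.strip()] = value.strip().strip('"').strip("'")
    return env
```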
- Downloads a high-quality version of the YouTube video via pytube
- Extracts the audio into segment files via pydub so it can be transcribed to text via the Whisper API
- Uses the Whisper API to transcribe each segment
- Joins the segments back up - the full transcription can be found in 'podscript.txt'
- 'podscript.txt' is fed to GPT in chunks with the task in 'prompt1.txt' to make a full summary, saved to 'initialsummary.txt'
- 'initialsummary.txt' is broken into chunks and put through GPT with the task in 'prompt2.txt' to make bullet points and 'prompt3.txt' to make a concise summary; the results are saved in 'bulletpoints.txt' and 'shortsummary.txt'
- 'bulletpoints.txt' is fed into GPT to generate a narration synopsis - 'synopsis.txt'
- 'synopsis.txt' is sent to the Eleven Labs API to be synthesised into a spoken audio file - audiosynopsis.mp3
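The chunking steps above can be sketched like this (the function name and character limit are illustrative; the real limit depends on the model's context window):

```python
def chunk_text(text, max_chars=8000):
    """Split a long transcript into chunks no longer than max_chars,
    breaking on whitespace so words are never cut in half.

    Illustrative sketch of the chunking idea, not the repo's exact code.
    """
    chunks, current = [], ""
    for word in text.split():
        if current and len(current) + 1 + len(word) > max_chars:
            chunks.append(current)
            current = word
        else:
            current = f"{current} {word}" if current else word
    if current:
        chunks.append(current)
    return chunks
```

Each chunk is then sent to GPT with the prompt text prepended, and the per-chunk results are concatenated into the output file.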
- If you run into errors with ffmpeg, install it (on macOS via Homebrew):
brew install ffmpeg
Credit for the initial codebase, which has been a key building block for further development, goes to allaboutai on YouTube
User assumes all responsibility in using the tool in ways that comply with their local and federal laws.