Skip to content

badagui/my-live-guru

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

My Live Guru

A real time voice transcription assistant with single-click LLM feedback.


  • Uses Deepgram API for transcription
  • Uses OpenAI API for LLM
  • Uses PyAudio for audio capture
  • Uses TkInter for GUI

Run Instructions:

  • clone this repo
  • python -m venv venv
  • pip install -r requirements.txt
  • create a .env file with DEEPGRAM_API_KEY and OPENAI_API_KEY set
  • set up a system audio loopback: activate Stereo Mix (Windows) or set up PulseAudio to monitor your output device (Linux).
  • run python src/main.py

explanation: accessing the system audio output directly is hard, so we need a virtual input containing all the output audio (an audio loopback). There are many free softwares that can do this, and windows comes with this by default called Stereo Mix, just have to activate it. Getting the raw output audio this way ensures we can work with any source.

Use Instructions:

  • Prepare your prompt in the stage tab.
  • Use the [INPUT_TRANSCRIPTION] tag to indicate where the input transcription should be placed.
  • Select your input devices and click start capturing.
  • The transcription will be updated in real time and you will be identified as "user:" and the loopback audio as "system:".
  • Click ASK GURU to send the prompt to the LLM and get the response.

Example using a mock interview video (from 2:44 to 3:54):

https://www.youtube.com/watch?v=1qw5ITr3k9E&t=164s



Applications:

  • Training and/or assistance for job interviews (for both interviewers and candidates), sales reps, aviation communications, legal practice, political debates...
  • Customer reps assistant: on handling difficult customers or following protocols.
  • Meetings assistant: delivering unique viewpoints, augmenting presented suggestions, and contrasting diverse lines of reasoning.
  • Any kind of real-time conversation chatbot.
  • Tabletop RPG helper.

todo list

  • Support local processing.
  • Better prompt management.
  • Allow multiple buttons to call different prompts.
  • Add response automation.
  • Remove the need for a system audio loopback setup.
  • Improve GUI.



p.s: this is a proof of concept, not intended for live use. Do not use without everyones consent.

About

A real time voice transcription assistant with single-click LLM feedback.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages