Skip to content

Quickly extract content from audio or video and turn it into a structured Markdown note. Simple, smooth, and easy to read!

License

Notifications You must be signed in to change notification settings

hqman/AudioNotes

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AudioNotes

A Structured Note-Taking System for Audio and Video Based on FunASR and Qwen2

Quickly extracts content from audio and video files, and uses large language models to organize it into structured markdown notes for easy reading.

FunASR: https://github.com/modelscope/FunASR

Qwen2: https://ollama.com/library/qwen2

Demonstration

Audio/Video Recognition and Organization

image

Conversation with Audio/Video Content

image

Usage Instructions

① Install Ollama

Download and install the Ollama package for your system

https://ollama.com/download

② Pull the Model

Using Alibaba's Qwen2 7b as an example: https://ollama.com/library/qwen2

ollama pull qwen2:7b

③ Deploy the Service

There are two deployment methods: Docker deployment or local deployment

Docker Deployment (Recommended) 🐳

curl -fsSL https://github.com/harry0703/AudioNotes/raw/main/docker-compose.yml -o docker-compose.yml
docker-compose up

After Docker starts, visit http://localhost:15433/

Login account is admin, password is admin (can be modified in the docker-compose.yml file)

Local Deployment 📦

Requires an accessible PostgreSQL database

conda create -n AudioNotes python=3.10 -y
conda activate AudioNotes
git clone https://github.com/harry0703/AudioNotes.git
cd AudioNotes
pip install -r requirements.txt

Rename .env.example to .env and modify the relevant configuration information

chainlit run main.py

After the service starts, visit http://localhost:8000/

Login account is admin, password is admin (can be modified in the .env file)

About

Quickly extract content from audio or video and turn it into a structured Markdown note. Simple, smooth, and easy to read!

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 95.9%
  • Dockerfile 4.1%