Audio Analyser

Audio Analyser is a powerful tool for analyzing, transcribing, and translating audio content from YouTube videos. It provides detailed insights into audio characteristics, speech patterns, and emotional content, along with accurate transcription and translation capabilities.

Features

Audio Analysis
- Pitch analysis (average, variability, range)
- Signal quality metrics (RMS, noise floor, SNR)
- Voice quality assessment (jitter, shimmer, quality score)
- Emotion detection and intensity analysis
- Speech pattern analysis (pause count, duration, word rate)
Transcription & Translation
- Accurate YouTube video transcription
- Multi-language translation support
- Real-time translation streaming
- Timestamped transcript segments
Visualization
- Interactive charts and graphs
- Detailed audio metrics visualization
- Comparative analysis views
- Raw data and visual toggle

Technology Stack

Backend

Python 3.12
FastAPI
yt-dlp
SciPy
NumPy
NLTK

Frontend

React.js
Tailwind CSS
Chart.js
Axios

Installation

Backend Setup

Clone the repository:

git clone https://github.com/yourusername/audio-analyser.git
cd audio-analyser/backend

Create a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install dependencies:
```
pip install -r requirements.txt
```
Start the server:
```
uvicorn main:app --reload
```

Frontend Setup

Navigate to the frontend directory:
```
cd ../frontend
```
Install dependencies:
```
npm install
```
Start the development server:
```
npm start
```

Usage

Open the application in your browser (default: http://localhost:3000)
Enter a YouTube video URL
View the transcription and analysis results
Use the translation feature to translate the transcript
Explore the detailed audio analysis visualizations

Screenshots

API Documentation

The backend API is documented using Swagger UI. After starting the server, visit:

http://localhost:8000/docs

Configuration

Create a .env file in the backend directory with the following variables:

API_KEY=your_api_key_here

Contributing

We welcome contributions! Please follow these steps:

Fork the repository
Create a new branch (git checkout -b feature/YourFeatureName)
Commit your changes (git commit -m 'Add some feature')
Push to the branch (git push origin feature/YourFeatureName)
Open a pull request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments

FastAPI for the backend framework
React for the frontend framework
Chart.js for data visualization
yt-dlp for YouTube video processing

I have used Locally running LLM on my System. Feel free to use whatever you want.

Contact

For any inquiries, please contact aman1024soni@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
backend		backend
frontend		frontend
notebooks		notebooks
screenshots		screenshots
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Audio Analyser

Features

Technology Stack

Backend

Frontend

Installation

Backend Setup

Frontend Setup

Usage

Screenshots

API Documentation

Configuration

Contributing

License

Acknowledgments

I have used Locally running LLM on my System. Feel free to use whatever you want.

Contact

About

Uh oh!

Releases

Packages

Uh oh!

Languages

codingBuddh/Audio_Analyses_AI

Folders and files

Latest commit

History

Repository files navigation

Audio Analyser

Features

Technology Stack

Backend

Frontend

Installation

Backend Setup

Frontend Setup

Usage

Screenshots

API Documentation

Configuration

Contributing

License

Acknowledgments

I have used Locally running LLM on my System. Feel free to use whatever you want.

Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages