🎙️ Simple Transcriber

A powerful web application that combines advanced audio transcription with intelligent AI-powered text processing, all running seamlessly in your browser. Transform spoken content into polished documents with cutting-edge AI technology.

🚀 Try It Now

🌐 Live Demo - No installation required! You will need your own AI API keys.

https://jarchitect.org/whisper

✨ Features

🎤 Audio Transcription - Convert speech to text using OpenAI's advanced Whisper technology with exceptional accuracy and language support
🔍 AI Proofreading - Automatically correct grammar, spelling, and punctuation errors while preserving your original meaning and tone
📋 Meeting Minutes - Transform raw conversation transcripts into professional, structured meeting minutes with action items and key decisions
📝 Smart Summarization - Extract essential points and create concise summaries from lengthy transcripts and documents
📑 Outline Generation - Create well-structured outlines and organize unstructured content into logical, hierarchical formats

🛠️ Technical Architecture

This application leverages state-of-the-art AI technologies to deliver accurate, reliable results:

OpenAI Whisper - Advanced speech recognition
OpenAI Compatible Models - Flexible AI text processing
LocalStorage - Secure client-side data storage
Browser APIs - Native web technologies

All processing occurs securely through trusted AI APIs. Your sensitive data is never stored on our servers, ensuring complete privacy and security.

🚀 Getting Started

Prerequisites

To use this application, you'll need API keys from the following services:

Firework - For audio transcription
OpenRouter - For text processing

Setup Instructions

Create Accounts
- Sign up for Firework
- Sign up for OpenRouter
Purchase Credits
- Add a minimum of $5.00 to each service
Generate API Keys
- Create API keys from each service's dashboard
- Keep these keys secure and never share them
Configure Application
- Open the application in your browser
- Navigate to settings
- Enter your API keys

🎯 Recommended Models

🎙️ Transcription Model

Firework Whisper V3 Turbo (Recommended)

Cost Effective: $0.0009 per minute (standard) or $0.00126 per minute (with speaker diarization)
Privacy First: Zero data retention policy by default
High Accuracy: State-of-the-art speech recognition technology
Language Support: Multiple languages and dialects

📝 Text Processing Model

OpenRouter Models (Recommended)

Privacy Protected: No storage of prompts or responses
Model Variety: Access to multiple AI models
Competitive Pricing: Transparent, usage-based pricing

Note: While any OpenAI-compatible model can be used, be aware that many free AI services use your data for model training. The recommended services prioritize data privacy.

🔒 Privacy & Security

Data Handling

Your audio files and transcripts are processed directly in your browser or sent securely to AI APIs. No data passes through or is stored on our servers.

API Key Security

Your API keys are stored locally in your browser's secure storage and are only used for direct API requests. We never have access to your keys.

Storage & Cookies

This application uses localStorage to remember your preferences and settings. No tracking cookies or analytics are used.

Transparency

The complete source code is available for review in this repository. This ensures full transparency about how your data is handled and processed.

🏗️ Installation & Development

Local Development

Clone the repository

git clone https://github.com/jarchitect1/simple_transcriber.git
cd simple_transcriber

Open in browser
- Simply open index.html in your web browser
Configure API Keys
- Open the application
- Go to settings
- Enter your API keys

Project Structure

simple_transcriber/
├── index.html          # Main application page
├── settings.html       # Settings page
├── about.html          # About page
└── README.md           # This file

🌟 Use Cases

Journalists: Transcribe interviews and create polished articles
Students: Process lecture recordings into organized notes
Business Professionals: Generate meeting minutes and summaries
Content Creators: Transform audio content into written materials
Researchers: Analyze recorded interviews and discussions
Accessibility: Convert audio content for hearing-impaired users

🤝 Contributing

We welcome contributions!

📋 Roadmap

Yet to think of any.

🐛 Bug Reports & Feature Requests

Found a bug or have an idea for improvement? Please:

Check existing issues to avoid duplicates
Create a new issue with detailed information
Use appropriate labels (bug, enhancement, question)
Provide steps to reproduce for bugs
Include system information (browser, OS, etc.)

📧 Support

Need help or have questions?

Email: smartygab24@gmail.com
GitHub Issues: Use the Issues tab above
Response Time: I typically respond within 24 hours

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

OpenAI for the Whisper technology
Firework for providing cost-effective transcription services
OpenRouter for privacy-focused AI model access
Contributors who help improve this project

🤝 Develop Using Visual Studio Code with Kilo Code Extension

Mainly with DeepSeek R1, Gemini 2.5 Pro & Claude Sonnet 4

☕ Give Some Supports & Encouragement

Version: 1.0.0
Last Updated: July 2025

⭐ Star this repository if you find it helpful!

🔗 Share with others who might benefit from this tool!

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
LICENSE		LICENSE
README.md		README.md
about.html		about.html
app.js		app.js
icon-192.png		icon-192.png
icon-512.png		icon-512.png
index.html		index.html
settings.html		settings.html
settings.js		settings.js
style.css		style.css
translate.html		translate.html
translate.js		translate.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🎙️ Simple Transcriber

🚀 Try It Now

✨ Features

🛠️ Technical Architecture

🚀 Getting Started

Prerequisites

Setup Instructions

🎯 Recommended Models

🎙️ Transcription Model

📝 Text Processing Model

🔒 Privacy & Security

Data Handling

API Key Security

Storage & Cookies

Transparency

🏗️ Installation & Development

Local Development

Project Structure

🌟 Use Cases

🤝 Contributing

📋 Roadmap

🐛 Bug Reports & Feature Requests

📧 Support

📄 License

🙏 Acknowledgments

🤝 Develop Using Visual Studio Code with Kilo Code Extension

☕ Give Some Supports & Encouragement

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🎙️ Simple Transcriber

🚀 Try It Now

✨ Features

🛠️ Technical Architecture

🚀 Getting Started

Prerequisites

Setup Instructions

🎯 Recommended Models

🎙️ Transcription Model

📝 Text Processing Model

🔒 Privacy & Security

Data Handling

API Key Security

Storage & Cookies

Transparency

🏗️ Installation & Development

Local Development

Project Structure

🌟 Use Cases

🤝 Contributing

📋 Roadmap

🐛 Bug Reports & Feature Requests

📧 Support

📄 License

🙏 Acknowledgments

🤝 Develop Using Visual Studio Code with Kilo Code Extension

☕ Give Some Supports & Encouragement

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages