RayVox is a cutting-edge Progressive Web Application (PWA) dedicated to the transcription of audio and video files, ensuring a smooth, accurate, and adaptable experience. Leveraging the advanced Whisper ASR technology. All these features are meticulously designed to function within your browser, ensuring a user-friendly experience.
This app is under development and it is not finished yet. Stay tuned for updates and enhancements as we continue to improve the platform.
- Next.js: A React framework for building user-friendly web applications with server-side rendering and static site generation capabilities.
- PocketBase: An all-in-one backend solution providing real-time databases, authentication, file storage, and more.
- Tailwind CSS: A utility-first CSS framework for rapidly building custom designs without leaving your HTML.
- Next UI: A React component library designed for building beautiful and modern interfaces with ease.
- OpenAI's Whisper ASR: An advanced automatic speech recognition system for accurate transcriptions.
- Improved Language Support: Expand the range of languages available for transcription and translation.
- User Authentication: Implement user authentication and authorization for personalized experiences and data protection.
- Enhanced File Management: Develop a more robust system for managing and organizing files within the application.
- Real-Time Transcription: Integrate real-time transcription capabilities for live audio and video streams.
- Customizable Subtitle Styling: Allow users to customize subtitle styles, including font, color, and positioning.
- Accessibility Features: Incorporate accessibility features such as voice commands and screen reader support.
-
Clone the Repository: Start by cloning this repository to your local machine.
git clone <repository-url>
-
Install Dependencies: Navigate to the project directory and install the necessary dependencies.
cd <project-directory> npm install
-
Run the Application: Launch the application on your local development server.
npm run dev
Your application should now be running on
http://localhost:3000
. -
Upload Your Files: Simply drag and drop your audio or video files into the designated area and let our tool do the rest.
We welcome contributions from the community! Whether it's a feature request, bug report, or a pull request, your input is valuable. Please refer to our contributing guidelines for more information on how you can contribute.
This project is licensed under the MIT License - see the LICENSE file for details.