Skip to content

Audio and Video transcription with whisper API and Next.js

License

Notifications You must be signed in to change notification settings

sabber-slt/RayVox

Repository files navigation

RayVox: Audio and Video Transcription & Translation Solution

✏️ Introduction

RayVox is a cutting-edge Progressive Web Application (PWA) dedicated to the transcription of audio and video files, ensuring a smooth, accurate, and adaptable experience. Leveraging the advanced Whisper ASR technology. All these features are meticulously designed to function within your browser, ensuring a user-friendly experience.

⚡ Note

This app is under development and it is not finished yet. Stay tuned for updates and enhancements as we continue to improve the platform.

🚀 Technologies

  • Next.js: A React framework for building user-friendly web applications with server-side rendering and static site generation capabilities.
  • PocketBase: An all-in-one backend solution providing real-time databases, authentication, file storage, and more.
  • Tailwind CSS: A utility-first CSS framework for rapidly building custom designs without leaving your HTML.
  • Next UI: A React component library designed for building beautiful and modern interfaces with ease.
  • OpenAI's Whisper ASR: An advanced automatic speech recognition system for accurate transcriptions.

⭐ Future Enhancements

  • Improved Language Support: Expand the range of languages available for transcription and translation.
  • User Authentication: Implement user authentication and authorization for personalized experiences and data protection.
  • Enhanced File Management: Develop a more robust system for managing and organizing files within the application.
  • Real-Time Transcription: Integrate real-time transcription capabilities for live audio and video streams.
  • Customizable Subtitle Styling: Allow users to customize subtitle styles, including font, color, and positioning.
  • Accessibility Features: Incorporate accessibility features such as voice commands and screen reader support.

⚙️ Getting Started

  1. Clone the Repository: Start by cloning this repository to your local machine.

    git clone <repository-url>
    
    
  2. Install Dependencies: Navigate to the project directory and install the necessary dependencies.

    cd <project-directory>
    npm install
    
    
  3. Run the Application: Launch the application on your local development server.

    npm run dev
    
    

    Your application should now be running on http://localhost:3000.

  4. Upload Your Files: Simply drag and drop your audio or video files into the designated area and let our tool do the rest.

👩‍🏫 Contributing

We welcome contributions from the community! Whether it's a feature request, bug report, or a pull request, your input is valuable. Please refer to our contributing guidelines for more information on how you can contribute.

License

This project is licensed under the MIT License - see the LICENSE file for details.

About

Audio and Video transcription with whisper API and Next.js

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published