The purpose of this ticket is to extract transcripts from video and audio files by implementing a speech-to-text solution. The task involves identifying suitable tools and techniques to process audio inputs and generate accurate free-text transcripts.
Tasks:
Tool evaluation: Research and identify tools or libraries that can handle audio processing and speech-to-text conversion effectively. Consider factors such as accuracy, language support, ease of integration, and scalability.
Audio extraction: Develop a process to extract audio from video files, as this will be the input for the speech-to-text conversion. Identify and utilize appropriate tools or libraries to extract audio streams from different video formats.
Speech-to-text conversion: Implement a speech-to-text solution using the chosen tool or library (can use Bhashini APIs). This should convert the audio files into free-text transcripts.
Error handling and refinement: Show the entire transcript to the user and allow them to correct any errors in it.
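
As a starting point for the audio extraction task, here is a minimal sketch in Python. It assumes the `ffmpeg` command-line tool is installed and on the PATH; the ticket does not name a tool, so this is one possible choice. The function names, the 16 kHz mono WAV output, and the `transcribe` callback are illustrative assumptions. The callback stands in for whichever speech-to-text backend is chosen (Bhashini or another), since its API is not specified here.

```python
import shutil
import subprocess
from pathlib import Path
from typing import Callable, List


def build_ffmpeg_command(video_path: str, audio_path: str,
                         sample_rate: int = 16000) -> List[str]:
    """Build an ffmpeg command that writes a mono 16-bit PCM WAV file."""
    return [
        "ffmpeg",
        "-y",                     # overwrite the output file if it exists
        "-i", video_path,         # input video (or audio) file
        "-vn",                    # drop the video stream
        "-ac", "1",               # downmix to a single (mono) channel
        "-ar", str(sample_rate),  # resample; 16 kHz is an assumed default
        "-c:a", "pcm_s16le",      # encode audio as 16-bit PCM
        audio_path,
    ]


def extract_audio(video_path: str, audio_path: str) -> Path:
    """Run ffmpeg to extract the audio track; raises if ffmpeg fails."""
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("ffmpeg was not found on PATH")
    subprocess.run(build_ffmpeg_command(video_path, audio_path),
                   check=True, capture_output=True)
    return Path(audio_path)


def transcribe_video(video_path: str,
                     transcribe: Callable[[Path], str],
                     work_dir: str = ".") -> str:
    """Extract audio, then hand it to a caller-supplied speech-to-text function.

    `transcribe` is a hypothetical hook: it receives the WAV path and
    returns the transcript text from the chosen backend.
    """
    audio_path = Path(work_dir) / (Path(video_path).stem + ".wav")
    extract_audio(video_path, str(audio_path))
    return transcribe(audio_path)
```

Keeping the speech-to-text call behind a callback means the tool evaluation can swap backends without touching the extraction step, and the returned transcript can be shown to the user for correction before it is saved.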