Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request]: Speech to text and audio files support (OpenAI Whisper) #1514

Closed
1 task done
Palvr opened this issue Jul 15, 2024 · 0 comments
Closed
1 task done
Labels

Comments

@Palvr
Copy link

Palvr commented Jul 15, 2024

Is there an existing issue for the same feature request?

  • I have checked the existing issues.

Is your feature request related to a problem?

No response

Describe the feature you'd like

Give the option to upload audio files to the knowledge bases, and support the option on model providers to use speech to text models like OpenAI Whisper, so we can extract text from the audio files and integrate it as knowledge on the "Rag" process.
Whisper could be loaded via an API key from OpenAI or from XInference.
whisper

Describe implementation you've considered

No response

Documentation, adoption, use case

No response

Additional information

No response

KevinHuSh pushed a commit that referenced this issue Jul 22, 2024
### What problem does this PR solve?

#1514 

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
KevinHuSh pushed a commit that referenced this issue Jul 22, 2024
### What problem does this PR solve?

#1514 

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
@KevinHuSh KevinHuSh mentioned this issue Jul 22, 2024
27 tasks
@yingfeng yingfeng mentioned this issue Aug 6, 2024
46 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants