[Feature Request]: Speech to text and audio files support (OpenAI Whisper) #1514

Palvr · 2024-07-15T09:35:31Z

Is there an existing issue for the same feature request?

I have checked the existing issues.

Is your feature request related to a problem?

No response

Describe the feature you'd like

Add the option to upload audio files to knowledge bases, and let model providers expose speech-to-text models such as OpenAI Whisper, so that text can be extracted from audio files and integrated as knowledge in the RAG process.
Whisper could be loaded via an API key from OpenAI or from XInference.
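
To make the request concrete, here is a minimal sketch of the flow I have in mind: transcribe an uploaded audio file, then split the transcript into chunks that a knowledge base could index. This is not an existing implementation in this project. The names `openai_whisper_transcriber`, `chunk_text`, `audio_to_chunks`, and the `AUDIO_EXTENSIONS` list are hypothetical. The only real library call is `client.audio.transcriptions.create` from the `openai` Python package (v1 or later); pointing `base_url` at XInference assumes the deployment exposes an OpenAI-compatible endpoint, and the model name would then be whatever that server registers.

```python
from pathlib import Path
from typing import Callable, List, Optional

# Assumed set of accepted upload types; the real list would be a project decision.
AUDIO_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac", ".ogg"}


def openai_whisper_transcriber(
    api_key: str,
    model: str = "whisper-1",
    base_url: Optional[str] = None,
) -> Callable[[Path], str]:
    """Build a transcriber backed by the OpenAI audio transcription API.

    base_url is optional; it is only useful if another server (for example
    an XInference deployment) speaks the same API, which is an assumption.
    """
    from openai import OpenAI  # imported lazily so the rest works without the package

    client = OpenAI(api_key=api_key, base_url=base_url)

    def transcribe(path: Path) -> str:
        with path.open("rb") as audio:
            result = client.audio.transcriptions.create(model=model, file=audio)
        return result.text

    return transcribe


def chunk_text(text: str, max_chars: int = 500) -> List[str]:
    """Greedily pack whitespace-separated words into chunks of at most max_chars.

    A single word longer than max_chars is kept whole in its own chunk.
    """
    chunks: List[str] = []
    current = ""
    for word in text.split():
        candidate = f"{current} {word}".strip()
        if len(candidate) > max_chars and current:
            chunks.append(current)
            current = word
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks


def audio_to_chunks(
    path: Path,
    transcribe: Callable[[Path], str],
    max_chars: int = 500,
) -> List[str]:
    """Reject non-audio uploads, transcribe the file, and chunk the transcript."""
    if path.suffix.lower() not in AUDIO_EXTENSIONS:
        raise ValueError(f"unsupported audio format: {path.suffix}")
    return chunk_text(transcribe(path), max_chars=max_chars)
```

Usage would look roughly like `audio_to_chunks(Path("meeting.mp3"), openai_whisper_transcriber(api_key="..."))`. Passing the transcriber in as a callable keeps the provider swappable, which is the point of supporting both OpenAI and XInference.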

Describe implementation you've considered

No response

Documentation, adoption, use case

No response

Additional information

No response

KevinHuSh added the Feature label Jul 16, 2024

guoyuhao2330 mentioned this issue Jul 22, 2024

Add sequence2txt model.py #1633

Merged

6 tasks

KevinHuSh pushed a commit that referenced this issue Jul 22, 2024

Add sequence2txt model.py (#1633)

29a7b7a

### What problem does this PR solve?

#1514

### Type of change

- [x] New Feature (non-breaking change which adds functionality)

guoyuhao2330 mentioned this issue Jul 22, 2024

Add ParsertType Audio #1637

Merged

6 tasks

KevinHuSh pushed a commit that referenced this issue Jul 22, 2024

Add ParsertType Audio (#1637)

ac7a0d4

### What problem does this PR solve?

#1514

### Type of change

- [x] New Feature (non-breaking change which adds functionality)

KevinHuSh mentioned this issue Jul 22, 2024

ROADMAP 2024 #162

Closed

27 tasks

KevinHuSh closed this as completed Jul 30, 2024

yingfeng mentioned this issue Aug 6, 2024

ROADMAP 2024 #1821

Open

46 tasks
