Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature] GPT-4o with audio support #500

Open
codewithkenzo opened this issue May 15, 2024 · 3 comments
Open

[Feature] GPT-4o with audio support #500

codewithkenzo opened this issue May 15, 2024 · 3 comments

Comments

@codewithkenzo
Copy link

Mr. end-4, I know you want it too

@H0mire
Copy link
Contributor

H0mire commented May 16, 2024

Can't wait for OpenAI to release GPT-4o with stt and tts support. :o

end-4 added a commit that referenced this issue May 17, 2024
@end-4
Copy link
Owner

end-4 commented May 17, 2024

oxygen api provides that for free
idk if it's real but it does use emojis like the gpt4o on poe.com
4 or 4o? no clue

idk how to include sound yet

@H0mire
Copy link
Contributor

H0mire commented May 17, 2024

Yeah currently "Audio" is usually generated through a tts service, which you would have to integrate separately. OpenAI hinted that they will release the GPT 4o with Audio processing, which is basically native tts and stt without a separate model or service. This Results to a low latency like a normal human conversation and capability to process emotional expression.

end-4 added a commit to Soliprem/dots-hyprland that referenced this issue May 24, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants