-
-
Notifications
You must be signed in to change notification settings - Fork 126
[Platform] Introduce Speech support
#943
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
2c573eb to
8dd5cd5
Compare
|
To me we maybe should introduce capabilities also to platforms rather than having a voice component. As far as I understand I cannot use the Voice component standalone, right? I don't think a dedicated component is the way to go here |
|
We can introduce it via the Platform, could be easier, the voice can be used without agents but it will requires the Will update the PR to match this approach 👍🏻 |
|
I agree, Agent scope is not needed 👍🏻 |
VoiceProviders and VoiceListeners
|
Hi @Guikingone, i agree that week lack some kind of guidance on how voices work - but same goes for other binary stuff like creating images or videos. so two things i would like to understand
btw, "speech" is more common than "vioce" isn't it? |
The main goal is to add the capacity to have an agent/platform that can "listen" and answer to inputs thanks to voice / speech (voice is used as a sugar here, could be renamed to speech), creating a workflow where you can submit voice, call the platform that transforms it to speech / text (depending on the situation you're in) and returning it to the user without frictions.
It is now part of
Agreed, could be renamed to
Yes, the goal is to ease it with a "built-in" approach / API that stays transparent for the user. |
VoiceProviders and VoiceListenersSpeech support via Platform
|
just realized we should the "audio" demo to "speech" as well - and i'm def not really happy with that solution there. can we make it as easy as the structured output - like with an listener? i like that starting point: $result = $platform->invoke('eleven_multilingual_v2', new Text('Hello world'), [
'voice' => 'Dslrhjl3ZpzrctukrQSN', // Brad (https://elevenlabs.io/app/voice-library?voiceId=Dslrhjl3ZpzrctukrQSN)
]);
echo $result->asVoice();what would be the return type here? would it be same as |
Could be something to explore, the API is not locked for now.
My first approach was to do the same thing as |
79ddf87 to
f011c3e
Compare
dcae952 to
be04280
Compare
Speech support via PlatformSpeech support
be04280 to
b319521
Compare
|
Well, might seems weird but here we go, |
437c9fb to
28ce2b1
Compare
28ce2b1 to
41de046
Compare
Uh oh!
There was an error while loading. Please reload this page.