-
Notifications
You must be signed in to change notification settings - Fork 11.4k
Msingh openai voice solutions gpt4o audio #1506
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Msingh openai voice solutions gpt4o audio #1506
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Some comments!
" \n", | ||
"**- Language** refers to the spoken or written system of communication. For instance, Hindi and Marathi are different languages, but both use the Devanagari script. Similarly, English and French are different languages, but are written in Latin script. \n", | ||
" \n", | ||
"**- Script** refers to the set of characters or symbols used to write the language. For example, Serbian language traditionally written in Cyrillic Script, is also written in Latin script.\n", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I would link to the wikipedia page on "Writing system" https://en.wikipedia.org/wiki/Writing_system
"\n", | ||
"A note on semantics used in this Cookbook regarding **Language** and written **Script**. These words are generally used interchangeably, though it's important to understand the distinction, given the task at hand. \n", | ||
" \n", | ||
"**- Language** refers to the spoken or written system of communication. For instance, Hindi and Marathi are different languages, but both use the Devanagari script. Similarly, English and French are different languages, but are written in Latin script. \n", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Same here, would link to https://en.wikipedia.org/wiki/Language
" }\n", | ||
"\n", | ||
" # Construct the request data\n", | ||
" data = {\n", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is there a reason you're using the API directly? Why not use the python library since you made the users install it already?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
lgtm overall! Let's add SDK when it gets fully released
Summary
This document provides a step-by-step guide on how to use OpenAI’s GPT-4o to translate and dub audio files from one language to another, specifically focusing on translating English audio into Hindi. It outlines the key concepts of language and script, as well as the benefits of GPT-4o’s audio-in and audio-out modality, which simplifies the dubbing process by handling transcription and translation in one step.
Motivation
New audio-in and audio-out modality for GPT-4o.
For new content
When contributing new content, read through our contribution guidelines, and mark the following action items as completed:
We will rate each of these areas on a scale from 1 to 4, and will only accept contributions that score 3 or higher on all areas. Refer to our contribution guidelines for more details.