Skip to content

G2F - Multimodal Live - TEXT returned in foreign language #232

@OldManMeta

Description

@OldManMeta
  • Programming language: Python, C++, JS
  • OS: Windows 11

Version 0.7.0

I have a client app that is sending the Setup to the python websocket server

setup_client_message = {
                setup: {
                    generation_config: { response_modalities: ['TEXT'] },
                },
            };

Now I am an English speaker, and I just happen to be in Thailand at the moment.
My client application is not a webbrowser - so I am assuming that language locale is normally being set by the browser?

When I send an audio message in, gemini responds with Thai language

receiving from gemini
response: setup_complete=None server_content=LiveServerContent(model_turn=Content(parts=[Part(video_metadata=None, thought=None, code_execution_result=None, executable_code=None, file_data=None, function_call=None, function_response=None, inline_data=None, text='อาจ')], role=None), turn_complete=None, interrupted=None) tool_call=None tool_call_cancellation=None
part: video_metadata=None thought=None code_execution_result=None executable_code=None file_data=None function_call=None function_response=None inline_data=None text='อาจ'
response: setup_complete=None server_content=LiveServerContent(model_turn=Content(parts=[Part(video_metadata=None, thought=None, code_execution_result=None, executable_code=None, file_data=None, function_call=None, function_response=None, inline_data=None, text='ารย์บอกว่าทำไมคุณถึงพูดภาษาไทยกับฉัน? ฉันไม่')], role=None), turn_complete=None, interrupted=N        None) tool_call=None tool_call_cancellation=None
part: video_metadata=None thought=None code_execution_result=None executable_code=None file_data=None function_call=None function_response=None inline_data=None text='ารย์บอกว่าทำไมคุณถึงพูดภาษาไทยกับฉัน? ฉันไม่'
response: setup_complete=None server_content=LiveServerContent(model_turn=Content(parts=[Part(video_metadata=None, thought=None, code_execution_result=None, executable_code=None, file_data=None, function_call=None, function_response=None, inline_data=None, text='เข้าใจ')], role=None), turn_complete=None, interrupted=None) tool_call=None tool_call_cancellaation=None
part: video_metadata=None thought=None code_execution_result=None executable_code=None file_data=None function_call=None function_response=None inline_data=None text='เข้าใจ'
response: setup_complete=None server_content=LiveServerContent(model_turn=None, turn_complete=True, interrupted=None) tool_call=None tool_call_cancellation=None

<Turn complete>

I have tried to set it in the generation_config to no avail.

How can I force it to return a response in only English text?

Metadata

Metadata

Assignees

Labels

api: gemini-apipriority: p2Moderately-important priority. Fix may not be included in next release.type: bugError or flaw in code with unintended results or allowing sub-optimal usage patterns.

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions