Add voice interactions with Gemini Live and ros-mcp-server to Gemini example. #115

tracelarue · 2025-09-24T15:55:44Z

Gemini Live for low-latency bidirectional voice interactions with ros-mcp-server.

Added gemini_live to examples/2_gemini.
Enables audio input from the user and audio output from Gemini.
Enables Gemini Live to use ros-mcp-server.
Tested on Ubuntu 22.04, Python 3.10, ROS2 Humble.

stex2005 · 2025-09-24T16:09:17Z

Thank you for your contribution, @tracelarue. I will give it a try soon.

@rjohn-v — I’d suggest adding a client/ folder in the repository to store installation packages and runnable clients. This would help keep different client implementations (e.g., Gemini API client) and their installation steps organized in one place. I’m not sure I’d keep this under examples/.

stex2005 · 2025-09-24T16:14:58Z

I connected issue #62 to this PR.
Afterward, we can close it and reopen it for other APIs.

stex2005 · 2025-09-26T00:25:00Z

Installation raises this error; it seems that I need a system package: portaudio.

I would recommend adding this to the dependencies in the README.md, along with the command to install it:

sudo apt install portaudio19-dev
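
A quick way to check whether a PortAudio shared library is already visible before installing could look like the sketch below. It uses only the standard library's `ctypes.util.find_library`; the helper name `portaudio_available` is made up for illustration and is not part of this PR. Note that it detects the runtime library only, so a build that needs the headers from the `-dev` package may still fail even when this returns `True`.

```python
from ctypes.util import find_library


def portaudio_available() -> bool:
    """Return True if a PortAudio shared library can be located on this system.

    Hypothetical helper for illustration; it checks the runtime library,
    not the development headers shipped by portaudio19-dev.
    """
    return find_library("portaudio") is not None


if __name__ == "__main__":
    if portaudio_available():
        print("PortAudio runtime library found")
    else:
        print("PortAudio not found; on Ubuntu try: sudo apt install portaudio19-dev")
```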

stex2005 · 2025-09-26T00:35:33Z

I couldn't run `uv run` on my WSL Ubuntu. Please specify that this works only on Ubuntu; I will try on Ubuntu soon.

stex2005 · 2025-09-26T00:41:06Z

@tracelarue @rjohn-v Another good next step would be to provide a dockerized version of the Gemini client, so it can be run more easily in different environments. Since the client is only a tool within this project, I don’t think we should invest too much effort in tightly integrating it into the repo. A simplified version of client_gemini (without audio support) would already be a good, lightweight solution and would fit well in the clients folder.

examples/2_gemini/gemini_live/mcp_config.json

stex2005 · 2025-09-26T00:16:26Z

examples/2_gemini/gemini_live/pyproject.toml

I would generally avoid creating a project inside the repository, but since this is a client project, it makes sense. Maybe we should move this example into a clients folder.

Would you like me to create a new folder for clients or leave it here for now? If yes, where should I create the clients folder?

@rjohn-v what are your thoughts?

stex2005 · 2025-09-26T00:16:59Z

examples/2_gemini/gemini_live/README.md

+
+2. **Get Google API Key**: Visit [Google AI Studio](https://aistudio.google.com) and create an API key
+
+3. **Create `.env` file**:


Please specify where the .env file should be. In this folder?
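
To make the question concrete: if the client resolved `.env` relative to its own script rather than the current working directory, the README could simply say "in this folder". The sketch below shows that idea with the standard library only. It is an assumption, not how the PR's client actually loads its settings; `default_env_path`, `load_env_file`, and the `GOOGLE_API_KEY` variable name are all hypothetical.

```python
from pathlib import Path


def default_env_path(script_path: str) -> Path:
    """Return the .env path next to the given script (assumed location)."""
    return Path(script_path).resolve().parent / ".env"


def load_env_file(path: Path) -> dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blanks and '#' comments."""
    values: dict[str, str] = {}
    if not path.is_file():
        return values
    for raw in path.read_text().splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop optional surrounding quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


# Example use inside a client script (variable name is hypothetical):
#   env = load_env_file(default_env_path(__file__))
#   api_key = env.get("GOOGLE_API_KEY")
```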

stex2005 · 2025-09-26T00:19:20Z

examples/2_gemini/gemini_live/README.md

+
+**Start Gemini Live:**
+```bash
+cd ros-mcp-server/examples/2_gemini/gemini_live


Can we rename this to gemini_client.py or client_gemini.py?

stex2005 · 2025-09-26T00:20:32Z

examples/2_gemini/gemini_live/README.md

+
+**Pre-requisites** See the [installation instructions](../../../docs/installation.md) for detailed setup steps.
+
+**Tested In:** Ubuntu 22.04, Python 3.10, ROS2 Humble


Does this work also on WSL?

I am unsure, I have not used WSL before.

Ok, then let's make sure we specify it only works with Ubuntu.

Ok, I'll specify that for now. I'll work on learning WSL so I can support it in the future.
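
Until WSL is supported, the client could warn when it detects that it is running there. A minimal sketch, assuming the usual convention that WSL kernels report "microsoft" in their release string; the function name `is_wsl` is hypothetical and nothing like it exists in this PR:

```python
import platform
from typing import Optional


def is_wsl(release: Optional[str] = None) -> bool:
    """Return True if the kernel release string looks like a WSL kernel.

    If no release string is given, the current system's release is used.
    """
    if release is None:
        release = platform.uname().release
    return "microsoft" in release.lower()


if __name__ == "__main__":
    if is_wsl():
        print("Warning: WSL detected; this example has only been tested on Ubuntu 22.04.")
```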

stex2005 · 2025-09-26T00:44:10Z

examples/2_gemini/gemini_live/mcp_handler.py

The role of this file is unclear. Is it needed as a dependency, or is it a "test_script"? If it is not strictly needed, we can consider removing it from the example.

stex2005 · 2025-09-26T00:58:53Z

Couldn't run uv run on my WSL Ubuntu. Please specify that this works only on Ubuntu, will try on Ubuntu soon.

I get the same error when I try to run mcp_handler.py.

tracelarue · 2025-09-26T20:05:03Z

@stex2005 Thank you for the review and feedback. I'll work on getting these changes and fixes implemented.

mokcontoro · 2025-09-27T14:57:48Z

@tracelarue Wow, voice command sounds super cool. Thanks for your contributions. I can't wait to try this feature!

Added Gemini Live with ros-mcp-server example to Gemini example.

b90dc09

tracelarue changed the title ~~Voice interactions with Gemini Live and ros-mcp-server added to Gemini example.~~ Add voice interactions with Gemini Live and ros-mcp-server to Gemini example. Sep 24, 2025

stex2005 requested review from stex2005, lpigeon and rjohn-v and removed request for lpigeon September 24, 2025 16:05

stex2005 linked an issue Sep 24, 2025 that may be closed by this pull request

Add example for running a local/on-prem LLM with MCP #62

Open

stex2005 removed a link to an issue Sep 24, 2025

Add example for running a local/on-prem LLM with MCP #62

Open

stex2005 linked an issue Sep 24, 2025 that may be closed by this pull request

Add example for running a local/on-prem LLM with MCP #62

Open

stex2005 requested changes Sep 26, 2025


renamed to gemini_client.py, readme updates

2412663


