Split multimodal group into vision and audio #419

sjmonson · 2025-10-17T21:37:47Z

Summary

Turns the guidellm[multimodal] extras group into guidellm[audio] and guidellm[vision].

"I certify that all code in this PR is my own, except as noted below."

Use of AI

Includes AI-assisted code completion
Includes code generated by an AI application
Includes AI-generated tests (NOTE: AI written tests should have a docstring that includes ## WRITTEN BY AI ##)

Signed-off-by: Samuel Monson <smonson@redhat.com>

Copilot

Pull Request Overview

This PR splits the guidellm[multimodal] extras group into separate guidellm[audio] and guidellm[vision] extras to allow more granular installation of dependencies. Audio-related functionality is extracted into a new audio.py module, while vision-related functionality remains in vision.py.

Key Changes:

Audio functionality moved from vision.py to new audio.py module
Package extras split from single multimodal to separate audio and vision groups
Import statements updated to reflect new module structure

Reviewed Changes

Copilot reviewed 4 out of 5 changed files in this pull request and generated 3 comments.

File	Description
src/guidellm/extras/vision.py	Removed audio-related imports, functions (`encode_audio`, `_decode_audio`, `_encode_audio`), and updated error message to reference `guidellm[vision]`
src/guidellm/extras/audio.py	New module containing audio encoding/decoding functions moved from vision.py, including `encode_audio`, `_decode_audio`, `_encode_audio`, `is_url`, and `get_file_name`
src/guidellm/data/preprocessors/formatters.py	Updated import statements to reference new module locations (`audio` and `vision`)
pyproject.toml	Split `multimodal` extras into separate `audio` and `vision` groups with appropriate dependencies

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

src/guidellm/extras/audio.py

src/guidellm/extras/vision.py

pyproject.toml

jaredoconnell

I haven't tested it, but the code change looks good. The only thing I want to make sure is tested is that it works correctly with only one of vision or audio and not both in terms of handling the import errors.

Split multimodal group into vision and audio

af6b6b8

Signed-off-by: Samuel Monson <smonson@redhat.com>

sjmonson requested review from Copilot and jaredoconnell October 17, 2025 21:38

Copilot AI reviewed Oct 17, 2025

View reviewed changes

src/guidellm/extras/audio.py Show resolved Hide resolved

src/guidellm/extras/vision.py Show resolved Hide resolved

pyproject.toml Show resolved Hide resolved

jaredoconnell approved these changes Oct 17, 2025

View reviewed changes

markurtz approved these changes Oct 17, 2025

View reviewed changes

sjmonson merged commit c245fbd into main Oct 17, 2025
8 of 17 checks passed

sjmonson deleted the feat/seperate_multimodal branch October 17, 2025 23:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Split multimodal group into vision and audio #419

Split multimodal group into vision and audio #419

Uh oh!

sjmonson commented Oct 17, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jaredoconnell left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Split multimodal group into vision and audio #419

Split multimodal group into vision and audio #419

Uh oh!

Conversation

sjmonson commented Oct 17, 2025

Summary

Use of AI

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

jaredoconnell left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants