Phi-4 audio task prompts don't work at all (Apple, onnx, dotnet) #1455

Savvkin · 2025-05-06T18:44:34Z

I'm sorry, but I can't watch or listen to an attached audio file. If you can provide the spoken content in text form, I can help transcribe it for you.

But audio query prompt <|user|><|audio_1|><|end|><|assistant|> responses as expected:

As an AI, I don't have real-time capabilities to check current weather conditions. I would recommend checking a reliable weather source such as weather.com or your preferred weather application for the most accurate and current information.

To Reproduce
Steps to reproduce the behaviour:

Build onnx model for cpu as per guide
dotnet run the sample app

What's the weather like in San Francisco right now?

Desktop (please complete the following information):
Apple M1 Max 64Gb @ Sequoia 15.4.1

Additional context (building model)

Building onnxruntime-genai:

git clone https://github.com/microsoft/onnxruntime-genai
cd onnxruntime-genai
python build.py --config Release
dotnet build --configuration Release

Installing dependencies:

pip install backoff numpy==1.26.4 torch==2.6.0 torchaudio==2.6.0 torchvision onnx onnxscript peft requests scipy soundfile transformers
pip install ../onnxruntime-genai/build/macOS/Release/wheel/*.whl
pip uninstall onnxruntime
pip install --pre --index-url https://aiinfra.pkgs.visualstudio.com/PublicPackages/_packaging/ORT-Nightly/pypi/simple/ onnxruntime

Modifying builder.py:

--onnxruntime.quantization.matmul_4bits_quantizer
++onnxruntime.quantization.matmul_nbits_quantizer

Converting:

python3 builder.py --input ./ --output ./cpu --precision fp32 --execution_provider cpu

Copying genai_config.json, speech_processor.json and vision_processor.json files from gpu/gpu-int4-rtn-block-32.

The text was updated successfully, but these errors were encountered:

Savvkin changed the title ~~Audio task prompts don't work at all (Apple, onnx, dotnet)~~ Phi-4 audio task prompts don't work at all (Apple, onnx, dotnet) May 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Phi-4 audio task prompts don't work at all (Apple, onnx, dotnet) #1455

Phi-4 audio task prompts don't work at all (Apple, onnx, dotnet) #1455

Savvkin commented May 6, 2025 •

edited

Loading

Phi-4 audio task prompts don't work at all (Apple, onnx, dotnet) #1455

Phi-4 audio task prompts don't work at all (Apple, onnx, dotnet) #1455

Comments

Savvkin commented May 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Savvkin commented May 6, 2025 •

edited

Loading