A Unity project that integrates the smallest local language model (LM) I could find that is also supported by Unity's inference engine.

LM download URL: https://huggingface.co/onnx-community/TinyLlama-1.1B-Chat-v1.0-ONNX/tree/main/onnx
- Download model_fp16.onnx and model_fp16.onnx_data from the URL above
- Create a new Unity project; it has to use Unity 6.2 or newer
- Clone the repo from GitHub and copy its files into the new project folder, replacing the existing ones
- Place model_fp16.onnx and model_fp16.onnx_data in the Assets/LLM folder
- Open the project in Unity and wait for it to finish compiling
- Select model_fp16.onnx in the Project window, click the "Serialize To StreamingAssets" button, and wait for the serialization to finish
- Assign the model to the script on Canvas/conversation in the Unity Inspector
- Run it; it should be good to go
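
If you would rather script the download and placement steps than click through the browser, here is a minimal Python sketch. It is not part of the repo. It assumes the usual Hugging Face `/resolve/main/` direct-download URL pattern, and the helper names (`download_url`, `missing_model_files`, `fetch_models`) are made up for this example. The two files are large, so expect the download to take a while.

```python
from pathlib import Path
from urllib.request import urlretrieve

# Repo and file names come from the steps above.
REPO = "onnx-community/TinyLlama-1.1B-Chat-v1.0-ONNX"
MODEL_FILES = ("model_fp16.onnx", "model_fp16.onnx_data")


def download_url(filename):
    """Build a direct-download URL (assumes Hugging Face's /resolve/main/ pattern)."""
    return f"https://huggingface.co/{REPO}/resolve/main/onnx/{filename}"


def missing_model_files(project_dir):
    """Return the model files that are not yet present in <project>/Assets/LLM."""
    llm_dir = Path(project_dir) / "Assets" / "LLM"
    return [name for name in MODEL_FILES if not (llm_dir / name).is_file()]


def fetch_models(project_dir):
    """Download whichever model files are still missing into Assets/LLM."""
    llm_dir = Path(project_dir) / "Assets" / "LLM"
    llm_dir.mkdir(parents=True, exist_ok=True)
    for name in missing_model_files(project_dir):
        urlretrieve(download_url(name), str(llm_dir / name))
```

Calling `fetch_models("path/to/YourUnityProject")` (the path is a placeholder) covers the download and Assets/LLM steps only; serializing the model and assigning it in the Inspector still happen inside the Unity editor.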