Implementing local OpenAI API-style chat completions on any given inference server #1055
unit_tests.yml
on: pull_request
Linters
17s
Matrix: CPU Tests
Artifacts
Produced during runtime
Name | Size | |
---|---|---|
output_results
Expired
|
3 KB |
|