the test cases are inside test_router.py
currently, there is one one case: loading one model.
it can be interesting to add more of them (but not too many tests - as it will make the CI runs too slow)
these test cases can be interesting to add:
- test unloading model
- set
--models-max 2, load 2 models at the same time, load one more model, the first model should be unloaded automatically
- test with
--no-models-autoload
- test with API key