Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement fallbacking for lang model pool #43

Closed
roma-glushko opened this issue Dec 26, 2023 · 0 comments · Fixed by #68
Closed

Implement fallbacking for lang model pool #43

roma-glushko opened this issue Dec 26, 2023 · 0 comments · Fixed by #68

Comments

@roma-glushko
Copy link
Member

roma-glushko commented Dec 26, 2023

Implement fallbacking on provider failures to another healthy provider.

Create a GEP to explain how will that work (GEP0005).

@roma-glushko roma-glushko self-assigned this Dec 30, 2023
@roma-glushko roma-glushko linked a pull request Jan 7, 2024 that will close this issue
roma-glushko added a commit that referenced this issue Jan 7, 2024
roma-glushko added a commit that referenced this issue Jan 7, 2024
roma-glushko added a commit that referenced this issue Jan 7, 2024
roma-glushko added a commit that referenced this issue Jan 7, 2024
roma-glushko added a commit that referenced this issue Jan 8, 2024
roma-glushko added a commit that referenced this issue Jan 8, 2024
roma-glushko added a commit that referenced this issue Jan 8, 2024
roma-glushko added a commit that referenced this issue Jan 8, 2024
roma-glushko added a commit that referenced this issue Jan 8, 2024
roma-glushko added a commit that referenced this issue Jan 8, 2024
roma-glushko added a commit that referenced this issue Jan 8, 2024
roma-glushko added a commit that referenced this issue Jan 8, 2024
roma-glushko added a commit that referenced this issue Jan 9, 2024
roma-glushko added a commit that referenced this issue Jan 9, 2024
roma-glushko added a commit that referenced this issue Jan 11, 2024
roma-glushko added a commit that referenced this issue Jan 11, 2024
- Implemented lang router handling flow
- Implemented the priority routing strategy
- Implemented exponential retry when no model is healthy
- Implemented model health (based on the token bucket) and basic rate limit tracker
- Implemented a custom error budget field to accept budgets in form of human-friendly string e.g. "10/m".
- Moved build/factory methods on the config level
roma-glushko added a commit that referenced this issue Jan 21, 2024
- ✨ [providers] Support for OpenAI Chat API #3 (@mkrueger12 )
- ✨ [API] #54 Unified Chat API  (@mkrueger12 )
- ✨ [providers] Support for Cohere Chat API #5 (@mkrueger12 )
- ✨ [providers] Support for Azure OpenAI Chat API #4  (@mkrueger12 )
- ✨ [providers] Support for OctoML Chat API #58 (@mkrueger12 )
- ✨ [routing] The Routing Mechanism, Adaptive Health Tracking, and Fallbacks #42 #43 #51 (@roma-glushko)
- ✨ [routing] Support for round robin routing strategy #44 (@roma-glushko)
- ✨ [routing] Support for the least latency routing strategy #46 (@roma-glushko)
- ✨ [routing] Support for weighted round robin routing strategy #45 (@roma-glushko)
- ✨ [providers] Support for Anthropic Chat API #60 (@mkrueger12 )
- ✨ [docs] OpenAPI specifications #22 (@roma-glushko )
- 🔧 [chores] Inited the project #6 (@roma-glushko)
- 🔊 [telemetry] Inited logging  #14 (@roma-glushko)
- 🔧 [chores] Inited Glide's CLI #12 (@roma-glushko)
- 👷 [chores] Setup CI workflows #8 (@roma-glushko)
- ⚙️ [config] Inited configs #11 (@roma-glushko)
-  🔧 [chores] Automatic coverage reports #39 (@roma-glushko)
- 👷 [build] Setup release workflows #9 (@roma-glushko)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant