fix(modelgateway): KafkaAdmin error when fetching OIDC token #5428

Merged
merged 1 commit into from Mar 12, 2024

@lc525 lc525 commented Mar 12, 2024

When a new model is added, modelgateway opens several connections to the Kafka cluster in quick succession (producer, consumer, admin).

When using Confluent Cloud Kafka with OAUTHBEARER/OIDC token auth, the KafkaAdmin client sometimes failed to fetch the OIDC token. Our tests used MS Entra ID as the identity provider; we are unsure whether it rate-limits token requests.

We don't use the admin and producer connections concurrently (the admin client is mostly used to create topics when a model is added), so this change lets the Kafka admin client reuse the producer connection, and with it the same token.
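The idea can be illustrated with a minimal, self-contained Go sketch. It is not the code from this PR (the diff is not shown here): `tokenProvider`, `connection`, `producer`, `adminClient` and the constructor functions are all hypothetical stand-ins that only count token fetches. If modelgateway uses confluent-kafka-go, the real change may correspond to `kafka.NewAdminClientFromProducer`, but that is an assumption.

```go
package main

import (
	"fmt"
	"sync"
)

// tokenProvider is a hypothetical stand-in for an OIDC token endpoint.
// It only counts how many times a token was requested.
type tokenProvider struct {
	mu      sync.Mutex
	fetches int
}

func (t *tokenProvider) fetch() string {
	t.mu.Lock()
	defer t.mu.Unlock()
	t.fetches++
	return fmt.Sprintf("token-%d", t.fetches)
}

// connection models an authenticated broker connection holding one token.
type connection struct {
	token string
}

// dial authenticates a new connection, which costs one token fetch.
func dial(tp *tokenProvider) *connection {
	return &connection{token: tp.fetch()}
}

type producer struct{ conn *connection }
type adminClient struct{ conn *connection }

func newProducer(tp *tokenProvider) *producer {
	return &producer{conn: dial(tp)}
}

// newAdmin opens its own connection, so it triggers an extra token fetch.
func newAdmin(tp *tokenProvider) *adminClient {
	return &adminClient{conn: dial(tp)}
}

// newAdminFromProducer reuses the producer's connection and token,
// so no additional token fetch happens.
func newAdminFromProducer(p *producer) *adminClient {
	return &adminClient{conn: p.conn}
}

func main() {
	separate := &tokenProvider{}
	p1 := newProducer(separate)
	a1 := newAdmin(separate)
	fmt.Println("separate connections, token fetches:", separate.fetches,
		"same token:", p1.conn.token == a1.conn.token)

	shared := &tokenProvider{}
	p2 := newProducer(shared)
	a2 := newAdminFromProducer(p2)
	fmt.Println("reused connection, token fetches:", shared.fetches,
		"same token:", p2.conn.token == a2.conn.token)
}
```

Sharing the connection is only safe here because, as noted above, the admin and producer are not used concurrently; the sketch does not model the consumer connection, which still authenticates on its own.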

Fixed issues:

  • INFRA-867: Modelgateway sometimes fails to fetch OIDC tokens

@lc525 lc525 added the v2 label Mar 12, 2024
@lc525 lc525 requested a review from sakoush as a code owner March 12, 2024 11:42

@sakoush sakoush left a comment

lgtm

@lc525 lc525 merged commit 0eef530 into SeldonIO:v2 Mar 12, 2024
3 checks passed
@lc525 lc525 deleted the fix.dataflow.infra-867.pipeline-stuck branch March 12, 2024 11:58

2 participants