Allow setting multiple index prefixes and choose from them in the UI #2726

yoave23 · 2021-01-13T11:57:08Z

Requirement - what kind of business use case are you trying to solve?

This is a slim version of #2509.
We'd like to allow users to query different (pre-configured) data sources.

Problem - what in Jaeger blocks you from solving the requirement?

Currently, Jaeger has no way to query different data sources (ES indices in our case) under the same deployment.

Proposal - what do you suggest to solve the problem or improve the existing situation?

We'd like to add a configuration flag, similar to the --es.index-prefix flag, that takes a collection of prefixes and lets the user select one of them before querying, using some kind of dropdown in the UI (the dropdown would only be displayed if this configuration exists).
A real-life example: an organization that wants to ship its traces based on the current environment (production / staging).
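The proposal can be sketched roughly as follows. This is only an illustration, not Jaeger code: it assumes a hypothetical comma-separated flag (called `--es.index-prefixes` here, which does not exist in Jaeger), and the helper names `parseIndexPrefixes`, `selectPrefix` and `showDropdown` are made up for this example.

```go
package main

import (
	"fmt"
	"strings"
)

// parseIndexPrefixes splits the value of a hypothetical
// --es.index-prefixes flag into an ordered list of unique prefixes.
// Empty items and duplicates are dropped.
func parseIndexPrefixes(raw string) []string {
	seen := make(map[string]bool)
	var prefixes []string
	for _, p := range strings.Split(raw, ",") {
		p = strings.TrimSpace(p)
		if p == "" || seen[p] {
			continue
		}
		seen[p] = true
		prefixes = append(prefixes, p)
	}
	return prefixes
}

// selectPrefix checks the prefix chosen in the UI against the configured
// list, so that a query can never target an arbitrary index.
func selectPrefix(configured []string, chosen string) (string, error) {
	for _, p := range configured {
		if p == chosen {
			return p, nil
		}
	}
	return "", fmt.Errorf("index prefix %q is not configured", chosen)
}

// showDropdown mirrors the rule from the proposal: the UI only shows
// the selector when the configuration is present.
func showDropdown(configured []string) bool {
	return len(configured) > 0
}
```

With a flag value of `production,staging`, the UI would offer those two entries, and `selectPrefix` would reject any other value sent by the client.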


yurishkuro · 2021-01-13T19:12:39Z

I have conceptual problems with this approach:

ES indices are an implementation detail of the storage backend, not a concept that UI users need to be aware of.
ES indices are just one possible implementation, whereas "tenancy" could just as well apply to the Cassandra backend, e.g. by using different namespaces.

I don't think we can achieve the desired result via this shortcut. The UI needs a proper notion of tenancy, expressed in terms that are natural to the end users. The notion of "tenant" could be multi-dimensional, e.g. in your example it was prod vs. staging (one dimension), but some users may have other dimensions, e.g. environment + department. A specific tenant (by concretizing all possible dimensions) can map to a certain configuration of the backend, such as ES index prefix.
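To make the idea of a multi-dimensional tenant more concrete, here is a minimal sketch. It assumes that a tenant is a set of dimension values and that each fully specified tenant maps to one backend configuration. The types `Tenant`, `BackendConfig` and `TenantRegistry` are hypothetical and do not exist in Jaeger.

```go
package main

import (
	"fmt"
	"sort"
	"strings"
)

// Tenant assigns one value to each dimension,
// e.g. {"environment": "prod", "department": "payments"}.
type Tenant map[string]string

// BackendConfig is a hypothetical slice of storage settings that a
// concrete tenant maps to.
type BackendConfig struct {
	StorageType string
	IndexPrefix string
}

// TenantRegistry maps fully specified tenants to backend configurations.
type TenantRegistry struct {
	dimensions []string // sorted dimension names
	backends   map[string]BackendConfig
}

func NewTenantRegistry(dimensions ...string) *TenantRegistry {
	sorted := append([]string(nil), dimensions...)
	sort.Strings(sorted)
	return &TenantRegistry{dimensions: sorted, backends: make(map[string]BackendConfig)}
}

// key checks that t sets every known dimension (and nothing else) and
// returns a canonical string that is independent of map iteration order.
func (r *TenantRegistry) key(t Tenant) (string, error) {
	if len(t) != len(r.dimensions) {
		return "", fmt.Errorf("tenant must set exactly these dimensions: %v", r.dimensions)
	}
	parts := make([]string, 0, len(r.dimensions))
	for _, dim := range r.dimensions {
		value, ok := t[dim]
		if !ok || value == "" {
			return "", fmt.Errorf("tenant is missing dimension %q", dim)
		}
		parts = append(parts, dim+"="+value)
	}
	return strings.Join(parts, ","), nil
}

// Register associates a concrete tenant with a backend configuration.
func (r *TenantRegistry) Register(t Tenant, cfg BackendConfig) error {
	k, err := r.key(t)
	if err != nil {
		return err
	}
	r.backends[k] = cfg
	return nil
}

// Resolve returns the backend configuration for a concrete tenant.
func (r *TenantRegistry) Resolve(t Tenant) (BackendConfig, error) {
	k, err := r.key(t)
	if err != nil {
		return BackendConfig{}, err
	}
	cfg, ok := r.backends[k]
	if !ok {
		return BackendConfig{}, fmt.Errorf("no backend configured for tenant %s", k)
	}
	return cfg, nil
}
```

The UI would only ever deal with dimension names and values; the ES index prefix stays hidden behind `Resolve`.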

I think this more generic approach is not that much more difficult to implement, but it is significantly more flexible.

Caveat: there are some implementation details of the query service that may still be tricky regardless of how the frontend tenancy is handled. For example, the Uber deployment implemented a self-refreshing cache of service names, because loading 3k entries from the storage on every UI load was taking too long. I don't think this was implemented generically in the OSS version (we still have an open ticket, #1743), but this is an example of functionality that will need to be aware of the tenancy.
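As an illustration of what "aware of the tenancy" could mean for such a cache, here is a minimal sketch. It is not the Uber implementation: it uses a simpler lazily refreshed, TTL-based cache keyed by tenant rather than a background self-refreshing one, and the names `ServiceCache` and `ServiceLoader` are hypothetical.

```go
package main

import (
	"sync"
	"time"
)

// ServiceLoader fetches the service names of one tenant from storage.
type ServiceLoader func(tenant string) ([]string, error)

type cacheEntry struct {
	services []string
	loadedAt time.Time
}

// ServiceCache keeps one list of service names per tenant and reloads
// a list once it is older than ttl.
type ServiceCache struct {
	mu      sync.Mutex
	ttl     time.Duration
	load    ServiceLoader
	entries map[string]cacheEntry
}

func NewServiceCache(ttl time.Duration, load ServiceLoader) *ServiceCache {
	return &ServiceCache{ttl: ttl, load: load, entries: make(map[string]cacheEntry)}
}

// Services returns the cached names for the tenant, loading them on the
// first request or when the cached copy has expired. The lock is held
// during the load for simplicity, which serializes loads across tenants.
func (c *ServiceCache) Services(tenant string) ([]string, error) {
	c.mu.Lock()
	defer c.mu.Unlock()
	if e, ok := c.entries[tenant]; ok && time.Since(e.loadedAt) < c.ttl {
		return e.services, nil
	}
	services, err := c.load(tenant)
	if err != nil {
		return nil, err
	}
	c.entries[tenant] = cacheEntry{services: services, loadedAt: time.Now()}
	return services, nil
}
```

The point of the sketch is the cache key: a single global list of service names would leak one tenant's services into another tenant's UI.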

jkowall · 2021-01-14T14:42:22Z

This can get really complex, especially without using a database to manage configs, which I am sure we wanted to avoid. Also, having dynamic names for the dimensions becomes difficult from a UI perspective. I do think having tenants and environments would give a lot of flexibility and likely meet 95% of the requirements that come up. We could have the environment selector in the search, which would provide the necessary division of "services".

yurishkuro · 2021-01-14T16:02:47Z

I think in the first implementation the tenancy should be described in a config file, not a database.

jkowall · 2021-01-14T18:28:50Z

That just wouldn't work for us, since we have thousands or tens of thousands of tenants and we'd have to change the config files on the fly, which is not ideal. I'll let @albertteoh chime in when he's back at the keys.

yurishkuro · 2021-01-14T18:37:54Z

So then you do need a database for tenants :-) Not sure how to interpret your previous comment.

I think either way, we'd need an API on the query service for accessing this data, which can be backed by a set of configs or by a database.
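One way to picture such an API is sketched below, assuming a small interface that both a config-backed and a database-backed implementation could satisfy. Everything here is hypothetical: the `TenantSource` interface, the `StaticTenantSource` type and the JSON shape are illustrations, not an existing Jaeger API.

```go
package main

import (
	"encoding/json"
	"net/http"
)

// TenantSource abstracts where the tenant list comes from. A
// database-backed implementation would satisfy the same interface.
type TenantSource interface {
	Tenants() ([]string, error)
}

// StaticTenantSource serves tenants that were read from a config file.
type StaticTenantSource []string

func (s StaticTenantSource) Tenants() ([]string, error) {
	return []string(s), nil
}

// tenantsResponse is the hypothetical JSON payload returned to the UI.
type tenantsResponse struct {
	Tenants []string `json:"tenants"`
}

// tenantsJSON renders the tenant list of any source as JSON.
func tenantsJSON(src TenantSource) ([]byte, error) {
	tenants, err := src.Tenants()
	if err != nil {
		return nil, err
	}
	return json.Marshal(tenantsResponse{Tenants: tenants})
}

// TenantsHandler exposes the list over HTTP; the route it is mounted on
// is up to the caller.
func TenantsHandler(src TenantSource) http.Handler {
	return http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		body, err := tenantsJSON(src)
		if err != nil {
			http.Error(w, err.Error(), http.StatusInternalServerError)
			return
		}
		w.Header().Set("Content-Type", "application/json")
		w.Write(body)
	})
}
```

Because the handler only depends on `TenantSource`, swapping the config file for a database later would not change the API seen by the UI.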

jpkrohling · 2021-01-15T10:34:36Z

we have thousands or tens of thousands of tenants

If they are all configured following a pattern, the multi-tenancy proposal would cover this by applying a generic configuration that uses the tenant value from the bearer token. Like this:

tenants: [] # no special rules for individual tenants
default: # all tenants follow the same pattern
  storageType: elasticsearch
  es:
    server-urls: "big-cluster.es.acme.example.com"
    index-prefix: "jaeger-%s" # %s is replaced by the tenant name

The token might also include a membership list that can act as a list of tenants that this user has access to. The UI then can use the information from the token to allow the user to select which tenant to use for the current query.
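A rough sketch of how the pattern and the membership list could fit together is shown below. It assumes the bearer token has already been verified and decoded into a `Claims` value; that type, its fields and the function names are hypothetical, and only the `jaeger-%s` template comes from the configuration above.

```go
package main

import (
	"fmt"
	"strings"
)

// Claims holds the tenancy data assumed to be carried by a verified
// bearer token.
type Claims struct {
	Tenant      string   // default tenant of the user
	Memberships []string // tenants the user may select in the UI
}

// CanAccess reports whether the user may query the given tenant.
func (c Claims) CanAccess(tenant string) bool {
	if tenant == c.Tenant {
		return true
	}
	for _, m := range c.Memberships {
		if m == tenant {
			return true
		}
	}
	return false
}

// IndexPrefixFor replaces the single %s in the template with the tenant
// name. Plain string replacement is used instead of fmt.Sprintf so that
// a tenant name is never interpreted as a format string.
func IndexPrefixFor(template, tenant string) (string, error) {
	if strings.Count(template, "%s") != 1 {
		return "", fmt.Errorf("template %q must contain exactly one %%s", template)
	}
	if tenant == "" || strings.ContainsAny(tenant, "%*/ ") {
		return "", fmt.Errorf("invalid tenant name %q", tenant)
	}
	return strings.Replace(template, "%s", tenant, 1), nil
}

// ResolvePrefix picks the tenant selected in the UI (or the default
// tenant when nothing was selected), checks access, and returns the
// index prefix to query.
func ResolvePrefix(template string, claims Claims, selected string) (string, error) {
	if selected == "" {
		selected = claims.Tenant
	}
	if !claims.CanAccess(selected) {
		return "", fmt.Errorf("access to tenant %q is not allowed", selected)
	}
	return IndexPrefixFor(template, selected)
}
```

For a user whose token lists `acme` and `acme-staging`, selecting `acme-staging` in the UI would yield the prefix `jaeger-acme-staging`, while any tenant outside the list would be rejected before a query is built.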

jurgenweber · 2021-05-06T11:49:17Z

However you tackle this, the idea is great and sorely needed. :)

github-actions bot added the needs-triage label Jan 13, 2021

jpkrohling removed the needs-triage label Feb 2, 2021
