Allow only certain whitelisted chars to be present in messages in order to avoid too much diversity #30

flaviostutz · 2021-05-12T17:12:20Z

In some cases the error-info message comes with variable parts in the message, like "message code" or "account code". As the message is a metrics label, each different string then creates whole new metrics, skyrocketing the number of different metrics (they are not summed), leading to memory leak on the client application, creating network bottlenecks and generating tons of data on Prometheus/Cortex or other infrastructure that will handle the metrics unnecessarily.

Some examples are "Account 546775 couldn't not be created", "There was an error processing your request. erroid=538-433.456", "There are 4 pending approvals for your profile".

Proposal

The proposal is to limit the permitted characters to a fixed whitelist through a regex expression, specially removing all numbers present in the message, thus avoiding the type of issues described as examples.

By applying the following regex cleanup code to the messages, the examples would become:

const regex = /[^A-zÀ-ú\s\.\,]+/ig;
p.replaceAll(regex, "Account 546775 couldn't not be created");

"Account couldn't not be created"
"There was an error processing your request. erroid=-."
"There are pending approvals for your profile"

This way, even if the client sets the error-info attribute with numeric variables we won't have metrics diversification as it will be handled as the same metrics/message (after transformation).

What do you think?

CarlosPanarello · 2021-05-13T22:40:16Z

We can use this regex for default, but it can be changed with some environment. But limited by 50 chars.

flaviostutz · 2021-05-14T01:01:12Z

Agree. Maybe init-param "info-regex"?

CarlosPanarello · 2021-05-17T11:01:34Z

what about error-info-regex? info-regex maybe is too generic, info about what?
This name will be relation only with error-info http header parameter.

flaviostutz · 2021-05-17T12:22:26Z

That’s perfect! Even makes it more aligned to the attribute name set in request (error-info).

…

Sent from my iPhone

On 17 May 2021, at 08:01, Carlos Eduardo Panarello ***@***.***> wrote: what about error-info-regex? info-regex maybe is too generic, info about what? This name will be relation only with error-info http header parameter. — You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub, or unsubscribe.

fixes #29 and fixes #30 - support custom regex to filter - support custom max value - support to select the correct group of the regex Signed-off-by: Luiz Oliveira <ziuloliveira@gmail.com>

Ziul · 2021-05-20T12:01:27Z

I created the PR #33 that accepts up to three environment variables: filter-regex, filter-index, and filter-max-size. With it, we can solve the problem about the max size of the message, apply a regex to the message and even get which matching group of the regex the user desires. To get more aligned with your discussion I can rename the variables to error-info-regex, error-info-index and error-info-max-size.

CarlosPanarello · 2021-05-21T22:47:48Z

I created the PR #33 that accepts up to three environment variables: filter-regex, filter-index, and filter-max-size. With it, we can solve the problem about the max size of the message, apply a regex to the message and even get which matching group of the regex the user desires. To get more aligned with your discussion I can rename the variables to error-info-regex, error-info-index and error-info-max-size.

it would be great if you did that.

flaviostutz added enhancement New feature or request good first issue Good for newcomers help wanted Extra attention is needed labels May 12, 2021

Ziul added a commit that referenced this issue May 19, 2021

Implements filter on ErrorMessage

d329437

fixes #29 and fixes #30 - support custom regex to filter - support custom max value - support to select the correct group of the regex Signed-off-by: Luiz Oliveira <ziuloliveira@gmail.com>

Ziul mentioned this issue May 20, 2021

Implements filter on ErrorMessage #33

Merged

flaviostutz closed this as completed in #33 Jun 8, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Allow only certain whitelisted chars to be present in messages in order to avoid too much diversity #30

Allow only certain whitelisted chars to be present in messages in order to avoid too much diversity #30

flaviostutz commented May 12, 2021

CarlosPanarello commented May 13, 2021

flaviostutz commented May 14, 2021

CarlosPanarello commented May 17, 2021

flaviostutz commented May 17, 2021 via email

Ziul commented May 20, 2021

CarlosPanarello commented May 21, 2021

Allow only certain whitelisted chars to be present in messages in order to avoid too much diversity #30

Allow only certain whitelisted chars to be present in messages in order to avoid too much diversity #30

Comments

flaviostutz commented May 12, 2021

Proposal

CarlosPanarello commented May 13, 2021

flaviostutz commented May 14, 2021

CarlosPanarello commented May 17, 2021

flaviostutz commented May 17, 2021 via email

Ziul commented May 20, 2021

CarlosPanarello commented May 21, 2021