Reduce network bandwidth with log variables #95025
Labels
:Distributed/CRUD
A catch all label for issues around indexing, updating and getting a doc by id. Not search.
>enhancement
feedback_needed
Team:Distributed
Meta label for distributed team
Description
Issue to resolve:
Cloud ingestion is expensive! I think we can reduce our ESS ingest costs significantly with a little work. My first test reduced log ingest by 80%!
This will also help reduce traffic in buildings/datacenters with many logs being created and a single pipe out of the building
How:
Most logs consist of the same data, only a few variables are changed between logs. Timestamp, CPU, packets sent, alert name, etc. All other data in the log is always the same
What if we only send those variables across the network. This can be done in many ways, below is a method. Not that it is the best method but simply to show what I am thinking
Attached is an example:
The "Raw JSON" is 5000B and the "Variable JSON" is 612 Bytes or 87% smaller
The pdf is the "Raw JSON" file but with variable lines highlighted to show what was and what was not part of the variable JSON
The Variable JSON starts with a line defining the template it is utilizing. Everything else in the JSON are variables. I imagine that I missed a few lines required to really make this work. I am sure that my above process may be faulty but the general idea is here. The general idea is to send over the changing variables in logs, not the whole log
rawauditbeat.pdf
variablesjson.txt
rawauditbeat.txt
The text was updated successfully, but these errors were encountered: