Integrate filters #165

Open
szareiangm opened this issue Jun 30, 2020 · 0 comments


KCL has a useful feature that we are missing: filters.

We are currently using AllPassFilter:

These are my proposals, all of which we are using successfully at The Globe and Mail:

  • Event Latency Filter: Sometimes the tracker sends events late, or a bug leaves the event with a date like 1970. Storing such an event doesn't make any sense, so we can just ignore it.
  • Field Value Filter: For example, filtering events based on the event's app_id. We have a lot of stakeholders with different interests, and this way we send each of them only the data from the source they need.
  • Deduplication Filter: We can select a field, e.g. event_fingerprint, and keep a cache of recent event fingerprints. If we have seen a fingerprint recently, we discard the event. This helped us send 5% fewer duplicated events to our Elasticsearch/Postgres/Kinesis emitter modules.
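
A rough sketch of how the three filters above could behave, written in Python for brevity. Everything here is an assumption made for illustration: the class names, the `keep_record` method, the `timestamp` field (epoch seconds), and the cache size are hypothetical and are not taken from KCL or from this project's code.

```python
import time
from collections import OrderedDict
from typing import Any, Callable, Dict, Iterable, Set

Event = Dict[str, Any]  # hypothetical event shape: a flat dict of fields


class LatencyFilter:
    """Drop events whose timestamp is older than max_age_seconds."""

    def __init__(self, max_age_seconds: float, field: str = "timestamp",
                 clock: Callable[[], float] = time.time) -> None:
        self.max_age_seconds = max_age_seconds
        self.field = field      # assumed to hold epoch seconds
        self.clock = clock      # injectable so the filter can be tested

    def keep_record(self, event: Event) -> bool:
        ts = event.get(self.field)
        if not isinstance(ts, (int, float)):
            return False        # no usable timestamp: treat as invalid
        return self.clock() - ts <= self.max_age_seconds


class FieldValueFilter:
    """Keep only events whose field value is in an allowed set."""

    def __init__(self, field: str, allowed: Set[Any]) -> None:
        self.field = field
        self.allowed = allowed

    def keep_record(self, event: Event) -> bool:
        return event.get(self.field) in self.allowed


class DeduplicationFilter:
    """Drop events whose fingerprint is in a bounded cache of recent ones."""

    def __init__(self, field: str = "event_fingerprint",
                 capacity: int = 10_000) -> None:
        self.field = field
        self.capacity = capacity
        self._seen: "OrderedDict[Any, None]" = OrderedDict()

    def keep_record(self, event: Event) -> bool:
        key = event.get(self.field)
        if key is None:
            return True         # nothing to deduplicate on, so keep it
        if key in self._seen:
            self._seen.move_to_end(key)   # refresh recency of the duplicate
            return False
        self._seen[key] = None
        if len(self._seen) > self.capacity:
            self._seen.popitem(last=False)  # evict the oldest fingerprint
        return True


def keep(event: Event, filters: Iterable[Any]) -> bool:
    """Apply filters in order; stop at the first one that rejects."""
    return all(f.keep_record(event) for f in filters)
```

Putting the deduplication filter last in the chain means only events that passed the cheaper checks take up space in the fingerprint cache. The cache is bounded by count here; a time-based expiry would be an equally reasonable choice.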