Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor scoring so it can be done at signal collection time as well. #224

Merged
merged 29 commits into from
Oct 28, 2022

Conversation

calebbrown
Copy link
Contributor

This change moves scoring logic into a library that can be used from any binary. The collect_signals binary is now able to score the data directly.

This change helps support productionization and the deprecation of Python, by providing default scores as signals are collected, and by providing a replacement for the Python criticality_score binary.

This normalizes the logic of scorer and collect_signals.

If someone has multiple inputs to collect_signals they can be aggregated
by using cat. e.g: cat file1.txt file2.txt | collect_signals - out.csv

Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
This is to free up the name "collector", and to represent better that
each of the structs is meant to represent a single source of signals.

This is a part of larger refactoring work to support productionization
and a simpler CLI tool.

Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
This allows the eventual simple reuse of signal collection across
binaries.

Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
This is the last step in allowing two binaries two exist that can do
collection:
- a productionized worker for large scale collection
- a small scale CLI focused tool

Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
Signed-off-by: Caleb Brown <calebbrown@google.com>
@calebbrown calebbrown merged commit 82747b6 into main Oct 28, 2022
@calebbrown calebbrown deleted the workerize_3 branch October 28, 2022 01:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants