Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add custom-data processing #991

Merged
merged 6 commits into from
Jun 24, 2024
Merged

Conversation

PsypherPunk
Copy link
Collaborator

@PsypherPunk PsypherPunk commented Jun 21, 2024

Context

adds custom-data message processing.

AB#213664

Change proposed in this pull request

  • refactor messages into an explicit enum (stricter typing would be nice but there's no defined "default" message format…yet).
  • I've refactored the comparator-set processing to just skip over the existing logic if a target_urn is passed, mostly to reduce the impact of any future refactoring.
  • pre-processing largely uses the existing data, mapping the message's financial data over the already-extant pre-processed data.
  • central-services columns in the existing data are set to zero, where they exist.

Guidance to review

I've changed year to run_id in a lot of places; year largely serves as a run-identifier for the default job and I'd ended up with a lot of run_id or year logic in places. However, I've largely left the existing pre-processing as-is.

Checklist

  • Work items have been linked (use AB#)
  • Your code builds clean without any errors or warnings
  • You have run all unit/integration tests and they pass
  • Your branch has been rebased onto main
  • You have tested by running locally

for "default" and "default-with-user-defined-comparator-set" for now;
"custom" to follow.
re-use existing comparator-set logic but skip any processing for all but
the URN in question.
- add mappings for custom-data message to existing, pre-processed data;
- add pre-processing steps for custom data;
- update comparator-set/RAG processing to accommodate custom data.
- `run_id` is correct as per use; `year` _might_ the value used;
- add tests for new messages;
- omit `log.py` from coverage.
for "custom" data, only the financial information for the target URN
need be persisted in the DB.
for custom-data runs, central-services values must be zero.
Copy link
Collaborator

@jrabbott jrabbott left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me. But would be good to get another set of eyes on it

@PsypherPunk PsypherPunk merged commit 9a9df26 into main Jun 24, 2024
9 checks passed
@PsypherPunk PsypherPunk deleted the feature/213664/custom-data branch June 24, 2024 14:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants