-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Use encodings from db instead of minio #522
Conversation
Fix up opentracing spans
Remove unused ijson import
Remove unused ijson import Remove convert_encodings_from_json_to_binary and unused serialization.
Also ensure encodings are returned in order
b7781f8
to
631aa67
Compare
# retrieval of encoding ids should be much faster than insertion | ||
assert fetch_time < elapsed_time | ||
|
||
assert id_fetch_time < elapsed_time |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
these timing assertions might not always hold. (e.g. if there is a load spike in the db) So should we really fail if it goes wrong once?
|
||
parent_span = g.flask_tracer.get_span() | ||
|
||
def precheck_encoding_upload(project_id, headers, parent_span): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
docstring would be nice
Builds ontop of PR that fixes Jaeger: #523This PR completes the switch from storing encodings in minio to postgres. It makes several short term assumptions such as assuming there will be one default block per data provider and that encoding ids will be sequential non-repeating integers. These assumptions will be addressed over the next several sprints.
Chunking information previously included object store filenames, now each chunk comprises a data provider ID and a range of encoding ids. E.g.
dp_id=45, range=[2000,5000]
, the comparison task now queries the database to fetch these encodings (currently ignoring blocks).Tracing child spans was not working inside of celery tasks so I've fixed the tasks I touched.