-
Notifications
You must be signed in to change notification settings - Fork 68
[QQC-2355] Limit number of data rows to check for processing status at once #1218
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
wait_processing_max_seconds: int = _wait_processing_max_seconds, | ||
sleep_interval=30): | ||
""" Wait until all the specified data rows are processed""" | ||
start_time = datetime.now() |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Since the start_time is set to the beginning of this function, I think _poll_data_row_processing_status
may not wait the full wait_processing_max_seconds
per chunk
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
My understanding was that the wait_processing_max_seconds
was to be applied to the entire call for _wait_until_data_rows_are_processed
, and not the individual chunks. This should preserve original behavior
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Ah gotcha makes sense
|
||
max_data_rows_per_poll = 100_000 | ||
if data_row_ids is not None: | ||
for i in range(0, len(data_row_ids), max_data_rows_per_poll): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
nice use of step for chunking
labelbox/schema/project.py
Outdated
"Maximum wait time exceeded while waiting for data rows to be processed. Try creating a batch a bit later" | ||
) | ||
"Maximum wait time exceeded while waiting for data rows to be processed. " | ||
"Try creating a batch a bit later") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is it supposed to be a multi-line string? does not work? I though smth like
"Try creating a batch a bit later") | |
raise ProcessingWaitTimeout("""Maximum wait time exceeded while waiting for data rows to be processed. | |
Try creating a batch a bit later""") |
There is a limit of 100k data rows that can be checked at once for processing status. Split this call to the backend into chunks of 100k