Get job runs by timeframe #3833
Labels
feature-request
This issue requests a feature.
glue
investigating
This issue is being investigated and/or work is in progress to resolve the issue.
p3
This is a minor priority issue
Describe the feature
For glue service,get-job-runs method return all the job runs from the time the job is created and we have to filter the response basing on completed-on attribute. It will be nice to have get job runs accept time as an input parameter,so the response can be limited to fewer number of records.
Use Case
We need to calculate max concurrent dpu utilized in an account at some point in time, so basing on which we can calculate dpu concurrency and can request the quota for the account.
For that we had query glue service for job runs and it is returning all job runs from the time job is created. In an account where we more than thousand jobs and to retrieve job runs for each job from the time they are created ..the script takes a day to run!!! As the job get older the latency increases
Proposed Solution
Expose time as input parameter for glue Job runs. And remove if possible older job runs basing on some rention period.so we don’t get large response data to scan results.
Other Information
No response
Acknowledgements
SDK version used
Boto1.28.30
Environment details (OS name and version, etc.)
Mac
The text was updated successfully, but these errors were encountered: