Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

output: cache loaded tree objects #6301

Merged
merged 1 commit into from Jul 24, 2021
Merged

output: cache loaded tree objects #6301

merged 1 commit into from Jul 24, 2021

Conversation

efiop
Copy link
Member

@efiop efiop commented Jul 11, 2021

A quick naive fix that reduces dvc diff from hours to around ~20sec for a dataset of dozens of thousands of files. That is still bad, but in order to speed it up more, we need to revisit our .dir storage and switch to using staging in get_dir_cache. That will be revisited after followups for #6308

Related to #6173

Thank you for the contribution - we'll try to review it as soon as possible. πŸ™

@efiop efiop requested a review from a team as a code owner July 11, 2021 01:02
@efiop efiop requested a review from daavoo July 11, 2021 01:02
@efiop efiop force-pushed the fix-6173 branch 2 times, most recently from 52015c2 to 7bd469e Compare July 11, 2021 01:08
@efiop efiop changed the title [WIP] output: get rid of get_dir_cache [WIP] output: cache loaded objects Jul 11, 2021
@efiop efiop removed the request for review from daavoo July 11, 2021 01:10
@efiop efiop force-pushed the fix-6173 branch 6 times, most recently from 453a322 to 90f119f Compare July 22, 2021 22:35
@efiop efiop changed the title [WIP] output: cache loaded objects output: cache loaded objects Jul 22, 2021
@efiop efiop changed the title output: cache loaded objects output: cache loaded tree objects Jul 22, 2021
@efiop efiop merged commit 1e43c30 into iterative:master Jul 24, 2021
@efiop efiop added optimize Optimizes DVC performance improvement over resource / time consuming tasks labels Jul 24, 2021
@efiop efiop deleted the fix-6173 branch July 25, 2021 18:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
optimize Optimizes DVC performance improvement over resource / time consuming tasks
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant