[Prototype] [PyTorch Edge] Speed up model loading by 12% by directly calling the C file API from FileAdapter #61997

dhruvbird · 2021-07-21T22:16:45Z

Stack from ghstack:

-> [Prototype] [PyTorch Edge] Speed up model loading by 12% by directly calling the C file API from FileAdapter #61997
[RFC] [PyTorch Edge] Cache operator lambda during model loading [7% faster model loading] #61996
[WIP] [PyTorch Edge] Add test for lite interpreter model caching #62306
[PyTorch Edge] Add test_lite_interpreter to fbsource xplat BUCK files #62305

After profiling the model loading latency on AI Bench (Android Galaxy S8 US), it seems like a significant amount of time was spent reading data using FileAdapter, which internally calls IStreamAdapter. However, IStreamAdapter uses std::istream under the hood, which is not that efficient. This change reduces the model loading time from ~293ms to ~254ms, which is a reduction of ~12%.

Differential Revision: D29812191

…calling the C file API from FileAdapter After profiling the model loading latency on AI Bench (Android Galaxy S8 US), it seems like a significant amount of time was spent reading data using FileAdapter, which internally calls IStreamAdapter. However, IStreamAdapter uses `std::istream` under the hood, which is not that efficient. This change reduces the model loading time from [~293ms](https://www.internalfb.com/intern/aibench/details/600870874797229) to [~254ms](https://www.internalfb.com/intern/aibench/details/163731416457694), which is a reduction of ~12%. Differential Revision: [D29812191](https://our.internmc.facebook.com/intern/diff/D29812191/) [ghstack-poisoned]

facebook-github-bot · 2021-07-21T22:16:51Z

🔗 Helpful links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/61997
📄 Preview docs built from this PR

💊 CI failures summary and remediations

As of commit 19c3dfc (more details on the Dr. CI page):

💚 💚 Looks good so far! There are no failures yet. 💚 💚

This comment was automatically generated by Dr. CI (expand for details).

Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

…y directly calling the C file API from FileAdapter" After profiling the model loading latency on AI Bench (Android Galaxy S8 US), it seems like a significant amount of time was spent reading data using FileAdapter, which internally calls IStreamAdapter. However, IStreamAdapter uses `std::istream` under the hood, which is not that efficient. This change reduces the model loading time from [~293ms](https://www.internalfb.com/intern/aibench/details/600870874797229) to [~254ms](https://www.internalfb.com/intern/aibench/details/163731416457694), which is a reduction of ~12%. Differential Revision: [D29812191](https://our.internmc.facebook.com/intern/diff/D29812191/) [ghstack-poisoned]

…calling the C file API from FileAdapter Pull Request resolved: #61997 After profiling the model loading latency on AI Bench (Android Galaxy S8 US), it seems like a significant amount of time was spent reading data using FileAdapter, which internally calls IStreamAdapter. However, IStreamAdapter uses `std::istream` under the hood, which is not that efficient. This change reduces the model loading time from [~293ms](https://www.internalfb.com/intern/aibench/details/600870874797229) to [~254ms](https://www.internalfb.com/intern/aibench/details/163731416457694), which is a reduction of ~12%. ghstack-source-id: 134206924 Differential Revision: [D29812191](https://our.internmc.facebook.com/intern/diff/D29812191/)

…y directly calling the C file API from FileAdapter" After profiling the model loading latency on AI Bench (Android Galaxy S8 US), it seems like a significant amount of time was spent reading data using FileAdapter, which internally calls IStreamAdapter. However, IStreamAdapter uses `std::istream` under the hood, which is not that efficient. This change reduces the model loading time from [~293ms](https://www.internalfb.com/intern/aibench/details/600870874797229) to [~254ms](https://www.internalfb.com/intern/aibench/details/163731416457694), which is a reduction of ~12%. Differential Revision: [D29812191](https://our.internmc.facebook.com/intern/diff/D29812191/) [ghstack-poisoned]

…calling the C file API from FileAdapter Pull Request resolved: #61997 After profiling the model loading latency on AI Bench (Android Galaxy S8 US), it seems like a significant amount of time was spent reading data using FileAdapter, which internally calls IStreamAdapter. However, IStreamAdapter uses `std::istream` under the hood, which is not that efficient. This change reduces the model loading time from [~293ms](https://www.internalfb.com/intern/aibench/details/600870874797229) to [~254ms](https://www.internalfb.com/intern/aibench/details/163731416457694), which is a reduction of ~12%. ghstack-source-id: 134223858 Differential Revision: [D29812191](https://our.internmc.facebook.com/intern/diff/D29812191/)

…calling the C file API from FileAdapter Pull Request resolved: pytorch/pytorch#61997 After profiling the model loading latency on AI Bench (Android Galaxy S8 US), it seems like a significant amount of time was spent reading data using FileAdapter, which internally calls IStreamAdapter. However, IStreamAdapter uses `std::istream` under the hood, which is not that efficient. This change reduces the model loading time from [~293ms](https://www.internalfb.com/intern/aibench/details/600870874797229) to [~254ms](https://www.internalfb.com/intern/aibench/details/163731416457694), which is a reduction of ~12%. ghstack-source-id: 134042630 Differential Revision: [D29812191](https://our.internmc.facebook.com/intern/diff/D29812191/)

…y directly calling the C file API from FileAdapter" After profiling the model loading latency on AI Bench (Android Galaxy S8 US), it seems like a significant amount of time was spent reading data using FileAdapter, which internally calls IStreamAdapter. However, IStreamAdapter uses `std::istream` under the hood, which is not that efficient. This change reduces the model loading time from [~293ms](https://www.internalfb.com/intern/aibench/details/600870874797229) to [~254ms](https://www.internalfb.com/intern/aibench/details/163731416457694), which is a reduction of ~12%. Differential Revision: [D29812191](https://our.internmc.facebook.com/intern/diff/D29812191/) [ghstack-poisoned]

…calling the C file API from FileAdapter Pull Request resolved: #61997 After profiling the model loading latency on AI Bench (Android Galaxy S8 US), it seems like a significant amount of time was spent reading data using FileAdapter, which internally calls IStreamAdapter. However, IStreamAdapter uses `std::istream` under the hood, which is not that efficient. This change reduces the model loading time from [~293ms](https://www.internalfb.com/intern/aibench/details/600870874797229) to [~254ms](https://www.internalfb.com/intern/aibench/details/163731416457694), which is a reduction of ~12%. ghstack-source-id: 134489363 Differential Revision: [D29812191](https://our.internmc.facebook.com/intern/diff/D29812191/)

…y directly calling the C file API from FileAdapter" After profiling the model loading latency on AI Bench (Android Galaxy S8 US), it seems like a significant amount of time was spent reading data using FileAdapter, which internally calls IStreamAdapter. However, IStreamAdapter uses `std::istream` under the hood, which is not that efficient. This change reduces the model loading time from [~293ms](https://www.internalfb.com/intern/aibench/details/600870874797229) to [~254ms](https://www.internalfb.com/intern/aibench/details/163731416457694), which is a reduction of ~12%. Differential Revision: [D29812191](https://our.internmc.facebook.com/intern/diff/D29812191/) [ghstack-poisoned]

…calling the C file API from FileAdapter Pull Request resolved: #61997 After profiling the model loading latency on AI Bench (Android Galaxy S8 US), it seems like a significant amount of time was spent reading data using FileAdapter, which internally calls IStreamAdapter. However, IStreamAdapter uses `std::istream` under the hood, which is not that efficient. This change reduces the model loading time from [~293ms](https://www.internalfb.com/intern/aibench/details/600870874797229) to [~254ms](https://www.internalfb.com/intern/aibench/details/163731416457694), which is a reduction of ~12%. ghstack-source-id: 134529654 Differential Revision: [D29812191](https://our.internmc.facebook.com/intern/diff/D29812191/)

…y directly calling the C file API from FileAdapter" After profiling the model loading latency on AI Bench (Android Galaxy S8 US), it seems like a significant amount of time was spent reading data using FileAdapter, which internally calls IStreamAdapter. However, IStreamAdapter uses `std::istream` under the hood, which is not that efficient. This change reduces the model loading time from [~293ms](https://www.internalfb.com/intern/aibench/details/600870874797229) to [~254ms](https://www.internalfb.com/intern/aibench/details/163731416457694), which is a reduction of ~12%. Differential Revision: [D29812191](https://our.internmc.facebook.com/intern/diff/D29812191/) [ghstack-poisoned]

…calling the C file API from FileAdapter Pull Request resolved: #61997 After profiling the model loading latency on AI Bench (Android Galaxy S8 US), it seems like a significant amount of time was spent reading data using FileAdapter, which internally calls IStreamAdapter. However, IStreamAdapter uses `std::istream` under the hood, which is not that efficient. This change reduces the model loading time from [~293ms](https://www.internalfb.com/intern/aibench/details/600870874797229) to [~254ms](https://www.internalfb.com/intern/aibench/details/163731416457694), which is a reduction of ~12%. ghstack-source-id: 134634610 Differential Revision: [D29812191](https://our.internmc.facebook.com/intern/diff/D29812191/)

facebook-github-bot · 2021-07-30T03:15:10Z

This pull request has been merged in 725d98b.

facebook-github-bot added the cla signed label Jul 21, 2021

This was referenced Jul 21, 2021

[RFC] [PyTorch Edge] Cache operator lambda during model loading [7% faster model loading] #61996

Closed

[Not For Landing/Review] [PyTorch Edge] Try to speed up module loading by reserving space in various vectors #61998

Closed

dhruvbird added 3 commits July 21, 2021 16:09

This was referenced Jul 28, 2021

[PyTorch Edge] Add test_lite_interpreter to fbsource xplat BUCK files #62305

Closed

[WIP] [PyTorch Edge] Add test for lite interpreter model caching #62306

Closed

facebook-github-bot closed this in 725d98b Jul 30, 2021

facebook-github-bot added the Merged label Jul 30, 2021

facebook-github-bot deleted the gh/dhruvbird/54/head branch August 2, 2021 14:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Prototype] [PyTorch Edge] Speed up model loading by 12% by directly calling the C file API from FileAdapter #61997

[Prototype] [PyTorch Edge] Speed up model loading by 12% by directly calling the C file API from FileAdapter #61997

Uh oh!

dhruvbird commented Jul 21, 2021 •

edited

Loading

Uh oh!

facebook-github-bot commented Jul 21, 2021 •

edited

Loading

Uh oh!

facebook-github-bot commented Jul 30, 2021

Uh oh!

Uh oh!

[Prototype] [PyTorch Edge] Speed up model loading by 12% by directly calling the C file API from FileAdapter #61997

[Prototype] [PyTorch Edge] Speed up model loading by 12% by directly calling the C file API from FileAdapter #61997

Uh oh!

Conversation

dhruvbird commented Jul 21, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

facebook-github-bot commented Jul 21, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful links

💊 CI failures summary and remediations

Uh oh!

facebook-github-bot commented Jul 30, 2021

Uh oh!

Uh oh!

dhruvbird commented Jul 21, 2021 •

edited

Loading

facebook-github-bot commented Jul 21, 2021 •

edited

Loading