Remove PromoteTensorLoads pass, convert ExtractOp in TensorToFlow. #6852

Merged: 2 commits merged into iree-org:main on Aug 25, 2021

Conversation

ScottTodd (Member) commented on Aug 25, 2021:

Fixes #6756 (the TOSA if.mlir file now compiles successfully with -iree-flow-enable-linalg-detensorize after this change).

The PromoteTensorLoads pass was converting i1 loads into i8 loads using ZeroExtendIOp and TruncateIOp. That extra extend/truncate pair produced strange cycles during compilation when detensoring was applied, and flow ops handle i1 types fine as-is. We still need to handle i1 types when lowering to the HAL (where i1 storage is incompatible), both on the outside (the external interface) and on the inside (codegen). A rough sketch of the change is shown below.
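As an illustrative sketch only (the exact op spellings and IR produced by the 2021-era passes are assumed here, not copied from this PR): the old pass widened i1 tensor loads through i8, while the new TensorToFlow pattern converts tensor.extract directly and keeps the i1 element type.

```mlir
// Source IR: an element load from an i1 tensor.
%e = tensor.extract %t[%i] : tensor<4xi1>

// Old lowering (PromoteTensorLoads, sketched): widen to i8, load, truncate.
// The extra zero-extend/truncate pair is what interacted badly with detensoring.
%w    = zexti %t : tensor<4xi1> to tensor<4xi8>
%e_i8 = flow.tensor.load %w[%i] : tensor<4xi8>
%e    = trunci %e_i8 : i8 to i1

// New lowering (TensorToFlow, sketched): load the i1 element directly;
// flow ops are fine with i1 element types.
%e = flow.tensor.load %t[%i] : tensor<4xi1>
```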

@ScottTodd ScottTodd added the compiler/dialects Relating to the IREE compiler dialects (flow, hal, vm) label Aug 25, 2021
@google-cla google-cla bot added the cla: yes label Aug 25, 2021
iree-github-actions-bot (Contributor) commented:

Abbreviated Benchmark Summary

@ commit 434c94aa23bc5215ca73902a9cc3b948cba8fdde (vs. base ce7b475992b38588fe73d8a16b4d2ab64284f8ea)

Regressed Benchmarks 🚩

| Benchmark Name | Average Latency (ms) | Median Latency (ms) | Latency Standard Deviation (ms) |
| --- | --- | --- | --- |
| MobileNetV2 [fp32,imagenet] (TensorFlow) full-inference with IREE-Vulkan @ SM-G980F (GPU-Mali-G77) | 84 (vs. 62, 35.48%↑) | 86 | 5 |
| MobileNetV3Small [fp32,imagenet] (TensorFlow) 3-thread,big-core,full-inference with IREE-Dylib @ Pixel-4 (CPU-ARMv8.2-A) | 17 (vs. 15, 13.33%↑) | 17 | 0 |

Improved Benchmarks 🎉

| Benchmark Name | Average Latency (ms) | Median Latency (ms) | Latency Standard Deviation (ms) |
| --- | --- | --- | --- |
| MobileBertSquad [fp32] (TensorFlow) big-core,full-inference with IREE-Dylib-Sync @ Pixel-4 (CPU-ARMv8.2-A) | 738 (vs. 910, 18.90%↓) | 738 | 1 |


ScottTodd (Member, Author) commented:

Benchmarks regressed a bit. May need to diff the IR for MobileNetV2 through the Vulkan target to see what happened...

ScottTodd (Member, Author) commented:

> Benchmarks regressed a bit. May need to diff the IR for MobileNetV2 through the Vulkan target to see what happened...

IR diff shows no change. Benchmark results appear to be in the noise on closer inspection.

@ScottTodd ScottTodd merged commit d90f0fc into iree-org:main Aug 25, 2021
@ScottTodd ScottTodd deleted the promote-tensor-loads branch August 25, 2021 18:16
@hanhanW hanhanW mentioned this pull request Aug 25, 2021
copybara-service bot pushed a commit that referenced this pull request Aug 25, 2021
* 30463f1 Adding #util.composite attribute. (#6854)
* 7a1c579 Implement function.py return type coercion (#6832)
* d90f0fc Remove PromoteTensorLoads pass, convert ExtractOp in TensorToFlow. (#6852)
* 7fa8c20 Update SwiftShader to 2021-08-25 (#6859)
* a328761 Rename Bazel repo iree_vulkan_headers to vulkan_headers (#6862)
* 7660e49 Merge pull request #6856 from google/benvanik-shared-target-backend
* ebc5eb5 Removing the pipeline caching during executable translation. (#6855)
* 2fb7a8d Reapply "Update TFLite concrete function conversion codes" (#6800)
* ce7b475 NFC: Merge ConvertToFlow passes into dedicated before/after passes. (#6850)
* 9fce2d6 Support f32 in the VM by default in the compiler. (#6744)
* b6baea9 Support global ref ops and fix passing of refs on function boundaries in the C..
* 4b5d2ed Bump flatcc version (#6853)
* c8a8f5d Add e2e tests for mhlo.bitcast_convert (#6846)

COPYBARA_INTEGRATE_REVIEW=#6865 from hanhanW:main-to-google 30463f1
PiperOrigin-RevId: 392952973
iree-github-actions-bot pushed a commit that referenced this pull request Aug 25, 2021
Merging this pull request may close: Lowering of i1 types conflicts with detensoring (#6756)