[VTA] Fix an issue in updating uop_idx in the TensorGemm module #4694

Merged 1 commit into apache:master on Jan 14, 2020

Conversation

liangfu (Member) commented Jan 13, 2020

This PR fixes an issue in updating the uop_idx counter in the TensorGemm module.

Problem

When evaluating deploy_vision_on_vta.py with the TSIM backend, the module must be run with

m.run()

However, it throws an instance of dmlc::Error, saying

virtual_memory.cc:48: Check failed: phy_addr != 0 (0 vs. 0) : trying to get address that is nullptr

The root cause is that the uop_idx counter increments unexpectedly.

This PR fixes the above issue and enables successful evaluation of deploy_vision_on_vta.py with TSIM backend.
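
For context, a minimal sketch of the evaluation flow this refers to, assuming the usual graph_runtime setup from deploy_vision_on_vta.py; the input name 'data', the image tensor, and params are illustrative placeholders, not taken from this PR:

from tvm.contrib import graph_runtime

# graph, lib and ctx come from the preceding build / remote-setup step;
# 'data', image and params are placeholders for the deploy script's inputs.
m = graph_runtime.create(graph, lib, ctx)
m.set_input(**params)
m.set_input('data', image)
m.run()                       # run the module exactly once
tvm_output = m.get_output(0)  # prediction result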

liangfu (Member, Author) commented Jan 13, 2020

@kevinyuan Can you help reproduce the results?

A hard requirement is that we should run the module only once with

m.run()

instead of running it in time_evaluator.
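
Concretely, the pattern to avoid versus the required one (a sketch using the same calls that appear later in this thread; m and ctx come from the setup above):

# To avoid for now: driving the module through time_evaluator
timer = m.module.time_evaluator("run", ctx, number=1, repeat=1)
timer()

# Required: a single direct run
m.run()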

In addition, I have a patch

--- a/vta/hardware/dpi/tsim_device.cc
+++ b/vta/hardware/dpi/tsim_device.cc
@@ -141,6 +141,8 @@ int VTADPISim() {
       tfp->dump(static_cast<vluint64_t>(trace_count * 2 + 1));
 #endif
     trace_count++;
+    if ((trace_count % 1000000) == 1)
+      fprintf(stderr, "[traced %dM cycles]\n", trace_count / 1000000);
     while (top->sim_wait) {
       top->clock = 0;
       std::this_thread::sleep_for(std::chrono::milliseconds(100));

to trace the number of cycles in TSIM.
It takes 31M cycles to run a resnet18_v1 prediction.

kevinyuan (Contributor) commented

Hi @liangfu,

With your recommended changes, tsim produced the same prediction results as fsim.

Can you explain why m.run() must be used instead of time_evaluator to generate the correct prediction result?

Furthermore, the cycle_count in "Execution statistics" is equal to 0. Is this expected, or can it be improved?

Thanks.

liangfu (Member, Author) commented Jan 14, 2020

Can you explain why m.run() must be used instead of time_evaluator to generate the correct prediction result?

time_evaluator treats the first run as a warm-up and ignores its timing results. The prediction results are then generated from the second run, which may use buffers contaminated by the first run.
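
As a sketch of why this bites (the warm-up behavior is as described above; the timer invocation and output read are illustrative):

timer = m.module.time_evaluator("run", ctx, number=1, repeat=1)
prof_res = timer()            # the first run serves only as warm-up and is not timed
tvm_output = m.get_output(0)  # read from a later run, whose buffers may have been
                              # contaminated by the warm-up run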

Furthermore, the cycle_count in "Execution statistics" is equal to 0. Is this expected or it can be improved?

Correct "Execution statistics" are printed when we use time_evaluator. The open problem is how to get correct prediction results with time_evaluator, instead of running the module directly.

cc @tmoreau89

tmoreau89 (Contributor) left a comment

Thank you @liangfu for the patch

@tmoreau89 tmoreau89 merged commit 5699637 into apache:master Jan 14, 2020
kevinyuan (Contributor) commented

Hi @liangfu,

Have you had a chance to figure out the root cause of "The problem is how to get correct prediction results with time_evaluator, instead of running the module directly"?

Also, I found that using the following wrong code #1 also generates a wrong prediction; it looks as if the inference was run with 11 passes, since I saw [traced 329M cycles] at the end, while the correct run shows [traced 29M cycles] at the end.

----------- wrong code #1 ------------

from tvm.contrib.debugger import debug_runtime as graph_runtime
m = graph_runtime.create(graph, lib, ctx)
m.run()

Output

...
[traced 329M cycles]

----------- wrong code #2 ------------

from tvm.contrib import graph_runtime, util, download
m = graph_runtime.create(graph, lib, ctx)
timer = m.module.time_evaluator("run", ctx, number=1, repeat=1)

Output

...
[traced 59M cycles]

----------- correct ---------

from tvm.contrib import graph_runtime, util, download
m = graph_runtime.create(graph, lib, ctx)
m.run()

Output

...
[traced 29M cycles]
