[random] support random fill #5913

Merged: 3 commits merged into apache:master from FrozenGene:non_empty on Aug 17, 2020

Conversation

@FrozenGene (Member)

This PR lets us allocate NDArrays with non-empty values, which addresses the issue of incorrect AutoTVM measurements (see #5200). It is one standalone part of Ansor (#5883).

@kevinthesun @merrymercy @minminsun @jcf94

Review threads on golang/Makefile, src/runtime/ndarray.cc, and python/tvm/runtime/ndarray.py (outdated, resolved)
@tqchen (Member)

tqchen commented Jun 24, 2020

See the comments: we don't need to expose the (non_empty) function as a primitive function, given that it is not a NumPy function and is only used in AutoTVM. Instead, we can achieve the same goal with:

    x = nd.empty(..)
    random_fill = get_packed_func("contrib.random.random_fill")
    random_fill(x)

Notably, the above solution is better because:

  • Minimum interface exposure (non-AutoTVM devs don't need to be aware of the change)
  • Random initialization is performed on the device; the current impl actually results in random numbers being generated on the host and then transferred to the device.
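
For reference, a minimal runnable sketch of the suggestion above, assuming the packed function is registered as "tvm.contrib.random.random_fill" (the name used by the merged implementation) and that the TVM build enables the contrib random module:

    import numpy as np
    import tvm

    # Allocate an uninitialized NDArray on the target device.
    x = tvm.nd.empty((512, 512), dtype="float32", ctx=tvm.cpu(0))

    # Look up the registered packed function and fill the array in place.
    random_fill = tvm.get_global_func("tvm.contrib.random.random_fill")
    random_fill(x)

    # The buffer is no longer all zeros.
    assert np.abs(x.asnumpy()).sum() > 0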

Review thread on src/runtime/ndarray.cc (outdated, resolved)
@tqchen tqchen self-assigned this Jun 24, 2020
@tqchen tqchen added the status: need update label Jun 24, 2020
@junrushao (Member)

I agree with TQ's point. Also, I am a bit concerned about whether we really need a C API for this.

@FrozenGene (Member, Author)

I will address it per the comments.

@tqchen (Member)

tqchen commented Jul 10, 2020

@FrozenGene please follow up

@FrozenGene (Member, Author)

FrozenGene commented Jul 10, 2020 via email

@FrozenGene (Member, Author)

FrozenGene commented Jul 17, 2020

> See the comments: we don't need to expose the (non_empty) function as a primitive function, given that it is not a NumPy function and is only used in AutoTVM. Instead, we can achieve the same goal with:
>
>     x = nd.empty(..)
>     random_fill = get_packed_func("contrib.random.random_fill")
>     random_fill(x)
>
> Notably, the above solution is better because:
>
>   • Minimum interface exposure (non-AutoTVM devs don't need to be aware of the change)
>   • Random initialization is performed on the device; the current impl actually results in random numbers being generated on the host and then transferred to the device.

@tqchen I have restarted this work. This way we can perform the random initialization on the device (i.e. AllocWorkspace uses the correct device API). But as far as I know, we still cannot avoid generating random numbers on the host unless we call device-specific APIs like cuRAND (for NVIDIA GPUs). That is to say, we still have to generate random numbers on a CPU and copy them to the device. However, we can accomplish the goal of generating random numbers on the remote CPU (in RPC mode) and copying the data from the remote CPU (e.g. ARM) to the remote GPU (e.g. Mali) directly, rather than taking the path x86 CPU -> ARM CPU -> Mali GPU.
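
A hedged sketch of the RPC path described above (host, port, and device are placeholder assumptions): the fill runs entirely on the remote side, so data moves CPU@remote -> GPU@remote with no host-to-remote transfer:

    import tvm
    from tvm import rpc

    # Connect to a hypothetical remote board running a TVM RPC server.
    remote = rpc.connect("192.168.1.100", 9090)
    ctx = remote.cl(0)  # remote OpenCL context, e.g. a Mali GPU

    # Allocate on the remote GPU, then fill it via the packed function
    # executed on the remote side (random data generated on the remote
    # CPU and copied device-locally to the GPU).
    x = tvm.nd.empty((256, 256), dtype="float32", ctx=ctx)
    random_fill = remote.get_function("tvm.contrib.random.random_fill")
    random_fill(x)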

@merrymercy (Member)

merrymercy commented Jul 27, 2020

@FrozenGene Please follow up. It is okay to do the path CPU@remote_device -> GPU@remote_device for now, as long as there is no RPC communication cost (i.e. no local_device -> remote_device transfer).
I remember we tried to do this in our internal repo but failed. What was the problem at that time?

@tqchen (Member)

tqchen commented Aug 6, 2020

Ping

@FrozenGene (Member, Author)

> @FrozenGene Please follow up. It is okay to do the path CPU@remote_device -> GPU@remote_device for now, as long as there is no RPC communication cost (i.e. no local_device -> remote_device transfer).
> I remember we tried to do this in our internal repo but failed. What was the problem at that time?

@merrymercy Our current method introduces a dummy CPU context on the remote side and passes the data to the remote target (like OpenCL or CUDA). What we previously wanted to do was generate non-empty data directly on the remote target, but that failed.

@tqchen's suggestion lets us leverage the empty interface and fill data into the allocated tensor, avoiding a new non_empty API in the C / NDArray interface, while generating the random data directly on the remote device. My previous comment was to point out that we may still have to go through a CPU context, as in our current approach.

I will follow up on my PR to move our implementation to contrib/random/random.cc and enable it unconditionally, since our auto-scheduler's local builder / local runner also rely on it (not just RPC). A sketch of what that local, non-RPC usage could look like is shown below.
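
A sketch of the local measurement side under that plan, assuming the global function is always compiled in; the shapes and the loop are illustrative only:

    import tvm

    # allow_missing=True returns None instead of raising if this TVM
    # build lacks the contrib random module.
    random_fill = tvm.get_global_func("tvm.contrib.random.random_fill",
                                      allow_missing=True)
    assert random_fill is not None, "random_fill not available in this build"

    # Fill each measurement input with random data instead of zeros so
    # that timing results are not skewed by trivially-empty buffers.
    for shape in [(1024,), (64, 64)]:
        arg = tvm.nd.empty(shape, dtype="float32", ctx=tvm.cpu(0))
        random_fill(arg)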

@FrozenGene FrozenGene force-pushed the non_empty branch 2 times, most recently from de37081 to 696fdd9, on August 10, 2020 09:44
@FrozenGene FrozenGene changed the title [ndarray][autotvm] support ndarray.non_empty [random] support random fill Aug 10, 2020
@FrozenGene (Member, Author)

FrozenGene commented Aug 11, 2020


@merrymercy @tqchen I have updated the code and verified it on a remote CPU / remote Mali GPU. We can do CPU@remote_device -> GPU@remote_device directly, not CPU@host -> CPU@remote_device -> GPU@remote_device.

@FrozenGene FrozenGene requested a review from tqchen August 12, 2020 06:07
@jcf94 (Contributor) left a comment


Looks good.
So we will have a later PR to add clflush & random_fill to AutoTVM & the auto-scheduler?

@FrozenGene (Member, Author)

> Looks good.
> So we will have a later PR to add clflush & random_fill to AutoTVM & the auto-scheduler?

Yes.

@FrozenGene (Member, Author)

@tqchen @merrymercy gentle ping. The code has been updated.

@FrozenGene (Member, Author)

@tqchen @merrymercy @comaniac do you have other comments?

@tqchen tqchen merged commit f731652 into apache:master Aug 17, 2020
@tqchen (Member)

tqchen commented Aug 17, 2020

Thanks @FrozenGene @comaniac @merrymercy

@tqchen tqchen added the status: accepted label and removed the status: need update label Aug 17, 2020
@FrozenGene FrozenGene deleted the non_empty branch August 18, 2020 04:16
trevor-m pushed 3 commits to trevor-m/tvm that referenced this pull request Aug 26, 2020
@merrymercy (Member)

merrymercy commented Aug 27, 2020

@FrozenGene Can you send the follow-up PRs to enable this in Ansor and AutoTVM?

@FrozenGene (Member, Author)

Thanks for the reminder, @merrymercy. My agenda is completely full tomorrow and over the weekend. I can do this next week.
