
Fix unnecessary sync_memops cuda pointer attr in GDR #12170

Merged
merged 1 commit into from Aug 10, 2017

Conversation

@byronyi (Contributor) commented Aug 10, 2017

See the comments here. I've tested on my local boxes, and removing CU_POINTER_ATTRIBUTE_SYNC_MEMOPS slightly improves training throughput while introducing no data races.
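For context, a minimal sketch of the CUDA driver-API call being removed (not the actual TensorFlow GDR code; the `register_for_rdma` wrapper name is hypothetical, and compiling requires the CUDA toolkit and a GPU):

```c
#include <cuda.h>

/* Hypothetical wrapper illustrating the call this PR drops.
 * CU_POINTER_ATTRIBUTE_SYNC_MEMOPS forces synchronous memory
 * operations (e.g. cuMemcpy) on this allocation to be ordered
 * with respect to accesses from other devices, such as an RDMA
 * NIC. The PR's claim is that GDR's own synchronization already
 * provides the needed ordering, so the attribute only adds
 * per-transfer overhead. */
static CUresult register_for_rdma(CUdeviceptr ptr) {
  unsigned int sync_flag = 1;  /* boolean: enable SYNC_MEMOPS */
  return cuPointerSetAttribute(&sync_flag,
                               CU_POINTER_ATTRIBUTE_SYNC_MEMOPS,
                               ptr);
}
```

With the attribute removed, correctness relies on the GDR transport's explicit synchronization rather than on the driver serializing memory operations per pointer.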

@tensorflow-jenkins (Collaborator) commented:

Can one of the admins verify this patch?

@rmlarsen rmlarsen requested a review from zheng-xq August 10, 2017 15:41
@rmlarsen rmlarsen requested review from poxvoculi and removed request for zheng-xq August 10, 2017 15:42
@rmlarsen rmlarsen assigned poxvoculi and unassigned zheng-xq Aug 10, 2017
@rmlarsen (Member) commented:

@tensorflow-jenkins test this please

@rmlarsen (Member) commented:

Test failure is unrelated.

@rmlarsen rmlarsen merged commit 703fd44 into tensorflow:master Aug 10, 2017

6 participants