AMD max. 4 GB per allocation workaround for 8GB cards #1977
Conversation
jean-m-cyr
commented
Mar 23, 2020
•
edited
- Run AMD in split DAG memory mode such that individual memory allocations do not exceed 4GB.
- DAG memory is allocated in two equal size parts. One for even index entries, and one for odd index entries.
- Update binary kernels to support split DAG
Will review but I have a question.
Tested on Polaris and Radeon VII
@AndreaLanfranchi There is no way to guarantee that two allocated blocks will be adjacent. Other users, such as a desktop GUI, can be allocating and freeing concurrently.
```c
g_dag = (__global hash128_t const*) _g_dag0; \
if (idx & 1) \
    g_dag = (__global hash128_t const*) _g_dag1; \
```
```diff
-g_dag = (__global hash128_t const*) _g_dag0; \
-if (idx & 1) \
-    g_dag = (__global hash128_t const*) _g_dag1; \
+if (!(idx & 1)) \
+    g_dag = (__global hash128_t const*) _g_dag0; \
+else \
+    g_dag = (__global hash128_t const*) _g_dag1; \
```
This should save an address translation and is semantically similar to the same test in DAG generation.
You could do it that way, but I tried it and saw no perceptible speed difference on a 480. There is no need for translation; _g_dag0 and _g_dag1 are already in the GPU context.
As far as I understand, this change imposes the split regardless of whether or not it's necessary. If we only had to maintain the .cl (source) kernel, a simple preprocessor directive would solve the problem. I understand the maintenance of binary files is a PITA.
I had it working with a compiler directive to control split vs. non-split mode, including for binary kernels.
- no measured speed difference between split and non-split.
- would double the number of binary kernels
I can hardly believe it: a conditional plus an index "re-index" for every thread is something. If @ddobreff is OK with the test, I'm also OK with it.
Voids the need for #1969
Non-split mode OpenCL (screenshot)
Split mode OpenCL (screenshot)
Split mode is actually faster!
I'm puzzled... anyway, I won't investigate. AMD and its driver weirdness have lost any interest for me. Bottom line: good job @jean-m-cyr
Where can we test ethminer with those changes on CUDA cards... GTX 1070/1080?
Changes are related to AMD OpenCL; Nvidia is not affected.
Nvidia cards have also been losing hashrate since November... the bigger the DAG, the less speed the cards deliver. The 1070, that is!