Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

OpenCL error in JCE that is fixed in XMRSTAK - on HD 6990, 7850 with 14.4 drivers #26

Open
xmronadaily opened this issue Oct 29, 2018 · 14 comments

Comments

@xmronadaily
Copy link

This was an issue also happening in XMR stak (fireice-uk/xmr-stak#1922), but it is fixed here fireice-uk/xmr-stak#1945 and fireice-uk/xmr-stak#1951, please incorporate the fix also to your miner.

14.4 drivers are the best when it comes to HD 6990s, other drivers don't detect the memory of the cards properly and therefore intensity cannot be set higher than what is maybe 10% of max card performance.

Let me know if any other details are needed!

Here is the error pasted:

For Windows 64-bits
Analyzing Processors topology...
AMD Athlon(tm) II X2 250 Processor
Assembly codename: generic
SSE2 : Yes
SSE3 : Yes
SSE4 : No
AES : No
AVX : No
AVX2 : No

Found CPU 0, with:
L1 Cache: 64 KB
L2 Cache: 1024 KB
Found CPU 1, with:
L1 Cache: 64 KB
L2 Cache: 1024 KB

Detecting OpenCL-capable GPUs...
Found GPU 0, with:
Vendor: AMD
Processor: Cayman
Device: 0b:00
Compute-Units: 24
Cache Memory: 0 KB
Local Memory: 32 KB
Global Memory: 2048 MB
Addressing: 32-bits
Found GPU 1, with:
Vendor: AMD
Processor: Cayman
Device: 0b:00
Compute-Units: 24
Cache Memory: 0 KB
Local Memory: 32 KB
Global Memory: 2048 MB
Addressing: 32-bits
Found GPU 2, with:
Vendor: AMD
Processor: Cayman
Device: 0b:00
Compute-Units: 24
Cache Memory: 0 KB
Local Memory: 32 KB
Global Memory: 2048 MB
Addressing: 32-bits
Found GPU 3, with:
Vendor: AMD
Processor: Cayman
Device: 0b:00
Compute-Units: 24
Cache Memory: 0 KB
Local Memory: 32 KB
Global Memory: 2048 MB
Addressing: 32-bits
Found GPU 4, with:
Vendor: AMD
Processor: Pitcairn
Device: 01:00
Compute-Units: 16
Cache Memory: 16 KB
Local Memory: 32 KB
Global Memory: 2048 MB
Addressing: 64-bits

Preparing 1 Mining Threads...

+-- Thread 0 config ------------------------+
| Run on GPU: 0 |
| Multi-hash: 224 |
| Worksize: 8 |
| Factor Alpha 64 |
| Factor Beta 8 |
+-------------------------------------------+

Cryptonight Variation: Cryptonight V8 fork of Oct-2018

Low intensity.

Starting GPU Thread 0, on GPU 0
Created OpenCL Context for GPU 0 at 000001739fb71920
Created OpenCL Thread 0 Command-Queue for GPU 0 at 00000173a03f1930
Scratchpad Allocation success for OpenCL Thread 0
Allocating big 448MB scratchpad for OpenCL Thread 0...
Compiling kernels of OpenCL Thread 0...
LLVM ERROR: Cannot select: 0x1739f7c0c60: i32 = setcc 0x173a1bcaf10, 0x173a1bcad10, 0x173a0443ed0 [ORD=4762] [ID=3015]
0x173a1bcaf10: i64 = add 0x173a1bcad10, 0x173a1bcec30 [ORD=4756] [ID=3013]
0x173a1bcad10: i64 = add 0x173a1bcac10, 0x173a1bca810 [ORD=4755] [ID=3012]
0x173a1bcac10: i64 = mul 0x173a1bca910, 0x173a1bcab10 [ORD=4754] [ID=3010]
0x173a1bca910: i64 = zero_extend 0x173a1bca410 [ORD=4751] [ID=3008]
0x173a1bca410: i32 = add 0x173a1bcf130, 0x173a1bca310 [ORD=4746] [ID=3007]
0x173a1bcf130: i32 = AMDILISD::VEXTRACT 0x173a1bca210, 0x173a1bc35d0 [ORD=4744] [ID=3006]
0x173a1bca210: v2i32 = AMDILISD::BITCONV 0x173a1bca010 [ORD=4743] [ID=3005]
0x173a1bca010: i64 = add 0x173a1bc9f10, 0x173a1bc9d00 [ORD=4742] [ID=3004]
0x173a1bc9f10: i64 = add 0x173a19bf310, 0x173a1bc9a00 [ORD=4741] [ID=3003]

              0x173a1bc9d00: i64 = mul 0x173a1bc9c00, 0x173a1bc9b00 [ORD=4740] [ID=3001]


          0x173a1bc35d0: i32 = TargetConstant<2> [ORD=4827] [ID=59]
        0x173a1bca310: i32 = llvm.AMDIL.bfi 0x1739f7bf850, 0x173a1bc9700, 0x1739f7caa00, 0x173a1bc8b00 [ORD=4745] [ID=3000]
          0x1739f7bf850: i32 = TargetConstant<2907> [ORD=2666] [ID=32]
          0x173a1bc9700: i32 = sra 0x173a1bca110, 0x1739f7c63b0 [ORD=4734] [ID=2996]
            0x173a1bca110: i32 = AMDILISD::VEXTRACT 0x173a1be24d0, 0x173a1bc35d0 [ORD=4733] [ID=2992]
              0x173a1be24d0: v2i32 = AMDILISD::BITCONV 0x173a19bf310 [ORD=4732] [ID=2989]

              0x173a1bc35d0: i32 = TargetConstant<2> [ORD=4827] [ID=59]
            0x1739f7c63b0: i32 = Constant<31> [ORD=2655] [ID=31]
          0x1739f7caa00: i32 = Constant<0> [ORD=2497] [ID=2]
          0x173a1bc8b00: i32 = sub 0x173a1bc8000, 0x173a1bc8a00 [ORD=4722] [ID=2956]
            0x173a1bc8000: i32,ch = load 0x173a1bc63f0, 0x173a1bc7f00, 0x1739f7cef40<LD4[%conv415.7]> [ORD=4710] [ID=2933]
              0x173a1bc7f00: i32 = add 0x1739f7cf440, 0x173a1bc7af0 [ORD=4708] [ID=2928]


              0x1739f7cef40: i32 = undef [ORD=2505] [ID=8]
            0x173a1bc8a00: i32 = sra 0x173a1bc8900, 0x1739f7c55b0 [ORD=4721] [ID=2953]
              0x173a1bc8900: i32 = llvm.AMDIL.mul24.i32 0x1739f848300, 0x173a1bc8800, 0x173a1bc7cf0 [ORD=4720] [ID=2950]



              0x1739f7c55b0: i32 = Constant<6> [ORD=2642] [ID=29]
    0x173a1bcab10: i64 = zero_extend 0x173a1bca610 [ORD=4753] [ID=2827]
      0x173a1bca610: i32 = AMDILISD::VEXTRACT 0x173a1bcaa10, 0x173a1bc35d0 [ORD=4752] [ID=2817]
        0x173a1bcaa10: v2i32 = AMDILISD::BITCONV 0x173a1bcec30 [ORD=4747] [ID=2806]
          0x173a1bcec30: i64 = AMDILISD::VEXTRACT 0x1739f0b8830, 0x173a1bc35d0 [ORD=4700] [ID=2799]
            0x1739f0b8830: v2i64 = AMDILISD::BITCONV 0x173a1bd4460 [ORD=4699] [ID=2792]
              0x173a1bd4460: v4i32 = AMDILISD::VINSERT 0x173a1be3ad0, 0x1739f0cc5d0, 0x1739f0ba440, 0x173a1bbfcb0 [ORD=4663] [ID=2786]




            0x173a1bc35d0: i32 = TargetConstant<2> [ORD=4827] [ID=59]
        0x173a1bc35d0: i32 = TargetConstant<2> [ORD=4827] [ID=59]
  0x173a1bca810: i64 = zero_extend 0x173a1bca710 [ORD=4750] [ID=3011]
    0x173a1bca710: i32 = llvm.AMDIL.mulhi.u32 0x1739f7be540, 0x173a1bc6ef0, 0x173a1bca410 [ORD=4749] [ID=3009]
      0x1739f7be540: i32 = TargetConstant<3113> [ORD=2646] [ID=30]
      0x173a1bc6ef0: i32 = AMDILISD::VEXTRACT 0x173a1bcaa10, 0x173a1bcb910 [ORD=4748] [ID=2816]
        0x173a1bcaa10: v2i32 = AMDILISD::BITCONV 0x173a1bcec30 [ORD=4747] [ID=2806]
          0x173a1bcec30: i64 = AMDILISD::VEXTRACT 0x1739f0b8830, 0x173a1bc35d0 [ORD=4700] [ID=2799]
            0x1739f0b8830: v2i64 = AMDILISD::BITCONV 0x173a1bd4460 [ORD=4699] [ID=2792]
              0x173a1bd4460: v4i32 = AMDILISD::VINSERT 0x173a1be3ad0, 0x1739f0cc5d0, 0x1739f0ba440, 0x173a1bbfcb0 [ORD=4663] [ID=2786]




            0x173a1bc35d0: i32 = TargetConstant<2> [ORD=4827] [ID=59]
        0x173a1bcb910: i32 = TargetConstant<1> [ORD=4776] [ID=61]
      0x173a1bca410: i32 = add 0x173a1bcf130, 0x173a1bca310 [ORD=4746] [ID=3007]
        0x173a1bcf130: i32 = AMDILISD::VEXTRACT 0x173a1bca210, 0x173a1bc35d0 [ORD=4744] [ID=3006]
          0x173a1bca210: v2i32 = AMDILISD::BITCONV 0x173a1bca010 [ORD=4743] [ID=3005]
            0x173a1bca010: i64 = add 0x173a1bc9f10, 0x173a1bc9d00 [ORD=4742] [ID=3004]
              0x173a1bc9f10: i64 = add 0x173a19bf310, 0x173a1bc9a00 [ORD=4741] [ID=3003]


              0x173a1bc9d00: i64 = mul 0x173a1bc9c00, 0x173a1bc9b00 [ORD=4740] [ID=3001]


          0x173a1bc35d0: i32 = TargetConstant<2> [ORD=4827] [ID=59]
        0x173a1bca310: i32 = llvm.AMDIL.bfi 0x1739f7bf850, 0x173a1bc9700, 0x1739f7caa00, 0x173a1bc8b00 [ORD=4745] [ID=3000]
          0x1739f7bf850: i32 = TargetConstant<2907> [ORD=2666] [ID=32]
          0x173a1bc9700: i32 = sra 0x173a1bca110, 0x1739f7c63b0 [ORD=4734] [ID=2996]
            0x173a1bca110: i32 = AMDILISD::VEXTRACT 0x173a1be24d0, 0x173a1bc35d0 [ORD=4733] [ID=2992]
              0x173a1be24d0: v2i32 = AMDILISD::BITCONV 0x173a19bf310 [ORD=4732] [ID=2989]

              0x173a1bc35d0: i32 = TargetConstant<2> [ORD=4827] [ID=59]
            0x1739f7c63b0: i32 = Constant<31> [ORD=2655] [ID=31]
          0x1739f7caa00: i32 = Constant<0> [ORD=2497] [ID=2]
          0x173a1bc8b00: i32 = sub 0x173a1bc8000, 0x173a1bc8a00 [ORD=4722] [ID=2956]
            0x173a1bc8000: i32,ch = load 0x173a1bc63f0, 0x173a1bc7f00, 0x1739f7cef40<LD4[%conv415.7]> [ORD=4710] [ID=2933]
              0x173a1bc7f00: i32 = add 0x1739f7cf440, 0x173a1bc7af0 [ORD=4708] [ID=2928]


              0x1739f7cef40: i32 = undef [ORD=2505] [ID=8]
            0x173a1bc8a00: i32 = sra 0x173a1bc8900, 0x1739f7c55b0 [ORD=4721] [ID=2953]
              0x173a1bc8900: i32 = llvm.AMDIL.mul24.i32 0x1739f848300, 0x173a1bc8800, 0x173a1bc7cf0 [ORD=4720] [ID=2950]



              0x1739f7c55b0: i32 = Constant<6> [ORD=2642] [ID=29]
0x173a1bcec30: i64 = AMDILISD::VEXTRACT 0x1739f0b8830, 0x173a1bc35d0 [ORD=4700] [ID=2799]
  0x1739f0b8830: v2i64 = AMDILISD::BITCONV 0x173a1bd4460 [ORD=4699] [ID=2792]
    0x173a1bd4460: v4i32 = AMDILISD::VINSERT 0x173a1be3ad0, 0x1739f0cc5d0, 0x1739f0ba440, 0x173a1bbfcb0 [ORD=4663] [ID=2786]
      0x173a1be3ad0: v4i32 = AMDILISD::VINSERT 0x173a1be39d0, 0x1739f0cdbd0, 0x173a1be53e0, 0x173a1be4be0 [ORD=4663] [ID=2781]
        0x173a1be39d0: v4i32 = AMDILISD::VINSERT 0x1739f0c7fa0, 0x173a1bc0fc0, 0x173a1be55e0, 0x173a1be4ae0 [ORD=4663] [ID=2776]
          0x1739f0c7fa0: v4i32 = AMDILISD::VBUILD 0x1739f0caec0 [ORD=4663] [ID=2772]
            0x1739f0caec0: i32 = xor 0x1739f0cadc0, 0x1739f0caac0 [ORD=4608] [ID=2769]
              0x1739f0cadc0: i32 = xor 0x1739f0cacc0, 0x1739f0ca5c0 [ORD=4607] [ID=2763]


              0x1739f0caac0: i32,ch = load 0x1739f0ca2c0, 0x1739f0ca9c0, 0x1739f7cef40<LD4[%arrayidx205.7]> [ORD=4604] [ID=2729]


          0x173a1bc0fc0: i32 = xor 0x173a1bc0ec0, 0x173a1bc0bc0 [ORD=4662] [ID=2767]
            0x173a1bc0ec0: i32 = xor 0x173a1bc0dc0, 0x173a1bc07c0 [ORD=4661] [ID=2761]
              0x173a1bc0dc0: i32 = xor 0x173a1bc0cc0, 0x173a1bc03c0 [ORD=4660] [ID=2755]


              0x173a1bc07c0: i32,ch = load 0x1739f0ca2c0, 0x173a1bc06c0, 0x1739f7cef40<LD4[%arrayidx305.7]> [ORD=4655] [ID=2740]


            0x173a1bc0bc0: i32,ch = load 0x1739f0ca2c0, 0x173a1bc0ac0, 0x1739f7cef40<LD4[%arrayidx313.7]> [ORD=4658] [ID=2743]
              0x173a1bc0ac0: i32 = add 0x173a1be4fe0, 0x173a1bc09c0 [ORD=4657] [ID=2723]


              0x1739f7cef40: i32 = undef [ORD=2505] [ID=8]
          0x173a1be55e0: i32 = TargetConstant<197121> [ORD=2580] [ID=75]
          0x173a1be4ae0: i32 = TargetConstant<16777216> [ORD=2580] [ID=74]
        0x1739f0cdbd0: i32 = xor 0x1739f0cdad0, 0x1739f0cd7d0 [ORD=4644] [ID=2768]
          0x1739f0cdad0: i32 = xor 0x1739f0cd9d0, 0x1739f0cd3d0 [ORD=4643] [ID=2762]
            0x1739f0cd9d0: i32 = xor 0x1739f0cd8d0, 0x1739f0ccfd0 [ORD=4642] [ID=2756]
              0x1739f0cd8d0: i32 = xor 0x1739f0ccbd0, 0x1739f0cb0c0 [ORD=4641] [ID=2749]


              0x1739f0ccfd0: i32,ch = load 0x1739f0ca2c0, 0x1739f0cced0, 0x1739f7cef40<LD4[%arrayidx261.7]> [ORD=4634] [ID=2742]


            0x1739f0cd3d0: i32,ch = load 0x1739f0ca2c0, 0x1739f0cd2d0, 0x1739f7cef40<LD4[%arrayidx269.7]> [ORD=4637] [ID=2732]
              0x1739f0cd2d0: i32 = add 0x173a1be50e0, 0x173a1bc05c0 [ORD=4636] [ID=2712]


              0x1739f7cef40: i32 = undef [ORD=2505] [ID=8]
          0x1739f0cd7d0: i32,ch = load 0x1739f0ca2c0, 0x1739f0cd6d0, 0x1739f7cef40<LD4[%arrayidx277.7]> [ORD=4640] [ID=2734]
            0x1739f0cd6d0: i32 = add 0x173a1be4fe0, 0x1739f0cd5d0 [ORD=4639] [ID=2714]
              0x173a1be4fe0: i32 = AMDILISD::ADDADDR 0x1739f7cc310, 0x1739f7caa00 [ORD=2520] [ID=85]


              0x1739f0cd5d0: i32 = shl 0x1739f0cd4d0, 0x1739f7ca0f0 [ORD=4639] [ID=2694]


            0x1739f7cef40: i32 = undef [ORD=2505] [ID=8]
        0x173a1be53e0: i32 = TargetConstant<67109377> [ORD=2580] [ID=73]
        0x173a1be4be0: i32 = TargetConstant<65536> [ORD=2580] [ID=72]
      0x1739f0cc5d0: i32 = xor 0x1739f0cc4d0, 0x1739f0cc1d0 [ORD=4626] [ID=2766]
        0x1739f0cc4d0: i32 = xor 0x1739f0cc3d0, 0x1739f0cbcc0 [ORD=4625] [ID=2760]
          0x1739f0cc3d0: i32 = xor 0x1739f0cc2d0, 0x1739f0cb8c0 [ORD=4624] [ID=2754]
            0x1739f0cc2d0: i32 = xor 0x1739f0cb4c0, 0x1739f0bae40 [ORD=4623] [ID=2747]
              0x1739f0cb4c0: i32,ch = load 0x1739f0ca2c0, 0x1739f0cb3c0, 0x1739f7cef40<LD4[%arrayidx218.7]> [ORD=4613] [ID=2730]


              0x1739f0bae40: i32 = AMDILISD::VEXTRACT 0x1739f0cddd0, 0x173a1bc35d0 [ORD=4610] [ID=2626]


            0x1739f0cb8c0: i32,ch = load 0x1739f0ca2c0, 0x1739f0cb7c0, 0x1739f7cef40<LD4[%arrayidx225.7]> [ORD=4616] [ID=2733]
              0x1739f0cb7c0: i32 = add 0x173a1be51e0, 0x1739f0cb6c0 [ORD=4615] [ID=2713]


              0x1739f7cef40: i32 = undef [ORD=2505] [ID=8]
          0x1739f0cbcc0: i32,ch = load 0x1739f0ca2c0, 0x1739f0cbbc0, 0x1739f7cef40<LD4[%arrayidx233.7]> [ORD=4619] [ID=2744]
            0x1739f0cbbc0: i32 = add 0x173a1be50e0, 0x1739f0cd1d0 [ORD=4618] [ID=2724]
              0x173a1be50e0: i32 = AMDILISD::ADDADDR 0x1739f7cb300, 0x1739f7caa00 [ORD=2516] [ID=86]


              0x1739f0cd1d0: i32 = and 0x1739f0cd0d0, 0x173a1bcb310 [ORD=4618] [ID=2704]


            0x1739f7cef40: i32 = undef [ORD=2505] [ID=8]
        0x1739f0cc1d0: i32,ch = load 0x1739f0ca2c0, 0x1739f0cbfc0, 0x1739f7cef40<LD4[%arrayidx241.7]> [ORD=4622] [ID=2738]
          0x1739f0cbfc0: i32 = add 0x173a1be4fe0, 0x1739f0cbec0 [ORD=4621] [ID=2718]
            0x173a1be4fe0: i32 = AMDILISD::ADDADDR 0x1739f7cc310, 0x1739f7caa00 [ORD=2520] [ID=85]
              0x1739f7cc310: i32 = TargetGlobalAddress<[256 x i32] addrspace(3)* @scratchrounds_cllocal_AES1> 0 [ORD=2520] [ID=68]
              0x1739f7caa00: i32 = Constant<0> [ORD=2497] [ID=2]
            0x1739f0cbec0: i32 = shl 0x1739f0cbdc0, 0x1739f7ca0f0 [ORD=4621] [ID=2698]
              0x1739f0cbdc0: i32 = llvm.AMDIL.bit.extract.u32 0x1739f7c92e0, 0x1739f7c9af0, 0x1739f7c9af0, 0x173a1bd4a60 [ORD=4620] [ID=2674]




              0x1739f7ca0f0: i32 = Constant<2> [ORD=2508] [ID=10]
          0x1739f7cef40: i32 = undef [ORD=2505] [ID=8]
      0x1739f0ba440: i32 = TargetConstant<67305473> [ORD=4698] [ID=63]
      0x173a1bbfcb0: i32 = TargetConstant<256> [ORD=4698] [ID=62]
  0x173a1bc35d0: i32 = TargetConstant<2> [ORD=4827] [ID=59]

0x173a1bcad10: i64 = add 0x173a1bcac10, 0x173a1bca810 [ORD=4755] [ID=3012]
0x173a1bcac10: i64 = mul 0x173a1bca910, 0x173a1bcab10 [ORD=4754] [ID=3010]
0x173a1bca910: i64 = zero_extend 0x173a1bca410 [ORD=4751] [ID=3008]
0x173a1bca410: i32 = add 0x173a1bcf130, 0x173a1bca310 [ORD=4746] [ID=3007]
0x173a1bcf130: i32 = AMDILISD::VEXTRACT 0x173a1bca210, 0x173a1bc35d0 [ORD=4744] [ID=3006]
0x173a1bca210: v2i32 = AMDILISD::BITCONV 0x173a1bca010 [ORD=4743] [ID=3005]
0x173a1bca010: i64 = add 0x173a1bc9f10, 0x173a1bc9d00 [ORD=4742] [ID=3004]
0x173a1bc9f10: i64 = add 0x173a19bf310, 0x173a1bc9a00 [ORD=4741] [ID=3003]
0x173a19bf310: i64 = AMDILISD::LCREATE 0x173a19bf820, 0x173a19c6890 [ORD=4731] [ID=2986]

              0x173a1bc9a00: i64 = zero_extend 0x173a1bc9900 [ORD=4737] [ID=3002]

            0x173a1bc9d00: i64 = mul 0x173a1bc9c00, 0x173a1bc9b00 [ORD=4740] [ID=3001]
              0x173a1bc9c00: i64 = zero_extend 0x173a1bca110 [ORD=4739] [ID=2997]

              0x173a1bc9b00: i64 = zero_extend 0x173a1bc8b00 [ORD=4738] [ID=2958]

        0x173a1bc35d0: i32 = TargetConstant<2> [ORD=4827] [ID=59]
      0x173a1bca310: i32 = llvm.AMDIL.bfi 0x1739f7bf850, 0x173a1bc9700, 0x1739f7caa00, 0x173a1bc8b00 [ORD=4745] [ID=3000]
        0x1739f7bf850: i32 = TargetConstant<2907> [ORD=2666] [ID=32]
        0x173a1bc9700: i32 = sra 0x173a1bca110, 0x1739f7c63b0 [ORD=4734] [ID=2996]
          0x173a1bca110: i32 = AMDILISD::VEXTRACT 0x173a1be24d0, 0x173a1bc35d0 [ORD=4733] [ID=2992]
            0x173a1be24d0: v2i32 = AMDILISD::BITCONV 0x173a19bf310 [ORD=4732] [ID=2989]
              0x173a19bf310: i64 = AMDILISD::LCREATE 0x173a19bf820, 0x173a19c6890 [ORD=4731] [ID=2986]


            0x173a1bc35d0: i32 = TargetConstant<2> [ORD=4827] [ID=59]
          0x1739f7c63b0: i32 = Constant<31> [ORD=2655] [ID=31]
        0x1739f7caa00: i32 = Constant<0> [ORD=2497] [ID=2]
        0x173a1bc8b00: i32 = sub 0x173a1bc8000, 0x173a1bc8a00 [ORD=4722] [ID=2956]
          0x173a1bc8000: i32,ch = load 0x173a1bc63f0, 0x173a1bc7f00, 0x1739f7cef40<LD4[%conv415.7]> [ORD=4710] [ID=2933]
            0x173a1bc7f00: i32 = add 0x1739f7cf440, 0x173a1bc7af0 [ORD=4708] [ID=2928]
              0x1739f7cf440: i32 = AMDILISD::ADDADDR 0x1739f0ca3c0, 0x1739f7caa00 [ORD=2628] [ID=84]


              0x173a1bc7af0: i32 = and 0x173a1bc79f0, 0x1739f7ce840 [ORD=4705] [ID=2921]


            0x1739f7cef40: i32 = undef [ORD=2505] [ID=8]
          0x173a1bc8a00: i32 = sra 0x173a1bc8900, 0x1739f7c55b0 [ORD=4721] [ID=2953]
            0x173a1bc8900: i32 = llvm.AMDIL.mul24.i32 0x1739f848300, 0x173a1bc8800, 0x173a1bc7cf0 [ORD=4720] [ID=2950]
              0x1739f848300: i32 = TargetConstant<3099> [ORD=2641] [ID=28]
              0x173a1bc8800: i32 = and 0x173a1bc8700, 0x1739f6f3ac0 [ORD=4719] [ID=2948]


              0x173a1bc7cf0: i32 = add 0x173a1bc7bf0, 0x1739f7c65c0 [ORD=4707] [ID=2920]


            0x1739f7c55b0: i32 = Constant<6> [ORD=2642] [ID=29]
  0x173a1bcab10: i64 = zero_extend 0x173a1bca610 [ORD=4753] [ID=2827]
    0x173a1bca610: i32 = AMDILISD::VEXTRACT 0x173a1bcaa10, 0x173a1bc35d0 [ORD=4752] [ID=2817]
      0x173a1bcaa10: v2i32 = AMDILISD::BITCONV 0x173a1bcec30 [ORD=4747] [ID=2806]
        0x173a1bcec30: i64 = AMDILISD::VEXTRACT 0x1739f0b8830, 0x173a1bc35d0 [ORD=4700] [ID=2799]
          0x1739f0b8830: v2i64 = AMDILISD::BITCONV 0x173a1bd4460 [ORD=4699] [ID=2792]
            0x173a1bd4460: v4i32 = AMDILISD::VINSERT 0x173a1be3ad0, 0x1739f0cc5d0, 0x1739f0ba440, 0x173a1bbfcb0 [ORD=4663] [ID=2786]
              0x173a1be3ad0: v4i32 = AMDILISD::VINSERT 0x173a1be39d0, 0x1739f0cdbd0, 0x173a1be53e0, 0x173a1be4be0 [ORD=4663] [ID=2781]




              0x1739f0cc5d0: i32 = xor 0x1739f0cc4d0, 0x1739f0cc1d0 [ORD=4626] [ID=2766]


              0x1739f0ba440: i32 = TargetConstant<67305473> [ORD=4698] [ID=63]
              0x173a1bbfcb0: i32 = TargetConstant<256> [ORD=4698] [ID=62]
          0x173a1bc35d0: i32 = TargetConstant<2> [ORD=4827] [ID=59]
      0x173a1bc35d0: i32 = TargetConstant<2> [ORD=4827] [ID=59]
0x173a1bca810: i64 = zero_extend 0x173a1bca710 [ORD=4750] [ID=3011]
  0x173a1bca710: i32 = llvm.AMDIL.mulhi.u32 0x1739f7be540, 0x173a1bc6ef0, 0x173a1bca410 [ORD=4749] [ID=3009]
    0x1739f7be540: i32 = TargetConstant<3113> [ORD=2646] [ID=30]
    0x173a1bc6ef0: i32 = AMDILISD::VEXTRACT 0x173a1bcaa10, 0x173a1bcb910 [ORD=4748] [ID=2816]
      0x173a1bcaa10: v2i32 = AMDILISD::BITCONV 0x173a1bcec30 [ORD=4747] [ID=2806]
        0x173a1bcec30: i64 = AMDILISD::VEXTRACT 0x1739f0b8830, 0x173a1bc35d0 [ORD=4700] [ID=2799]
          0x1739f0b8830: v2i64 = AMDILISD::BITCONV 0x173a1bd4460 [ORD=4699] [ID=2792]
            0x173a1bd4460: v4i32 = AMDILISD::VINSERT 0x173a1be3ad0, 0x1739f0cc5d0, 0x1739f0ba440, 0x173a1bbfcb0 [ORD=4663] [ID=2786]
              0x173a1be3ad0: v4i32 = AMDILISD::VINSERT 0x173a1be39d0, 0x1739f0cdbd0, 0x173a1be53e0, 0x173a1be4be0 [ORD=4663] [ID=2781]




              0x1739f0cc5d0: i32 = xor 0x1739f0cc4d0, 0x1739f0cc1d0 [ORD=4626] [ID=2766]


              0x1739f0ba440: i32 = TargetConstant<67305473> [ORD=4698] [ID=63]
              0x173a1bbfcb0: i32 = TargetConstant<256> [ORD=4698] [ID=62]
          0x173a1bc35d0: i32 = TargetConstant<2> [ORD=4827] [ID=59]
      0x173a1bcb910: i32 = TargetConstant<1> [ORD=4776] [ID=61]
    0x173a1bca410: i32 = add 0x173a1bcf130, 0x173a1bca310 [ORD=4746] [ID=3007]
      0x173a1bcf130: i32 = AMDILISD::VEXTRACT 0x173a1bca210, 0x173a1bc35d0 [ORD=4744] [ID=3006]
        0x173a1bca210: v2i32 = AMDILISD::BITCONV 0x173a1bca010 [ORD=4743] [ID=3005]
          0x173a1bca010: i64 = add 0x173a1bc9f10, 0x173a1bc9d00 [ORD=4742] [ID=3004]
            0x173a1bc9f10: i64 = add 0x173a19bf310, 0x173a1bc9a00 [ORD=4741] [ID=3003]
              0x173a19bf310: i64 = AMDILISD::LCREATE 0x173a19bf820, 0x173a19c6890 [ORD=4731] [ID=2986]


              0x173a1bc9a00: i64 = zero_extend 0x173a1bc9900 [ORD=4737] [ID=3002]

            0x173a1bc9d00: i64 = mul 0x173a1bc9c00, 0x173a1bc9b00 [ORD=4740] [ID=3001]
              0x173a1bc9c00: i64 = zero_extend 0x173a1bca110 [ORD=4739] [ID=2997]

              0x173a1bc9b00: i64 = zero_extend 0x173a1bc8b00 [ORD=4738] [ID=2958]

        0x173a1bc35d0: i32 = TargetConstant<2> [ORD=4827] [ID=59]
      0x173a1bca310: i32 = llvm.AMDIL.bfi 0x1739f7bf850, 0x173a1bc9700, 0x1739f7caa00, 0x173a1bc8b00 [ORD=4745] [ID=3000]
        0x1739f7bf850: i32 = TargetConstant<2907> [ORD=2666] [ID=32]
        0x173a1bc9700: i32 = sra 0x173a1bca110, 0x1739f7c63b0 [ORD=4734] [ID=2996]
          0x173a1bca110: i32 = AMDILISD::VEXTRACT 0x173a1be24d0, 0x173a1bc35d0 [ORD=4733] [ID=2992]
            0x173a1be24d0: v2i32 = AMDILISD::BITCONV 0x173a19bf310 [ORD=4732] [ID=2989]
              0x173a19bf310: i64 = AMDILISD::LCREATE 0x173a19bf820, 0x173a19c6890 [ORD=4731] [ID=2986]


            0x173a1bc35d0: i32 = TargetConstant<2> [ORD=4827] [ID=59]
          0x1739f7c63b0: i32 = Constant<31> [ORD=2655] [ID=31]
        0x1739f7caa00: i32 = Constant<0> [ORD=2497] [ID=2]
        0x173a1bc8b00: i32 = sub 0x173a1bc8000, 0x173a1bc8a00 [ORD=4722] [ID=2956]
          0x173a1bc8000: i32,ch = load 0x173a1bc63f0, 0x173a1bc7f00, 0x1739f7cef40<LD4[%conv415.7]> [ORD=4710] [ID=2933]
            0x173a1bc7f00: i32 = add 0x1739f7cf440, 0x173a1bc7af0 [ORD=4708] [ID=2928]
              0x1739f7cf440: i32 = AMDILISD::ADDADDR 0x1739f0ca3c0, 0x1739f7caa00 [ORD=2628] [ID=84]


              0x173a1bc7af0: i32 = and 0x173a1bc79f0, 0x1739f7ce840 [ORD=4705] [ID=2921]


            0x1739f7cef40: i32 = undef [ORD=2505] [ID=8]
          0x173a1bc8a00: i32 = sra 0x173a1bc8900, 0x1739f7c55b0 [ORD=4721] [ID=2953]
            0x173a1bc8900: i32 = llvm.AMDIL.mul24.i32 0x1739f848300, 0x173a1bc8800, 0x173a1bc7cf0 [ORD=4720] [ID=2950]
              0x1739f848300: i32 = TargetConstant<3099> [ORD=2641] [ID=28]
              0x173a1bc8800: i32 = and 0x173a1bc8700, 0x1739f6f3ac0 [ORD=4719] [ID=2948]


              0x173a1bc7cf0: i32 = add 0x173a1bc7bf0, 0x1739f7c65c0 [ORD=4707] [ID=2920]


            0x1739f7c55b0: i32 = Constant<6> [ORD=2642] [ID=29]

In function: __OpenCL_scratchrounds_kernel

@jceminer
Copy link
Owner

Hello,

The current code for v8 is a partially optimized version of the reference code, the same as in xmrstak, as you guessed. I normally write all my code from scratch, but I had to rush the v8 GPU version so I did an exception and just took the reference code provided by Monero team. You can read the Bitcointalk topic where i give details of the causes of such delay.

Depending on what is the faster, i will either rewrite my implementation and so that broken line of code will just disappear, or i will apply the reference fix.
Thanks for report.

@jceminer
Copy link
Owner

jceminer commented Nov 1, 2018

Fixed the good way: I rewrote everything and the problem is away. I tested on Drivers 14.12 with a HD6950 (i bet that's the same as with your 14.4).
on 0.33b3

@xmronadaily
Copy link
Author

Awesome! Those are great news! Were you able to get more than ~220h/s on that 6950? The max possible I've been getting on 6990 (which are 2x 6970s) is around 220h/s with 14.4 drivers on xmr stak. The intensity for those cards was 874 i believe and worksize of either 18, 19 or 22 produced best results. Work size of 8 just gave poor performance which was also the case in v7 algo before the upgrade.
The core ran at 830 and memory was 1250.

I will your new version later today, but just wondering for now, that might some sort of benchmark for those old HD69 series :)

@jceminer
Copy link
Owner

jceminer commented Nov 1, 2018

my card is on a rig with a weak psu that cannot afford let it mine at full power, i can only test the mining works, but if i configure it at max, i know the psu will burn. that rig with hd6000 has the lowest priority so i'm continuously taking spare parts from it and replace them by undersized ones.
this is sad because it was my first mining card and it survived all my psu problems, including some burnt cables or capacitor explosions :-p
so i cannot answer, but i remember that on CN-classic i pulled 420 H/s from it with custom memory timings, back in 2016

@jceminer
Copy link
Owner

jceminer commented Nov 3, 2018

Please upgrade to 0.33b4, there's a nice boost for v8 on HD6900

@xmronadaily
Copy link
Author

Awesome, I'll check it out in the next few hours! :)

@xmronadaily
Copy link
Author

xmronadaily commented Nov 3, 2018

Win 10 64, 14.4 drivers. Used default clocks for gpu: 830 core and 1250 memory. No memory timing mods or anything.
Ok I tested the miner now, best results I achieved is 270 h/s per core on 6990.

So that is 2x 270 h/s = ~ 540 h/s total for 6990 (which is 2 gpu cores of 6970s in it)
Settings used : 32 worksize and 864 intensity.

This is a very good improvement over current xmr stak which is giving ~ 220h/ per core or 440 total.

However,
I'm having problems trying to use 19 worksize which was found to be best on cn v7, whenever I try to set 19 worksize and 874 intensity (which works on xmr stak), here on JCE it gives me error that intensity must be multiple of worksize 16, even though I'm setting it to 19... etc. Only way I found this to work for higher worksizes is to set intensity that has a common multiplier with 16, but that is very limiting.

These cards should be able to handle up to 888 intensity I think before cl buffer error, ( they work with 880 intensity and 20 worksize for example 'cause it's also a multiple of 16)

Is there a possibility to implement some mode that would disable this automatic error that requires intensity to always be a multiple of 16? Even when we're setting completely different worksize not related to 16?

All props for the speed improvement so far though, great job!

@jceminer I think you should be able to replicate 270h/s on 6950s with 32 worksize and 864 intensity :)

@jceminer
Copy link
Owner

jceminer commented Nov 3, 2018

This is not an error, this is a technical requirement in JCE. 16, because that's 4x4, that's how some data are grouped inside the miner memory.
Don't try to use xmr-stak parameters as-is, my miner is completely different.

@xmronadaily
Copy link
Author

Okay, thanks for letting me know! Do you think further optimization is possible for these gpus, to go beyond 270h per core?

@Wacholek
Copy link

Wacholek commented Nov 10, 2018

I can confirm that JCE works quite nice. @930Mhz 282h/s. Stability is far better than xmr-stak but for 8 GPUs (4xHD6990) 14.4 have problems. And Win8.1+ only.

@SomethingGettingWrong
Copy link

SomethingGettingWrong commented Nov 18, 2018

Windows 7with 14.4 on a 6970 gets 320 hash on windows ten same driver 220 hash

Also if you update past 14.4 even if you ddu uninstall it's permantly messes up opencl... Not sure why. Need to reinstall windows and don't go past 14.4 it makes you get 220hashs then... Something gets messed up.

@xmronadaily
Copy link
Author

xmronadaily commented Nov 19, 2018 via email

@Wacholek
Copy link

Windows 7with 14.4 on a 6970 gets 320 hash on windows ten same driver 220 hash

Also if you update past 14.4 even if you ddu uninstall it's permantly messes up opencl... Not sure why. Need to reinstall windows and don't go past 14.4 it makes you get 220hashs then... Something gets messed up.

What settings have you use on Win7 to get 320h/s?
Please post the screen from the miners hash results nad the config from the amd.txt file.

@jceminer
Copy link
Owner

take a look at the standalone .exe that's an experimental version that should be faster on hd6000

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants