CL_OUT_OF_RESOURCES on Nvidia #73

Open
Kubuxu opened this issue Nov 3, 2016 · 6 comments

Comments

@Kubuxu

Kubuxu commented Nov 3, 2016

I know that Nvidia was not the target of this miner, but there are not many CUDA miners right now and I wanted to try it out on my 1070.

System info: up-to-date Arch, i7-6800k and 1070.

Trying to run the benchmark, I get the following log:

./zcash-miner: /opt/cuda/lib64/libOpenCL.so.1: no version information available (required by ./zcash-miner)
./zcash-miner: /opt/cuda/lib64/libOpenCL.so.1: no version information available (required by ./zcash-miner)
Running the test miner
[OPENCL]:Found suitable OpenCL device [GeForce GTX 1070] with 8504279040 bytes of GPU memory
[OPENCL]:Using platform: NVIDIA CUDA
[OPENCL]:Using device: GeForce GTX 1070(OpenCL 1.2 CUDA)
[OPENCL]:Printing program log
[OPENCL]:

[OPENCL]:CL ERROR:CL_OUT_OF_RESOURCES
[1]    2980 segmentation fault (core dumped)  ./zcash-miner -G

and the backtrace from the core dump is:

#0  0x0000561829626d7d in GPUSolver::GPUSolve200_9(unsigned char*, unsigned long, unsigned long, std::function<bool (std::vector<unsigned char, std::allocator<unsigned char> >)>, std::function<bool (GPUSolverCancelCheck)>, crypto_generichash_blake2b_state) (this=this@entry=0x56182b64e4a0, header=header@entry=0x56182c16b6d0 "\004", 
    header_len=header_len@entry=108, nonce=nonce@entry=0, validBlock=..., cancelled=..., base_state=...)
    at libzogminer/gpusolver.cpp:212
#1  0x0000561829626f91 in GPUSolver::run(unsigned int, unsigned int, unsigned char*, unsigned long, unsigned long, std::function<bool (std::vector<unsigned char, std::allocator<unsigned char> >)>, std::function<bool (GPUSolverCancelCheck)>, crypto_generichash_blake2b_state) (this=<optimized out>, n=<optimized out>, k=<optimized out>, 
    header=0x56182c16b6d0 "\004", header_len=108, nonce=0, validBlock=..., cancelled=..., base_state=...)
    at libzogminer/gpusolver.cpp:159
#2  0x000056182942edb4 in test_mine (n=<optimized out>, k=<optimized out>, d=<optimized out>, conf=...)
    at standaloneminer.cpp:186
#3  0x0000561829432025 in main (argc=<optimized out>, argv=<optimized out>) at standaloneminer.cpp:316
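
The segfault directly after `CL_OUT_OF_RESOURCES` suggests the host code may have carried on and read result memory after the OpenCL call failed. That is an inference from the log, not something confirmed against `gpusolver.cpp`. A minimal sketch of the kind of guard that would turn the crash into a clean error, assuming hypothetical helper names (`clStatusName`, `shouldReadResults`) and using the numeric status values from the Khronos `CL/cl.h` header:

```cpp
#include <cstdint>
#include <string>

// Status values as defined in the Khronos CL/cl.h header.
constexpr std::int32_t kClSuccess = 0;
constexpr std::int32_t kClOutOfResources = -5;

// Hypothetical helper: map the two status codes relevant here to names
// so a failure can be logged instead of silently ignored.
inline std::string clStatusName(std::int32_t status) {
    switch (status) {
        case kClSuccess:        return "CL_SUCCESS";
        case kClOutOfResources: return "CL_OUT_OF_RESOURCES";
        default:                return "CL_ERROR(" + std::to_string(status) + ")";
    }
}

// Hypothetical guard: only touch the solver's output buffers when the
// preceding OpenCL call reported success.
inline bool shouldReadResults(std::int32_t status) {
    return status == kClSuccess;
}
```

A caller would check `shouldReadResults(status)` after each enqueue or read call and return early with `clStatusName(status)` in the log when it is false.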
@Kubuxu
Author

Kubuxu commented Nov 3, 2016

I also get the following errors in the kernel log:

[  343.963739] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Warp Exception on (GPC 0, TPC 0): Misaligned Address
[  343.963751] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Global Exception on (GPC 0, TPC 0): Physical Multiple Warp Errors
[  343.963755] NVRM: Xid (PCI:0000:03:00): 13, Graphics Exception: ESR 0x504648=0x104000f 0x504650=0x4 0x504644=0xd3eff2 0x50464c=0x17f
[  343.963786] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Warp Exception on (GPC 0, TPC 1): Misaligned Address
[  343.963790] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Global Exception on (GPC 0, TPC 1): Physical Multiple Warp Errors
[  343.963793] NVRM: Xid (PCI:0000:03:00): 13, Graphics Exception: ESR 0x504e48=0x10d000f 0x504e50=0x24 0x504e44=0xd3eff2 0x504e4c=0x17f
[  343.963824] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Warp Exception on (GPC 0, TPC 2): Misaligned Address
[  343.963827] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Global Exception on (GPC 0, TPC 2): Physical Multiple Warp Errors
[  343.963831] NVRM: Xid (PCI:0000:03:00): 13, Graphics Exception: ESR 0x505648=0x11f000f 0x505650=0x24 0x505644=0xd3eff2 0x50564c=0x17f
[  343.963861] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Warp Exception on (GPC 0, TPC 3): Misaligned Address
[  343.963864] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Global Exception on (GPC 0, TPC 3): Physical Multiple Warp Errors
[  343.963868] NVRM: Xid (PCI:0000:03:00): 13, Graphics Exception: ESR 0x505e48=0x11f000f 0x505e50=0x24 0x505e44=0xd3eff2 0x505e4c=0x17f
[  343.963898] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Warp Exception on (GPC 0, TPC 4): Misaligned Address
[  343.963901] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Global Exception on (GPC 0, TPC 4): Physical Multiple Warp Errors
[  343.963904] NVRM: Xid (PCI:0000:03:00): 13, Graphics Exception: ESR 0x506648=0x114000f 0x506650=0x24 0x506644=0xd3eff2 0x50664c=0x17f
[  343.963935] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Warp Exception on (GPC 1, TPC 0): Misaligned Address
[  343.963939] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Global Exception on (GPC 1, TPC 0): Physical Multiple Warp Errors
[  343.963942] NVRM: Xid (PCI:0000:03:00): 13, Graphics Exception: ESR 0x50c648=0x112000f 0x50c650=0x24 0x50c644=0xd3eff2 0x50c64c=0x17f
[  343.963972] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Warp Exception on (GPC 1, TPC 1): Misaligned Address
[  343.963976] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Global Exception on (GPC 1, TPC 1): Physical Multiple Warp Errors
[  343.963979] NVRM: Xid (PCI:0000:03:00): 13, Graphics Exception: ESR 0x50ce48=0x106000f 0x50ce50=0x24 0x50ce44=0xd3eff2 0x50ce4c=0x17f
[  343.964009] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Warp Exception on (GPC 1, TPC 2): Misaligned Address
[  343.964013] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Global Exception on (GPC 1, TPC 2): Physical Multiple Warp Errors
[  343.964016] NVRM: Xid (PCI:0000:03:00): 13, Graphics Exception: ESR 0x50d648=0x110000f 0x50d650=0x24 0x50d644=0xd3eff2 0x50d64c=0x17f
[  343.964044] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Warp Exception on (GPC 1, TPC 3): Misaligned Address
[  343.964047] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Global Exception on (GPC 1, TPC 3): Physical Multiple Warp Errors
[  343.964051] NVRM: Xid (PCI:0000:03:00): 13, Graphics Exception: ESR 0x50de48=0x10e000f 0x50de50=0x24 0x50de44=0xd3eff2 0x50de4c=0x17f
[  343.964077] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Warp Exception on (GPC 1, TPC 4): Misaligned Address
[  343.964080] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Global Exception on (GPC 1, TPC 4): Physical Multiple Warp Errors
[  343.964084] NVRM: Xid (PCI:0000:03:00): 13, Graphics Exception: ESR 0x50e648=0x107000f 0x50e650=0x24 0x50e644=0xd3eff2 0x50e64c=0x17f
[  343.964110] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Warp Exception on (GPC 2, TPC 0): Misaligned Address
[  343.964114] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Global Exception on (GPC 2, TPC 0): Physical Multiple Warp Errors
[  343.964117] NVRM: Xid (PCI:0000:03:00): 13, Graphics Exception: ESR 0x514648=0x10a000f 0x514650=0x24 0x514644=0xd3eff2 0x51464c=0x17f
[  343.964143] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Warp Exception on (GPC 2, TPC 1): Misaligned Address
[  343.964146] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Global Exception on (GPC 2, TPC 1): Physical Multiple Warp Errors
[  343.964150] NVRM: Xid (PCI:0000:03:00): 13, Graphics Exception: ESR 0x514e48=0x10b000f 0x514e50=0x24 0x514e44=0xd3eff2 0x514e4c=0x17f
[  343.964176] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Warp Exception on (GPC 2, TPC 2): Misaligned Address
[  343.964179] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Global Exception on (GPC 2, TPC 2): Physical Multiple Warp Errors
[  343.964182] NVRM: Xid (PCI:0000:03:00): 13, Graphics Exception: ESR 0x515648=0x113000f 0x515650=0x24 0x515644=0xd3eff2 0x51564c=0x17f
[  343.964208] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Warp Exception on (GPC 2, TPC 3): Misaligned Address
[  343.964213] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Global Exception on (GPC 2, TPC 3): Physical Multiple Warp Errors
[  343.964230] NVRM: Xid (PCI:0000:03:00): 13, Graphics Exception: ESR 0x515e48=0x109000f 0x515e50=0x24 0x515e44=0xd3eff2 0x515e4c=0x17f
[  343.964267] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Warp Exception on (GPC 2, TPC 4): Misaligned Address
[  343.964271] NVRM: Xid (PCI:0000:03:00): 13, Graphics SM Global Exception on (GPC 2, TPC 4): Physical Multiple Warp Errors
[  343.964274] NVRM: Xid (PCI:0000:03:00): 13, Graphics Exception: ESR 0x516648=0x10d000f 0x516650=0x24 0x516644=0xd3eff2 0x51664c=0x17f
[  343.964527] NVRM: Xid (PCI:0000:03:00): 43, Ch 00000018, engmask 00000101
[  343.967186] zcash-miner[14525]: segfault at 7fa2356807e8 ip 0000555aedfb6d7d sp 00007ffc45b952c0 error 4 in zcash-miner[555aedd5b000+326000]

I can run nheqminer on CUDA without problems.

@ddobreff

ddobreff commented Nov 3, 2016

Currently there is no support for CUDA/Nvidia devices. I hope someone will implement CUDA support soon.

@Kubuxu
Author

Kubuxu commented Nov 3, 2016

Shouldn't OpenCL also run on Nvidia devices?

@antho281

antho281 commented Nov 3, 2016

Yes, NVIDIA supports OpenCL. You should be able to mine with an NVIDIA GPU.

@Kubuxu
Author

Kubuxu commented Nov 3, 2016

Hence my report.

@nginnever
Owner

nginnever commented Nov 4, 2016

Hey Kubu! Good to hear from you. From my minimal understanding of hardware programming, I think the problem on Nvidia comes partly from our kernel being tuned for wavefronts (the AMD scheduling unit) rather than warps (the Nvidia one). It has to do with the way the GPU schedules the thousands of threads it uses to do the Equihash work: an Nvidia warp schedules 32 threads at a time, while an AMD wavefront schedules 64.

Here you can see we're sizing the kernel's local workgroup for the AMD wavefront:

https://github.com/nginnever/zogminer/blob/master/src/libzogminer/cl_zogminer.cpp#L403

The good news is that the kernel developer has, I believe, mentioned plans to support warps.
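
To illustrate the warp versus wavefront point, here is a minimal sketch of choosing the local work-group size per vendor rather than hard-coding the AMD value. The helper names (`preferredLocalSize`, `roundUpGlobalSize`) are hypothetical and are not taken from `cl_zogminer.cpp`; the sizes 32 and 64 are the ones mentioned above. A real host program would more likely query `CL_KERNEL_PREFERRED_WORK_GROUP_SIZE_MULTIPLE` through `clGetKernelWorkGroupInfo` than match on the platform name. Changing the local size alone may not be enough to clear the misaligned-address errors if the kernel itself assumes 64-wide groups.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Hypothetical helper: choose a local work-group size from the platform
// name reported by OpenCL (e.g. "NVIDIA CUDA" in the log above).
// 32 matches an Nvidia warp, 64 an AMD wavefront.
inline std::size_t preferredLocalSize(const std::string& platformName) {
    if (platformName.find("NVIDIA") != std::string::npos) {
        return 32;
    }
    return 64;  // assumed default: AMD wavefront size
}

// Round the global work size up to a multiple of the local size.
// OpenCL 1.x rejects an NDRange whose global size is not evenly
// divisible by an explicitly specified local size.
inline std::size_t roundUpGlobalSize(std::size_t global, std::size_t local) {
    assert(local > 0);
    return ((global + local - 1) / local) * local;
}
```

For example, `roundUpGlobalSize(100, preferredLocalSize("NVIDIA CUDA"))` pads 100 work-items up to 128, which is four full warps; the kernel would then need a bounds check so the extra work-items do nothing.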


4 participants