CudaException code 77 #19

Closed
turowicz opened this issue Dec 19, 2018 · 16 comments

Comments

@turowicz

I'm getting the following error:

      CUDA Error Lib:libDlibDotNet.Native.Dnn.so Code:77 Driver:10000 Runtime:10000 Message:Exception of type 'DlibDotNet.CudaException' was thrown..
fail: People.Service[0]
      Exception of type 'DlibDotNet.CudaException' was thrown.
DlibDotNet.CudaException: Exception of type 'DlibDotNet.CudaException' was thrown.
   at DlibDotNet.Dnn.Cuda.ThrowCudaException(ErrorType error)
   at DlibDotNet.Dnn.LossMmod.Operator[T](IEnumerable`1 images, UInt64 batchSize)
   at FaceRecognitionDotNet.Dlib.Python.CnnFaceDetectionModelV1.Detect(LossMmod net, Image image, Int32 upsampleNumTimes)
   at FaceRecognitionDotNet.FaceRecognition.RawFaceLocations(Image faceImage, Int32 numberOfTimesToUpsample, Model model)
   at FaceRecognitionDotNet.FaceRecognition.FaceLocations(Image image, Int32 numberOfTimesToUpsample, Model model)+MoveNext()
   at System.Collections.Generic.List`1.AddEnumerable(IEnumerable`1 enumerable)
   at System.Linq.Enumerable.ToList[TSource](IEnumerable`1 source)
   at People.Common.Services.IdentificationService.IdentifyAsync(Guid applicationId, Mat frame, Double tolerance) in /app/src/People.Common/Services/IdentificationService.cs:line 70
   at People.Common.Pipeline.Blocks.IdentificationBlock.CheckIdentityAsync(Result result) in /app/src/People.Common/Pipeline/Blocks/IdentificationBlock.cs:line 54

Any ideas what this means? Googling the error code

@turowicz
Author

@takuya-takeuchi have you had such problems?

@takuya-takeuchi
Owner

@turowicz I have never faced it. Code 77 is cudaErrorIllegalAddress.
I guess the image data does not matter. numberOfTimesToUpsample may be causing the issue.
Could you provide some information: image size, the value of numberOfTimesToUpsample, etc.?
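
For reference, the numbers in the log line above can be decoded mechanically. The following is a minimal sketch, assuming the error numbering of the CUDA 10.0 runtime headers (CUDA 10.1 and later renumbered several of these codes) and the usual `1000 * major + 10 * minor` version encoding. The table is a small hand-picked subset, and `parse_cuda_log` and `decode_version` are hypothetical helper names, not part of DlibDotNet.

```python
import re

# Subset of cudaError_t values as numbered in the CUDA 10.0 runtime headers.
# Assumption: this table only applies to logs from 10.0-era runtimes.
CUDA_10_0_ERRORS = {
    2: "cudaErrorMemoryAllocation",
    74: "cudaErrorMisalignedAddress",
    77: "cudaErrorIllegalAddress",
}


def decode_version(raw: int) -> str:
    """CUDA reports versions as 1000 * major + 10 * minor."""
    return f"{raw // 1000}.{(raw % 1000) // 10}"


def parse_cuda_log(line: str) -> dict:
    """Extract the Code/Driver/Runtime fields from a DlibDotNet-style log line."""
    fields = dict(re.findall(r"(Code|Driver|Runtime):(\d+)", line))
    code = int(fields["Code"])
    return {
        "code": code,
        "name": CUDA_10_0_ERRORS.get(code, "unknown"),
        # Driver/Runtime are optional so a truncated line still parses.
        "driver": decode_version(int(fields["Driver"])) if "Driver" in fields else None,
        "runtime": decode_version(int(fields["Runtime"])) if "Runtime" in fields else None,
    }


line = "CUDA Error Lib:libDlibDotNet.Native.Dnn.so Code:77 Driver:10000 Runtime:10000"
info = parse_cuda_log(line)
print(info)
```

An illegal address error usually means a kernel touched device memory that was invalid or already freed, which fits a problem that only shows up after long runs.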

@turowicz
Author

size: 1920x1080 or 1440x800 (depends on source)
numberOfTimesToUpsample: 0

@turowicz
Author

The error starts to appear after ~1h of video processing (frame by frame)

@turowicz
Author

I'm investigating memory leaks.

@turowicz
Author

@takuya-takeuchi
Owner

@turowicz
You are right. Thank you!!
The same issue also exists on other lines.

@takuya-takeuchi
Owner

@turowicz
Could you try the NuGet package 1.2.3.6?
If you have any issue, please let me know!!! Thank you!!

@turowicz
Author

turowicz commented Jan 2, 2019

@takuya-takeuchi I will test as soon as possible.

@turowicz
Author

turowicz commented Jan 7, 2019

@takuya-takeuchi I haven't seen this error for a while now. I have started getting error 74 though. Investigating. I will create a separate issue if it turns out to be a real problem.

@turowicz
Author

turowicz commented Jan 9, 2019

@takuya-takeuchi unfortunately the error still exists.

@turowicz
Author

turowicz commented Jan 9, 2019

Actually, I think this is due to a problem on my end. Only one of my two PCs has this error. It's an Alienware Aurora R7 running Ubuntu 18.04.

@turowicz
Author

turowicz commented Jan 9, 2019

BTW memory usage is much better after the memory leak fix.

@takuya-takeuchi
Owner

> Alienware Aurora R7 running Ubuntu 18.04

I have the same machine, and I boot Ubuntu 18.04 from a USB drive.
My R7 has a GTX 1080 (not Ti).

I may be able to reproduce the same issue. Could you tell me your NVIDIA component versions (CUDA and cuDNN)?

@takuya-takeuchi
Owner

And if you can, could you provide minimal sample source code that reproduces the issue?

@turowicz
Author

The code is as simple as getting face locations from an image. Run it on 1,000,000 images or a video and at some point the error will appear. It happens randomly, but once it does, nothing fixes it until I reboot the computer. The error only appears on that one computer.

Nvidia SMI:

+-----------------------------------------------------------------------------+
| NVIDIA-SMI 410.79       Driver Version: 410.79       CUDA Version: 10.0     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  GeForce GTX 108...  Off  | 00000000:01:00.0 Off |                  N/A |
| 46%   78C    P2   121W / 250W |   1599MiB / 11177MiB |     92%      Default |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|    0     23522      C   dotnet                                      1589MiB |
+-----------------------------------------------------------------------------+
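
Since the thread mentions a memory leak and a snapshot like the one above only shows a single moment, one way to watch device memory over a long run is to sample `nvidia-smi` periodically. This is a rough sketch, not something taken from the project: the helper names are hypothetical, and it assumes `nvidia-smi` is on the PATH and supports the `--query-gpu` and `--format=csv,noheader,nounits` options.

```python
import subprocess


def parse_memory_used(csv_text: str) -> list[int]:
    """Parse one integer (MiB used) per GPU from nvidia-smi CSV output."""
    return [int(line.strip()) for line in csv_text.splitlines() if line.strip()]


def sample_memory_used() -> list[int]:
    """Query used device memory in MiB for every GPU (requires nvidia-smi)."""
    out = subprocess.run(
        ["nvidia-smi", "--query-gpu=memory.used", "--format=csv,noheader,nounits"],
        check=True,
        capture_output=True,
        text=True,
    ).stdout
    return parse_memory_used(out)


def growth_mib(samples: list[int]) -> int:
    """Last sample minus first; sustained positive growth hints at a leak."""
    return samples[-1] - samples[0] if samples else 0


# Example with made-up readings for one GPU, taken at intervals during a run.
readings = [1500, 1540, 1599]
print(growth_mib(readings))
```

Calling `sample_memory_used()` every few thousand frames and logging the result would show whether usage keeps climbing before the error appears. Note that this only tracks device memory as reported by the driver, not host-side allocations.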
