-
-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
All predictions place in top left-hand corner [ RTX 3*** does NOT work with TensorFlow 1.x! == odd errors! Please use deeplabcutcore ] #1142
Comments
Doesn't #1017 solve that? I ran the following code:
I can confirm I have no labels in the top left corner under .../labeled-data/_videofile__labeled. They are all correctly placed. |
^^ also note that now one can also just use @Ejdrup -- it must be you have TF1.x installed; we confirm this happens with TF 1, but NOT TF2! |
Cool. Would I need a fresh install of DLC-GPU environ, or can I just pip in the old environment? |
in the same env you can run (or also of course you can just make a new one) |
This is brand new setup, and we already have way too many cross-dependencies trying to load different cudnn and cuda versions, so we're just doing a fresh install. Which Ubuntu flavor do you guys use? It's primarily going to be a tracking workstations anyway, so trying to mimic your environment as much as possible. |
NOTE: we now support 3090 and TensorFlow 2! Please install a new conda file, you can direct download from deeplabcut.org - this comes with 2.2rc3 installed inside the conda, the latest tensorflow, and works with the latest CUDA. see the blog: http://blog.deeplabcut.org/ for the new code base highlights |
OS: Ubuntu 20.10 (Pop!_OS)
DeepLabCut Version: 2.1.10.2
CUDA: 10.0
cudnn: 7.6.5
GPU: RTX 3090
Tensorflow 1.13.1
Anaconda env used: DLC-GPU
After setting a new Ubuntu system and installing a fresh DLC-GPU, I've encountered an issue with GPU. Initially I received an error training, but managed to fix it following this issue, as I'm using RTX 3090: #1017
However, even though I'm able to progress through training without any issues, when I evaluate the network I get an error of roughly 500 pixels in both train and test despite running 100,000 iterations and plateuing at a loss of 0.03. However, all labels are placed in the top left-hand corner. I've tried starting a new project and relabeling, but to no avail. I've also attached a photo. Interestingly, after the first 1000 iterations the terminal outputs an astronomically high error, then drops to 0.12 at 2000.
![Training-Tracking_video2020-11-10T14_41_16-img19450](https://user-images.githubusercontent.com/50575240/111030683-f11e5f80-8403-11eb-9ddb-0cbcacc64d8a.png)
I've previously used DLC without issues. I've even processed videos from an identical setup using older versions of DLC on a windows machine with great results.
Here's the terminal output:
The text was updated successfully, but these errors were encountered: