-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Interesting but unusable. Too much false positive. #5
Comments
@Vadim2S yes fine-tuning is a good idea. I understand this is an issue with input dimensions but with the given code, it should work because dimensions of resnet50 are same in both encoder and decoder part. Thanks!!! |
I just fixed this bug. However, I suggest that you should reduce the dimension of encoder outputs since the output dimension of resnet50 will lead to super heavy model and take more GPU memory to train. For example, you can apply an extra conv1x1 to reduce the encoder tensor dimension. |
@jimleungjing Thanks for fixing it. Yes you are right, changing the backbone to ResNet50 results in more memory cost. Could you please guide if I need to add conv1x1 after each encoder layer (output). Wouldn't that change the dimensions in the decoder too? What changes do I have to make for code to work? Also if you've pretrained weights with backbone 50 on iHormany4 datasets, please share? Thanks again! |
Well, you can add conv1x1 in the decoder to reduce tensor dimension, and rewrite |
Okay. As I noticed fine-tuning works better than training from scratch. Did you train your model on iHarmony4 dataset with backbone Resnet50? |
No, you can try it. |
I am try DIRL with my own little test dataset. In 48 total here is only 2 success and around 45 false positive alarms.
For example: (fake, mask, DIRL result)
The text was updated successfully, but these errors were encountered: