You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Your optimization of mtcnn is specifically designed for 720p input image. While other size may be scaled or padded to fit into 720p. The failure case may be an input of portrait rather than landscape. In this case, images are heavily squeezed and make it impossible to detect small faces. Is there any plan to make it input independent?
The text was updated successfully, but these errors were encountered:
You are right. I've optimized MTCNN PNet based for 1280x720 image inputs, and it does not work very well for portrait inputs. But I don't plan to update the code to better support other cases for now. I think interested readers should be able to modify 'mtcnn/det1_relu.prototxt' and 'utils/mtcnn.py' by themselves.
In order to handle input images of multiple different input shapes, one might consider prebuilding TRT-PNet with a few different input shapes and select the best one at runtime.
Actually, I am trying to find a face alignment model for efficient batch inference. Maybe mtcnn is not the best choice. Anyway, your demo helped me a lot.
Your optimization of mtcnn is specifically designed for 720p input image. While other size may be scaled or padded to fit into 720p. The failure case may be an input of portrait rather than landscape. In this case, images are heavily squeezed and make it impossible to detect small faces. Is there any plan to make it input independent?
The text was updated successfully, but these errors were encountered: