Questions about the squeeze-and-excitation module in PPHGNet series model #3023

thsno02 · 2023-10-30T07:22:28Z

Q: Is there any justification for using the ESE module instead of the SE module in PPHGNetv1?

The design of PPHGNetv1 strikes a balance between computational efficiency and model performance. However, the ESE module has more parameters than the SE module. The ESE module consists of a single fully connected layer without dimension reduction, giving a parameter count of $C \times C$. In contrast, the SE module uses two fully connected layers with a dimension reduction ratio of $r$, giving a parameter count of $C \times C/r + C/r \times C$, which simplifies to $2C^2/r$. If $r$ is greater than 2, the SE module has fewer parameters. The original SE paper recommends setting $r$ to 16.
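To make the comparison concrete, here is a small sketch that evaluates both formulas. The helper names `se_params` and `ese_params` and the channel width of 512 are chosen purely for illustration and do not come from PaddleClas. Biases are excluded by default so the numbers match the expressions above.

```python
def se_params(channels: int, reduction: int, bias: bool = False) -> int:
    """Weights in an SE block: FC(C -> C/r) followed by FC(C/r -> C)."""
    hidden = channels // reduction
    count = channels * hidden + hidden * channels
    if bias:
        count += hidden + channels
    return count


def ese_params(channels: int, bias: bool = False) -> int:
    """Weights in an ESE block: a single FC(C -> C) with no reduction."""
    count = channels * channels
    if bias:
        count += channels
    return count


if __name__ == "__main__":
    C = 512  # hypothetical channel width, chosen only for illustration
    for r in (1, 2, 4, 16):
        print(f"r={r:>2}  SE={se_params(C, r):>7}  ESE={ese_params(C):>7}")
```

At $r = 2$ the two counts coincide, and at $r = 16$ the SE block has one eighth of the ESE block's weights. Parameter count alone does not settle inference speed, though, since SE runs two sequential layers where ESE runs one.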

When considering the model's inference speed, it appears that the SE module would be faster. So, why opt for the ESE module in v1? Additionally, v2 employs an SE module with a reduction ratio of 2 (if I understand correctly). Would the SE module perform better while keeping the number of parameters the same?

From my perspective, the SE module may have an advantage in capturing inter-channel dependencies, given its utilization of dimension reduction as a form of compression. On the other hand, the ESE module might not be as effective in this regard.
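The structural difference behind this intuition can be sketched in plain Python. This is only an illustration under stated assumptions, not the PaddleClas implementation: the function names are hypothetical, the layers are bias-free, and a plain sigmoid is assumed as the gating activation for both variants.

```python
import math
import random


def global_avg_pool(feature_maps):
    """Squeeze step: reduce each H x W channel map to its mean."""
    return [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_maps
    ]


def linear(weights, vector):
    """Bias-free fully connected layer; weights has shape (out, in)."""
    return [sum(w * v for w, v in zip(row, vector)) for row in weights]


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def se_gate(feature_maps, w_reduce, w_expand):
    """SE: pool -> FC(C -> C/r) -> ReLU -> FC(C/r -> C) -> sigmoid."""
    squeezed = global_avg_pool(feature_maps)
    hidden = [max(0.0, v) for v in linear(w_reduce, squeezed)]
    return [sigmoid(v) for v in linear(w_expand, hidden)]


def ese_gate(feature_maps, w_full):
    """ESE: pool -> FC(C -> C) -> sigmoid, with no bottleneck."""
    squeezed = global_avg_pool(feature_maps)
    return [sigmoid(v) for v in linear(w_full, squeezed)]


def rescale(feature_maps, gate):
    """Multiply every value in channel c by gate[c]."""
    return [
        [[g * value for value in row] for row in channel]
        for channel, g in zip(feature_maps, gate)
    ]


def random_matrix(rows, cols, rng):
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


if __name__ == "__main__":
    rng = random.Random(0)
    C, r, H, W = 4, 2, 3, 3  # toy sizes, chosen only for illustration
    x = [random_matrix(H, W, rng) for _ in range(C)]
    se = se_gate(x, random_matrix(C // r, C, rng), random_matrix(C, C // r, rng))
    ese = ese_gate(x, random_matrix(C, C, rng))
    print("SE gate: ", [round(g, 3) for g in se])
    print("ESE gate:", [round(g, 3) for g in ese])
```

In the SE path, every channel weight is computed from only $C/r$ hidden values, whereas the ESE path maps all $C$ pooled values directly to $C$ gates. Whether that bottleneck helps or hurts accuracy is an empirical question this sketch does not answer.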

cuicheng01 · 2023-11-13T07:09:21Z

Hello, thank you for your interest in PP-HGNet. In PP-HGNetV1, we found that the ESE module takes less inference time than the SE module without reducing accuracy, so we adopted the ESE module. However, we later found that neither the ESE module nor the SE module significantly improved results on object detection, semantic segmentation, and other tasks. Therefore, we removed the module in PP-HGNetV2. In summary, PP-HGNetV2 contains neither an SE module nor an ESE module.

thsno02 closed this as completed Jan 11, 2024

paddle-bot bot added the status/close label Jan 11, 2024
