文本检测模型微调后,检测效果较差
#12280
Replies: 3 comments 1 reply
-
猜测和训练数据相关,训练数据对文本框的标注是否都为word-level?模型会拟合到训练数据中,如果需要line-level的推理结果,需要同步提供行级别的标注。 |
Beta Was this translation helpful? Give feedback.
1 reply
-
我也遇到了类似的问题,是否是数据集过少的原因呢?我目前只有100张训练集,但是不明白为什么训练得到的模型在训练集上测试的效果还那么差(出现漏检、检测不连续)。 |
Beta Was this translation helpful? Give feedback.
0 replies
-
和数据量大小有关也和模型结构有关,ppocr使用了轻量化的技术,参数量小意味着对反向传播很敏感,容易出现性能的起伏。可以尝试使用更深的backbone,应该会有改善 |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
参考:https://github.com/PaddlePaddle/PaddleOCR/blob/main/doc/doc_ch/finetune.md 进行模型微调
![254 (2)](https://private-user-images.githubusercontent.com/61486634/323094675-39bccfae-b018-4130-95ed-b2894d4a3a5a.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MTk1NTQ4OTgsIm5iZiI6MTcxOTU1NDU5OCwicGF0aCI6Ii82MTQ4NjYzNC8zMjMwOTQ2NzUtMzliY2NmYWUtYjAxOC00MTMwLTk1ZWQtYjI4OTRkNGEzYTVhLmpwZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA2MjglMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwNjI4VDA2MDMxOFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTRjZmFkNThlMGFlMjZlMmMwMzU3Y2ZiNGZmNTNlODFkODI2ZDkxMzMxOTg0YTIxMTY0MTUyNGVkNjU3YjZhMDImWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0._BFAFWotxSrNg-rCQPTDR-1etz4A9ZBcp_tJZ-lh4TU)
![254](https://private-user-images.githubusercontent.com/61486634/323094705-dfbcf994-aa68-4ee1-859b-1ec42210012b.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MTk1NTQ4OTgsIm5iZiI6MTcxOTU1NDU5OCwicGF0aCI6Ii82MTQ4NjYzNC8zMjMwOTQ3MDUtZGZiY2Y5OTQtYWE2OC00ZWUxLTg1OWItMWVjNDIyMTAwMTJiLmpwZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA2MjglMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwNjI4VDA2MDMxOFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTljMjg3NzQ5N2RlNzFjOGJlNWI2NjBiNWNmNDQ2NzExMGQ5ODg2MDJmNzI1MTM4MzA2Njg3YzZjZWQxNjYyNDgmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.B9KrofKMSOIK0l5sjzwHjBhymZzZSPHejgKOLKW3vWk)
![img_10 (2)](https://private-user-images.githubusercontent.com/61486634/323094723-581dd78b-953f-423f-ab95-e5964c2b2ebe.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MTk1NTQ4OTgsIm5iZiI6MTcxOTU1NDU5OCwicGF0aCI6Ii82MTQ4NjYzNC8zMjMwOTQ3MjMtNTgxZGQ3OGItOTUzZi00MjNmLWFiOTUtZTU5NjRjMmIyZWJlLmpwZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA2MjglMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwNjI4VDA2MDMxOFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTY5NWEyMjU5NjEwZjExODEyNTQ3ZGQ1NjRkN2EyNzE2YTdkODQwM2E2N2M4OTg0MDZiMGVmZDEwZGQ2NjEwYWEmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.rhjsbDfZs7eKgJXA715fx5rDsVI6CiwUcNMHrYnfKFk)
![img_10](https://private-user-images.githubusercontent.com/61486634/323094750-eb77cb7f-0460-4ce2-94a8-36f32f6249ae.jpg?jwt=eyJhbGciOiJIUzI1NiIsInR5cCI6IkpXVCJ9.eyJpc3MiOiJnaXRodWIuY29tIiwiYXVkIjoicmF3LmdpdGh1YnVzZXJjb250ZW50LmNvbSIsImtleSI6ImtleTUiLCJleHAiOjE3MTk1NTQ4OTgsIm5iZiI6MTcxOTU1NDU5OCwicGF0aCI6Ii82MTQ4NjYzNC8zMjMwOTQ3NTAtZWI3N2NiN2YtMDQ2MC00Y2UyLTk0YTgtMzZmMzJmNjI0OWFlLmpwZz9YLUFtei1BbGdvcml0aG09QVdTNC1ITUFDLVNIQTI1NiZYLUFtei1DcmVkZW50aWFsPUFLSUFWQ09EWUxTQTUzUFFLNFpBJTJGMjAyNDA2MjglMkZ1cy1lYXN0LTElMkZzMyUyRmF3czRfcmVxdWVzdCZYLUFtei1EYXRlPTIwMjQwNjI4VDA2MDMxOFomWC1BbXotRXhwaXJlcz0zMDAmWC1BbXotU2lnbmF0dXJlPTc1NTE4Yjg5NGYwNTBhMjM4NWZkMDIyOTcxZTgzOWRkOGQzODA0MTY0YTE5YjQzZWU5ZjNkMWEwNmJiOTcwY2UmWC1BbXotU2lnbmVkSGVhZGVycz1ob3N0JmFjdG9yX2lkPTAma2V5X2lkPTAmcmVwb19pZD0wIn0.1fXp4z4XRdSKhfg74b-dmuhIMSrGSOJeOs6CYZCTev4)
使用pp_ocrV3 student模型进行模型微调,训练500epoch,检测效果较差,出现检测框不连续、误检、漏检的现象,请问可能是什么原因。模型输出效果如下,后者为预训练模型输出(较好)
Beta Was this translation helpful? Give feedback.
All reactions