Are images with an aspect ratio greater than 10 or a text length greater than 25 discarded outright during text recognition training? #5017

Closed
CharlesWu123 opened this issue Dec 22, 2021 · 4 comments
@CharlesWu123

https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.4/doc/doc_ch/FAQ.md#15
https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.4/doc/doc_ch/FAQ.md#210
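Hello. From these two FAQ entries I understand that training images with an aspect ratio greater than 10, or with a text length over 25, are discarded. Where exactly is this implemented in the code?
For document-image training, most text-line images have an aspect ratio above 10 and text longer than 25 characters. How should this be handled if I still want to use the pretrained model?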

@littletomatodonkey
Collaborator

It is implemented here:

if len(text) == 0 or len(text) > self.max_text_len:
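To illustrate what this check does, here is a minimal standalone sketch. It is not the PaddleOCR implementation: the function `encode_label`, the `char_to_id` dictionary, and the toy character set are hypothetical names introduced only for this example. The only part taken from the thread is the length condition itself.

```python
from typing import Dict, List, Optional


def encode_label(text: str, char_to_id: Dict[str, int],
                 max_text_len: int) -> Optional[List[int]]:
    """Return the encoded label, or None when the sample should be skipped.

    Hypothetical helper: only the length check mirrors the line quoted above.
    """
    # Same condition as in the thread: empty or over-long labels are rejected.
    if len(text) == 0 or len(text) > max_text_len:
        return None
    # Assumption: characters missing from the dictionary are silently ignored.
    ids = [char_to_id[ch] for ch in text if ch in char_to_id]
    return ids or None


# Toy character set, for illustration only.
charset = {ch: i for i, ch in enumerate("abcdefghijklmnopqrstuvwxyz ")}
long_line = "a" * 30

print(encode_label(long_line, charset, max_text_len=25))              # skipped
print(encode_label(long_line, charset, max_text_len=50) is not None)  # kept
```

With a limit of 25 the 30-character label is skipped, and raising the limit to 50 keeps it, which is the effect of increasing the maximum text length in the configuration.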

You can make the image shape larger, for example [3, 32, 640], and then increase the max_text_length parameter accordingly.
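As a rough way to see why a wider shape helps with long text lines, the sketch below assumes the common resize-and-pad scheme for recognition inputs: scale the image to the target height, keep the aspect ratio, and cap the width at the configured maximum. This scheme and the helper name `resized_width` are assumptions for illustration and have not been checked against the PaddleOCR source.

```python
import math
from typing import Tuple


def resized_width(img_h: int, img_w: int,
                  image_shape: Tuple[int, int, int]) -> int:
    """Width after scaling to the target height, capped at the maximum width.

    Hypothetical helper assuming an aspect-preserving resize followed by padding.
    """
    _, target_h, max_w = image_shape
    ratio = img_w / float(img_h)
    # Keep the aspect ratio unless the result would exceed max_w.
    return min(max_w, int(math.ceil(target_h * ratio)))


# A 48 x 720 text line has an aspect ratio of 15.
print(resized_width(48, 720, (3, 32, 320)))  # capped at 320, so the text is squeezed
print(resized_width(48, 720, (3, 32, 640)))  # 480, aspect ratio preserved
```

Under these assumptions, a maximum width of 320 at height 32 preserves aspect ratios up to 10, while 640 preserves ratios up to 20.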

@CharlesWu123
Author

@littletomatodonkey OK, thanks.

@surdldz

surdldz commented Jan 17, 2022

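It is implemented here:

if len(text) == 0 or len(text) > self.max_text_len:

You can make the image shape larger, for example [3, 32, 640], and then increase the max_text_length parameter accordingly.

Will enlarging the shape slow down inference?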

@paddle-bot-old

Since you haven't replied for more than 3 months, we have closed this issue/pr.
If the problem is not solved or there is a follow-up one, please reopen it at any time and we will continue to follow up.
It is recommended to pull and try the latest code first.
