Process CASIA MPF Files #2

brucegarro · 2019-07-04T19:33:53Z

Hello @lucaskjaero,
I have a project similar to yours where I've implemented some Chinese character recognition models using the CASIA data sets. Like you, I've used the CASIA competition GNT files, but I believe it should be easier to build performant models on the HWDB1.X and OLHWDB1.X data sets because they are five times larger. Unfortunately, those data sets use a different file format, MPF. Do you have any idea how to process these files using Python?

Datasets:
http://www.nlpr.ia.ac.cn/databases/handwriting/Download.html

My Project:
https://github.com/brucegarro/chinese-character-recognition

lucaskjaero · 2019-07-08T17:39:04Z

Hi @brucegarro,
I see there's a file specification here. You can read these files in Python as binary data and unpack the fields with the standard library's struct module. In this project, I do this here, which hopefully is a decent example.
Let me know if that helps -- I can see about implementing it here if it doesn't.
Best,
Lucas
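
For illustration, the struct approach suggested above might look something like the sketch below. It is not code from either project. The header layout is an assumption based on my reading of the CASIA MPF description: a 4-byte header size, an 8-byte format code, a free-text illustration of header size minus 62 bytes, a 20-byte code type, a 2-byte code length, a 20-byte data type, a 4-byte sample count and a 4-byte dimensionality, all little-endian, followed by one label plus one feature vector per sample. The names `read_mpf`, `_text` and `_DTYPE_FORMATS` are hypothetical, and the data type strings in the lookup table are guesses, so please check everything against the specification before relying on it.

```python
import struct

# Assumed mapping from the header's data type string to a struct format code.
# These strings are guesses; extend or correct them against the specification.
_DTYPE_FORMATS = {"unsigned char": "B", "short": "h", "float": "f"}


def _text(raw):
    """Decode a fixed-width, NUL-padded header field into a plain string."""
    return raw.split(b"\x00", 1)[0].decode("ascii", errors="replace").strip()


def read_mpf(stream):
    """Parse an MPF stream opened in binary mode (layout is an assumption).

    Returns (header, samples), where samples is a list of
    (label_bytes, feature_tuple) pairs.
    """
    (header_size,) = struct.unpack("<i", stream.read(4))
    format_code = _text(stream.read(8))
    # The fixed-width fields are assumed to total 62 bytes; the rest is free text.
    illustration = _text(stream.read(header_size - 62))
    code_type = _text(stream.read(20))
    (code_length,) = struct.unpack("<h", stream.read(2))
    data_type = _text(stream.read(20))
    sample_count, dimensionality = struct.unpack("<ii", stream.read(8))

    # One record is a label of code_length bytes plus dimensionality values.
    record = struct.Struct("<%d%s" % (dimensionality, _DTYPE_FORMATS[data_type]))
    samples = []
    for _ in range(sample_count):
        label = stream.read(code_length)  # raw bytes, left undecoded here
        features = record.unpack(stream.read(record.size))
        samples.append((label, features))

    header = {
        "format_code": format_code,
        "illustration": illustration,
        "code_type": code_type,
        "code_length": code_length,
        "data_type": data_type,
        "sample_count": sample_count,
        "dimensionality": dimensionality,
    }
    return header, samples
```

To use it, open a file in binary mode and pass the handle in, for example `with open(path, "rb") as f: header, samples = read_mpf(f)`. Labels are returned as raw bytes because the right text encoding depends on the code type field. For GB-coded labels, `label.decode("gbk")` is probably the right call, but that is also an assumption.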

brucegarro · 2019-07-08T20:42:41Z

Thank you for your response @lucaskjaero 🙏
