Skip to content

DB-based Optical Chemical Structure Recognition

License

Notifications You must be signed in to change notification settings

duaibeom/chemOCR

Repository files navigation

ChemOCR(OCSR)

DB(Differentiable Binarization)-based Optical Chemical Structure Recognition


streamlit run demo.py

Backbone

  • Swin
  • ResNet
  • MobileNetV3

Parser

Rule-based method

DATA

Training data sample

Limitation

  • Non-charged atom
  • (Inner bridged) Complex ring is not possible.
  • Specific resolution of image
  • Small text atoms (like Iodin) are not recognized.

TODO

  • Character recognition (pretrained model from MMOCR RCNN)
  • Web front-end (streamlit)
  • Bond direction analysis

References

  1. https://github.com/MhLiao/DB
  2. https://github.com/open-mmlab/mmocr
  3. https://github.com/rdkit/rdkit

About

DB-based Optical Chemical Structure Recognition

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published