Skip to content

The chemical formula extraction parses PDF files and extracts all checmical entities from these files.

License

Notifications You must be signed in to change notification settings

gipplab/chem_formula_extractor

Repository files navigation

Chemical Data Extractor

Actions Status Actions Status License: MIT Code style: black

This chemical data extractor reads XML files parsed by Grobid and extracts chemical entities from these files.

Getting Started

Install the dependencies from requirements.txt. Note that due to dependencies a direct installation may not be possible in that case run the command

python3 -m pip install chemdataextractor2 --use-feature=2020-resolver

to install chemdataextractor2 with all of its dependencies.

Contributing

Fork the repo, make changes and send a PR. We'll review it together!

License

This project is licensed under the terms of MIT license. Please see the LICENSE file for details.

About

The chemical formula extraction parses PDF files and extracts all checmical entities from these files.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages