Skip to content

serge724/doc2data

main
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 
 
 

doc2data

PyPI - Version PyPI - Python Version Code style: black Hatch project


About doc2data

doc2data is a Python library that provides functionality to train deep learning models for various document processing tasks.

Currently, models can be trained for four tasks:

  1. Page rotation
  2. Page cropping
  3. Document (multi-page) classification
  4. Token classification

Please note that doc2data is currently in a prototype stage.

Installation

pip install doc2data

Documentation

The documentation can be found here.

License

doc2data is distributed under the terms of the Apache-2.0 license.

Credits

Prototypefund Federal Ministry of Education and Research

About

Integrated document processing with machine learning.

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages