Docs | Install Guide | Tutorial
CarpeDM is a general library for downloading, viewing, and manipulating image data. Originally developed as a ChARacter shaPE Data Manager, CarpeDM aims to make Japanese character shape (字形) data and other image datasets more accessible to machine learning researchers.
Datasets Currently Available for DownloadID | Dataset |
---|---|
pmjtc | | Pre-Modern Japanese Text Character Shapes Dataset (日本古典籍字形データセット), | provided by the Center for Open Data in the Humanities (CODH). |
Though still in the early stages of development, a high-level interface is also provided for (1) automatic model-ready data generation and (2) flexible training of models with a variety of deep learning frameworks.
Currently supported deep learning frameworks:
To install CarpeDM use `pip`:
$ pip install carpedm
See the installation guide for more details.
A brief overview of the CarpeDM interface is given below.
For a complete guide on usage and available functionality, please refer to the documentation
If you would like to use CarpeDM with your data, please follow the directory structure of the sample dataset. The current version of CarpeDM supports CSV files that follow the naming scheme and format here, for example.
Contributions are welcome! Please see the contribution guide
We use SemVer for versioning. For the versions available, see the tags on this repository.
This project is licensed under the MIT License - see the LICENSE file for details.
- This project was developed, in part, during an internship at the National Institute of Informatics, Tokyo.