Skip to content

SimulatedANeal/carpedm

Repository files navigation

Logo

CarpeDM: Sieze the Data Manager

Latest Version

Continuous Integration Testing

License

Docs

Docs | Install Guide | Tutorial

Description

CarpeDM is a general library for downloading, viewing, and manipulating image data. Originally developed as a ChARacter shaPE Data Manager, CarpeDM aims to make Japanese character shape (字形) data and other image datasets more accessible to machine learning researchers.

Datasets Currently Available for Download
ID Dataset
pmjtc | Pre-Modern Japanese Text Character Shapes Dataset (日本古典籍字形データセット), | provided by the Center for Open Data in the Humanities (CODH).

Though still in the early stages of development, a high-level interface is also provided for (1) automatic model-ready data generation and (2) flexible training of models with a variety of deep learning frameworks.

Currently supported deep learning frameworks:

Installation

To install CarpeDM use `pip`:

$ pip install carpedm

See the installation guide for more details.

Overview

A brief overview of the CarpeDM interface is given below.

For a complete guide on usage and available functionality, please refer to the documentation

Interface

Using Your Data

If you would like to use CarpeDM with your data, please follow the directory structure of the sample dataset. The current version of CarpeDM supports CSV files that follow the naming scheme and format here, for example.

Contributing

Contributions are welcome! Please see the contribution guide

Versioning

We use SemVer for versioning. For the versions available, see the tags on this repository.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgments