Skip to content
Easy to use CMATERdb datasets converted in NumPy format
Branch: master
Clone or download
Latest commit 5efdea4 Jun 5, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
datasets
images added images May 16, 2019
scripts
.gitignore
LICENSE.md
README.md scripts added Jun 5, 2019

README.md

CMATERdb

Easy to use CMATERdb datasets converted in NumPy format

License: Apache 2.0

About

CMATERdb is the pattern recognition database repository created at the ‘Center for Microprocessor Applications for Training Education and Research’ (CMATER) research laboratory, Jadavpur University, Kolkata 700032, INDIA. This database is free for all non-commercial uses. Please acknowledge CMATER explicitly, whenever you use this database for academic and research purposes. For using some databases, one must also cite relevant research publications, mentioned in this website.

Official Dataset Repository: Link
Shifted Repository as per Google Code Archive (Not Live): Link

Citing CMATERdb 3: Handwritten Indian script character database

If you use any of the CMATERdb datasets in your work, we would appreciate a reference to authors' original papers:

@article{Das:2012:GAB:2161007.2161320,
  author = {Das, Nibaran and Sarkar, Ram and Basu, Subhadip and Kundu, Mahantapas 
            and Nasipuri, Mita and Basu, Dipak Kumar},
  title = {A Genetic Algorithm Based Region Sampling for Selection of Local Features 
          in Handwritten Digit Recognition Application},
  journal = {Appl. Soft Comput.},
  issue_date = {May, 2012},
  volume = {12},
  number = {5},
  month = may,
  year = {2012},
  issn = {1568-4946},
  pages = {1592--1606},
  numpages = {15},
  url = {http://dx.doi.org/10.1016/j.asoc.2011.11.030},
  doi = {10.1016/j.asoc.2011.11.030},
  acmid = {2161320},
  publisher = {Elsevier Science Publishers B. V.},
  address = {Amsterdam, The Netherlands, The Netherlands},
  keywords = {Feature selection, Genetic algorithm, N-Quality consensus, 
  Optimal local regions, Region sampling, Variable sized local regions},
}

@article{Das:2012:SFC:2240301.2240421,
  author = {Das, Nibaran and Reddy, Jagan Mohan and Sarkar, Ram and Basu, Subhadip and Kundu, 
            Mahantapas and Nasipuri, Mita and Basu, Dipak Kumar},
  title = {A Statistical-topological Feature Combination for Recognition of Handwritten Numerals},
  journal = {Appl. Soft Comput.},
  issue_date = {August, 2012},
  volume = {12},
  number = {8},
  month = aug,
  year = {2012},
  issn = {1568-4946},
  pages = {2486--2495},
  numpages = {10},
  url = {http://dx.doi.org/10.1016/j.asoc.2012.03.039},
  doi = {10.1016/j.asoc.2012.03.039},
  acmid = {2240421},
  publisher = {Elsevier Science Publishers B. V.},
  address = {Amsterdam, The Netherlands, The Netherlands},
  keywords = {Character recognition, Feature combination, MPCA, PCA, SVM, Statistical, Topological},
}

News and Updates

IMPORTANT:

  • How to use script added!

The Dataset

  • CMATERdb 3.1.1: Handwritten Bangla numeral database is a balanced dataset of total 6000 Bangla numerals (32x32 RGB coloured, 6000 images), each having 600 images per classs(per digit). Bangla Numerals
  • CMATERdb 3.2.1: Handwritten Devanagari numeral database is a balanced dataset of total 3000 Devanagari numerals (32x32 RGB coloured, 3000 images), each having 300 images per classs(per digit). Devanagari Numerals
  • CMATERdb 3.4.1: Handwritten Telugu numeral database is a balanced dataset of total 3000 Telugu numerals (32x32 RGB coloured, 3000 images), each having 300 images per classs(per digit). Telugu Numerals

Get the data

Script to download images and easy to use functions are on the way!

How to use?

Check out our load function in usage.py

CMATERdb 3.1.1: Handwritten Bangla numeral database

File Examples Download (NumPy format)
Training images with labels 5000 training-images.npz (700KB)
Testing images with labels 1000 testing-images.npz (141KB)

CMATERdb 3.2.1: Handwritten Devanagari numeral database

File Examples Download (NumPy format)
Training images with labels 2500 training-images.npz (347KB)
Testing images with labels 500 testing-images.npz (70KB)

CMATERdb 3.4.1: Handwritten Telugu numeral database

File Examples Download (NumPy format)
Training images with labels 2500 training-images.npz (338KB)
Testing images with labels 500 testing-images.npz (68KB)

License

Both the dataset itself and the contents of this repo are licensed under Apache 2.0 License as given here.

You can’t perform that action at this time.