Indonesian Manually Tagged Corpus
Latest commit 557e634 Sep 9, 2016


Data Format

The corpus is distributed as a tab-separated file (.tsv). Each line holds one token and its part-of-speech tag, separated by a single tab character (\t). Sentences are separated by one empty line.
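
The format described above can be read with a few lines of Python. The following is a minimal sketch, not part of the corpus distribution: the function name `read_tagged_sentences`, the sample tokens, and the tags `PRP` and `VB` are illustrative assumptions, and the sketch assumes every non-empty line contains exactly one token and one tag.

```python
from typing import Iterable, Iterator, List, Tuple

# One sentence is a list of (token, tag) pairs.
Sentence = List[Tuple[str, str]]


def read_tagged_sentences(lines: Iterable[str]) -> Iterator[Sentence]:
    """Yield sentences from lines in 'token<TAB>tag' format.

    An empty (or whitespace-only) line marks a sentence boundary.
    A non-empty line without a tab raises ValueError.
    """
    sentence: Sentence = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line.strip():
            # Blank line: close the current sentence, if any.
            if sentence:
                yield sentence
                sentence = []
            continue
        # Split on the first tab only, so the tag is everything after it.
        token, tag = line.split("\t", 1)
        sentence.append((token, tag))
    # The last sentence may not be followed by a blank line.
    if sentence:
        yield sentence


# Hypothetical sample data; the tokens and tags are for illustration only.
sample = "Saya\tPRP\nmakan\tVB\n\nDia\tPRP\ntidur\tVB\n"
sentences = list(read_tagged_sentences(sample.splitlines()))
```

To read the corpus itself, pass an open file object instead of the sample lines, for example one opened with `encoding="utf-8"`; the file name depends on where the corpus was downloaded.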


  • Ruli Manurung
  • Arawinda Dinakaramani
  • Fam Rashel
  • Andry Luthfi


For publication and more details about this work, please visit


This work is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. To view a copy of this license, visit