R package for Korean NLP
HTML Java R
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
R corrected message May 26, 2017
data added tag datasets Aug 30, 2012
etcs - fix typo Jan 3, 2017
inst/java - NFKC normalizer for input sentence. Dec 13, 2016
java
man - remove install_NIADic help and exporting Jan 11, 2017
tests
vignettes
.Rbuildignore - NFKC normalizer for input sentence. Dec 13, 2016
.gitignore
.travis.yml - remove useInsighterDic, useWoorimalsamDic functions. Nov 16, 2016
DESCRIPTION HanguJamoAutomata issue is resolved. Jul 12, 2018
MD5 first commit Nov 24, 2011
NAMESPACE - added get_dictionary() Nov 17, 2016
NEWS
README.md added licence Dec 21, 2016
cran-comments.md - can run multiple sentences inputs on extratNouns,... Dec 14, 2016

README.md

KoNLP

License: GPL v3 CRAN_Status_Badge CRAN Downloads CRAN Downloads Total Travis-CI Build Status Join the chat at https://gitter.im/KoNLP/KoNLP

POS Tagger and Morphological Analyzer for Korean text based research. It provides tools for corpus linguistics research such as Keystroke converter, Hangul automata, Concordance, and Mutual Information. It also provides a convenient interface for users to apply, edit and add morphological dictionary selectively.

  • Interfacing with opensource Hannanum analyzer.
  • Some twiks are applied on Hannanum analyzer for bigger or flexible user dictionary for Sejong project and NIADic.
  • Many other functions for Korean text analysis like keystroke conversion, is.jamo, is.hangul, Hangul antomata...

Some of Korean tutorials are on my blog, English pages are mainly on wiki.

To install from CRAN, use

install.packages('KoNLP')

To install from GitHub, use

install.packages('devtools')
devtools::install_github('haven-jeon/KoNLP')