Async Japanese tokenizer native plugin for React Native that works on iOS and Android. It tokenizes Japanese text in background thread so it won't block UI. It comes in handy for such as indexing a lot of texts for full-text search.
It utilizes CFStringTokenizer
to tokenize strings into words.
The implementation is based on TinySegmenter. It is lightweight word segmenter which works without dictionaries.
$ npm install react-native-japanese-tokenizer --save
$ react-native link react-native-japanese-tokenizer
- In XCode, in the project navigator, right click
Libraries
➜Add Files to [your project's name]
- Go to
node_modules
➜react-native-japanese-tokenizer
and addRNJapaneseTokenizer.xcodeproj
- In Xcode, in the project navigator, select your project. Add
libRNJapaneseTokenizer.a
to your project'sBuild Phases
➜Link Binary With Libraries
- Run your project (
Cmd+R
)
- Open up
android/app/src/main/java/[...]/MainActivity.java
- Add
import dog.craftz.japanese_tokenizer.RNJapaneseTokenizerPackage;
to the imports at the top of the file - Add
new RNJapaneseTokenizerPackage()
to the list returned by thegetPackages()
method
- Append the following lines to
android/settings.gradle
:include ':react-native-japanese-tokenizer' project(':react-native-japanese-tokenizer').projectDir = new File(rootProject.projectDir, '../node_modules/react-native-japanese-tokenizer/android')
- Insert the following lines inside the dependencies block in
android/app/build.gradle
:compile project(':react-native-japanese-tokenizer')
Using Promise
:
import Tokenizer from 'react-native-japanese-tokenizer'
var text = "週休七日で働きたい"
Tokenizer.tokenize(text).then(tokens => {
...
})
async/await
:
import Tokenizer from 'react-native-japanese-tokenizer'
var text = "週休七日で働きたい"
const tokens = await Tokenizer.tokenize(text)
Also check out the test to learn more.
MIT License. By Fukumoto Riku (@幸運18210).
Note that this module is based on following libraries with respective license. Thank you for the fantastic works!
- TinySegmenter: Javascriptだけで実装されたコンパクトな分かち書きソフトウェア
- BSD-3-Clause
- TinySegmenter for Java by takscape
- Public Domain