jieba-rs

The Jieba Chinese Word Segmentation Implemented in Rust

Installation

Add it to your Cargo.toml:

[dependencies]
jieba-rs = "0.4"

and you are good to go. If you are using the Rust 2015 edition, you also need to add extern crate jieba_rs; to your crate root.
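
For the 2015 edition, that amounts to a single line at the top of the crate root (src/lib.rs or src/main.rs):

extern crate jieba_rs;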

Example

use jieba_rs::Jieba;

fn main() {
    // Jieba::new() loads the embedded default dictionary.
    let jieba = Jieba::new();
    // The second argument toggles HMM-based recognition of words
    // not found in the dictionary; it is disabled here.
    let words = jieba.cut("我们中出了一个叛徒", false);
    assert_eq!(words, vec!["我们", "中", "出", "了", "一个", "叛徒"]);
}

Enabling Additional Features

  • default-dict feature enables the embedded dictionary; this feature is enabled by default
  • tfidf feature enables the TF-IDF keyword extractor (see the sketch below)
  • textrank feature enables the TextRank keyword extractor

[dependencies]
jieba-rs = { version = "0.4", features = ["tfidf", "textrank"] }
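
With tfidf enabled, keywords can be extracted through the crate's KeywordExtract interface. The sketch below assumes the TFIDF::new_with_jieba constructor and extract_tags method as exposed around the 0.4/0.5 releases; the return type of extract_tags (plain strings vs. keyword-weight pairs) has changed between versions, so printing with {:?} keeps the example version-agnostic:

use jieba_rs::{Jieba, KeywordExtract, TFIDF};

fn main() {
    let jieba = Jieba::new();
    // Build a TF-IDF keyword extractor backed by the segmenter above.
    let extractor = TFIDF::new_with_jieba(&jieba);
    // Top 3 keywords; the empty Vec applies no part-of-speech filter.
    let keywords = extractor.extract_tags("今天纽约的天气真好啊", 3, vec![]);
    println!("{:?}", keywords);
}

The textrank feature works the same way, via a TextRank extractor built with TextRank::new_with_jieba.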

Run benchmark

cargo bench --all-features

Benchmark: Compare with cppjieba

License

This work is released under the MIT license. A copy of the license is provided in the LICENSE file.
