wareya/analyzer

unnamed japanese text analyzer

generates a word frequency list from japanese utf-8 text
depends on kuromoji-unidic-kanaaccent from maven
invoke with: java -jar analyzer.jar mycorpus.txt > myfrequencylist.txt
licensed under a public domain–like permissive license
particles, auxiliary verbs, etc. are blacklisted and do not appear in the output
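
a rough sketch of the counting and blacklisting idea described above, not the analyzer's actual code. it assumes tokenization has already happened (the real program relies on kuromoji for that, which is not shown here). the `Token` class, the `BLACKLIST` contents, and the count-then-word output layout are all hypothetical and chosen only for illustration:

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

class FrequencySketch {
    // Hypothetical token shape: a dictionary form plus a top-level part of speech.
    // The real analyzer gets this information from kuromoji; this class is a stand-in.
    static final class Token {
        final String lemma;
        final String pos;
        Token(String lemma, String pos) {
            this.lemma = lemma;
            this.pos = pos;
        }
    }

    // Assumed blacklist: particles, auxiliary verbs, and punctuation-like symbols.
    // The actual list used by the analyzer may differ.
    static final Set<String> BLACKLIST =
        new HashSet<>(Arrays.asList("助詞", "助動詞", "補助記号"));

    // Count how often each non-blacklisted lemma occurs.
    static Map<String, Integer> count(List<Token> tokens) {
        Map<String, Integer> counts = new HashMap<>();
        for (Token t : tokens) {
            if (BLACKLIST.contains(t.pos)) {
                continue; // skip function words and symbols
            }
            counts.merge(t.lemma, 1, Integer::sum);
        }
        return counts;
    }

    // Order entries by descending count, breaking ties by the word itself.
    static List<Map.Entry<String, Integer>> sorted(Map<String, Integer> counts) {
        List<Map.Entry<String, Integer>> entries = new ArrayList<>(counts.entrySet());
        entries.sort((a, b) -> {
            int byCount = Integer.compare(b.getValue(), a.getValue());
            return byCount != 0 ? byCount : a.getKey().compareTo(b.getKey());
        });
        return entries;
    }

    public static void main(String[] args) {
        // Hand-written tokens standing in for tokenizer output.
        List<Token> tokens = Arrays.asList(
            new Token("猫", "名詞"), new Token("が", "助詞"),
            new Token("好き", "形状詞"), new Token("だ", "助動詞"),
            new Token("猫", "名詞"), new Token("は", "助詞"),
            new Token("寝る", "動詞"));
        for (Map.Entry<String, Integer> e : sorted(count(tokens))) {
            // Illustrative output layout: count, tab, word.
            System.out.println(e.getValue() + "\t" + e.getKey());
        }
    }
}
```

counting dictionary forms rather than surface forms keeps conjugated variants of one word in a single row; whether the analyzer does exactly this is an assumption of the sketch.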

use the companion program to combine lists made from different sources: https://github.com/wareya/normalizer
