Super compact Japanese tokenizer in Objective-C
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
forRegexKitLite
LICENCE.txt
README
TinySegmenter.h
TinySegmenter.m

README

TinySegmenter.m -- Super compact Japanese tokenizer in Objective-C

HOW TO USE

	1. CocoaOniguruma
		Add all .h, .c and .m files of under "Classes".
		( For details, see http://github.com/psychs/cocoaoniguruma )

	2. TinySegmenter
		Add TinySegmenter.h and TinySegmenter.m files  under "Classes".
		Import the header file.
		    #import "TinySegmenter.h"
		
		# If use CocoaOniguruma as a Framework
			
			TinySegmenter.h
				// #import "OnigRegexp.h" <- comment out
				#import "CocoaOniguruma/OnigRegexp.h" <- uncomment

	3. Test
		TinySegmenter* segmenter = [ [ TinySegmenter alloc ] init ];
		NSArray* segs = [ segmenter segment: @"これはテストですよ" ];
		NSLog(@"%@", [ segs componentsJoinedByString: @"|" ]);
		// これ|は|テスト|です|よ


* for RegexKitLite

	1. RegexKitLite
		Add RegexKitLite.h and RegexKitLite.m under "Classes".
		Add the linker option "-licucore".
		( For details, see http://regexkit.sourceforge.net/RegexKitLite/ )

	2. TinySegmenter
		Add forRegexKitLite/TinySegmenter.h and forRegexKitLite/TinySegmenter.m under "Classes".
			:
			:
			: