Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Indonesian language support #1288

Merged
merged 56 commits into from Aug 26, 2017
Merged
Changes from 1 commit
Commits
Show all changes
56 commits
Select commit Hold shift + click to select a range
c2b4dd7
start working on Indonesian language
geovedi Jul 23, 2017
da98676
use template
geovedi Jul 23, 2017
e2efeb1
added stopwords
geovedi Jul 23, 2017
082e9ad
fixed typo
geovedi Jul 23, 2017
b5de329
added norm_exceptions
geovedi Jul 23, 2017
b80c35b
added norm_exceptions
geovedi Jul 23, 2017
bed8162
added tokenizer_exceptions
geovedi Jul 23, 2017
f6f1567
added lex_attrs
geovedi Jul 23, 2017
d5fd32a
added known currencies
geovedi Jul 23, 2017
3b17eba
added frequency units
geovedi Jul 23, 2017
ba922e3
added ampere hour unit
geovedi Jul 23, 2017
0e590c7
added prefix & suffix rules
geovedi Jul 23, 2017
d0ec484
reverted
geovedi Jul 23, 2017
082e94a
added inflix rules
geovedi Jul 23, 2017
37fa2c8
punctution rules
geovedi Jul 23, 2017
c1f3fe9
updated punctuation rules
geovedi Jul 24, 2017
ad56c91
added tokenizer exceptions list
geovedi Jul 24, 2017
7aad671
enable tokenizer exceptions
geovedi Jul 24, 2017
eaf9cbd
cursed of copy & paste
geovedi Jul 24, 2017
68454c4
added missing import
geovedi Jul 24, 2017
73f6ac9
added hyhen
geovedi Jul 24, 2017
c97f5ae
updated tokenizer exceptions
geovedi Jul 26, 2017
62443d4
enable token match
geovedi Jul 26, 2017
edec51b
update punctuation rules
geovedi Jul 26, 2017
6eee7a7
updated tokenizer exceptions
geovedi Jul 26, 2017
f288964
removed -el from suffix rules
geovedi Jul 26, 2017
63f14ba
added hyphen-suffix rules
geovedi Jul 26, 2017
24a8c8b
added wip lemma dict
geovedi Jul 26, 2017
bbc75da
enable syntax iterator and lemma lookup
geovedi Jul 27, 2017
547973b
wip syntax iterators
geovedi Jul 27, 2017
c194f7a
Merge remote-tracking branch 'upstream/develop' into indonesian
geovedi Jul 27, 2017
6c725e8
updated lemma
geovedi Jul 27, 2017
8b814c6
more exceptions
geovedi Jul 27, 2017
3cca4ed
added lex attrs rules
geovedi Jul 29, 2017
7d96d47
updated like_num
geovedi Jul 29, 2017
4d04898
updated regexp
geovedi Jul 29, 2017
783f7d8
added test set for Indonesian language
geovedi Jul 29, 2017
e5adc26
simplified rules
geovedi Jul 29, 2017
e9af79a
added u-\d+ rules (sports team)
geovedi Jul 30, 2017
bb08d69
added hashtag rule and fixed currency rules
geovedi Jul 30, 2017
2572a9d
Merge remote-tracking branch 'upstream/develop' into indonesian
geovedi Jul 30, 2017
ba07e23
added USD in currency rules
geovedi Aug 2, 2017
4705ae1
Merge remote-tracking branch 'upstream/develop' into indonesian
geovedi Aug 3, 2017
30fd068
hashtag prefix should be handled somewhere else
geovedi Aug 3, 2017
37f19f5
added more currencies based on corpus data
geovedi Aug 3, 2017
cc4772c
reworks
geovedi Aug 3, 2017
c62b49b
Merge remote-tracking branch 'upstream/develop' into indonesian
geovedi Aug 9, 2017
7ae45bf
Merge remote-tracking branch 'upstream/develop' into indonesian
geovedi Aug 18, 2017
fa544e6
Merge remote-tracking branch 'upstream/develop' into indonesian
geovedi Aug 20, 2017
fdf802d
added examples
geovedi Aug 20, 2017
7193c47
direct lookup
geovedi Aug 20, 2017
b7d83f3
indonesian abbr.
geovedi Aug 20, 2017
713d7c0
added indonesian lang test
geovedi Aug 20, 2017
fbc62a0
added {pre,suf,in}fix tests
geovedi Aug 20, 2017
f77443a
reworked
geovedi Aug 20, 2017
58d8078
Merge remote-tracking branch 'upstream/develop' into indonesian
geovedi Aug 25, 2017
File filter...
Filter file types
Jump to…
Jump to file or symbol
Failed to load files and symbols.
+0 −0
Diff settings

Always

Just for now

Merge remote-tracking branch 'upstream/develop' into indonesian

  • Loading branch information...
geovedi committed Aug 20, 2017
commit fa544e6c9ae738a4bec35b23478a457ed6de61bf

This merge commit was added into this branch cleanly.

There are no new changes to show, but you can still view the diff.

ProTip! Use n and p to navigate between commits in a pull request.
You can’t perform that action at this time.