issue_71: add documentation #232

lalital · 2019-06-14T07:49:57Z

According to issue #71, docstrings documentation (for pythainlp v2.0.5 (10 May 2019)) was added for the following modules:

pythainlp.tokenize
pythainlp.tag
pythainlp.word_vector
pythainlp.ulmfit
pythainlp.util
pythainlp.soundex
pythainlp.spell
pythainlp.transliterate
pythainlp.corpus
pythainlp.tools
pythainlp.summarize
pythainlp.spell

Code examples were added to most of the functions/methods (i.e. tokenize.word_tokenize, tag.pos_tag). Rerturn type for functions/methods were specified. Also, brief explaination of the functionality for important functions/methods.

Fixing issues:

fix formatting issues (pep8) (due 15 June)

Todos:

High Priority

Due date Mon 10 June 2019

pythainlp.tokenize

pythainlp.tag

pythainlp.word_vector

todo_word_vector_1: give example for most_similar_cosmul
todo_word_vector_2: give example for doesnt_match
todo_word_vector_3: give example for similarity
todo_word_vector_4: give example for sentence_vectorizer

pythainlp.ulmfit

todo_ulmfit_1: provide example for document_vector
todo_ulmfit_2: provide example for merge_wgts (Canceled)
todo_ulmfit_3: explain and show example for pythainlp.ulmfit.ThaiTokenizer

pythainlp.util

Medium Priority

Due date: Thu 13 June 2019

pythainlp.soundex

todo_soundex_1: provide more examples for metasound
todo_soundex_2: provide examples for udom83
todo_soundex_3: provide examples for lk82
todo_soundex_4: provide examples for soundex
todo_soundex_5: briefly explain lk82
todo_soundex_6: briefly explain udom82
todo_soundex_7: briefly explain metasound

pythainlp.spell

pythainlp.transliterate

todo_transliterate_1: format docstring for romanize
todo_transliterate_2: format docstring for transliterate
todo_transliterate_3: provide examples for romanize
todo_transliterate_4: provide examples for transliterate
todo_transliterate_5: add reference

Low Priority

Due date: Thu 13 June 2019

pythainlp.corpus

pythainlp.tools

provide example for tools.get_full_data_path
provide example for tools.get_pythainlp_data_path
provide examples for tools.get_pythainlp_path

pythainlp.summarize

provide examples for summarize.summarize
briefly explain functionality of summarize.summarize

Merge from 2.0.5 release

Specify that package `pythainlp` is not in the same directory as thte configuration file `docs/conf.py` Reference: https://medium.com/@eikonomega/getting-started-with-sphinx-autodoc-part-1-2cebbbca5365

This reverts commit ec6e550.

…ewmm)and cite the reference

…he reference

…cu and cite the reference

…eference

docs/api/tokenize.rst

p16i · 2019-06-20T05:59:38Z

Hi,

I have a question regarding writing consistency. Currently, we tend to use both segmentation and tokenization. Although these words are interchangeable, do you think it's better if we're strict with one of them, i.e. tokenization?

bact

this is fantastic.

cstorm125

excellent job

korakot

Good details krub.

bact and others added 30 commits May 9, 2019 17:56

Merge pull request #220 from PyThaiNLP/dev

8dbd79a

Merge from 2.0.5 release

specify that package is not in the same directory

f2461c7

Specify that package `pythainlp` is not in the same directory as thte configuration file `docs/conf.py` Reference: https://medium.com/@eikonomega/getting-started-with-sphinx-autodoc-part-1-2cebbbca5365

todo_tokenize_2: provide example for word_tokenize

f2778c2

todo_tokenize_2: formatting example

ec6e550

Revert "todo_tokenize_2: formatting example"

ddd06f5

This reverts commit ec6e550.

todo_tokenize_2.2: provide example for word_tokenize

36bad18

todo_tokenize_3: provide example for syllabus_tokenize

b188cc4

todo_tokenize_2.3: add return type

e438b5c

todo_tokenize_1: provide example for sent_tokenize

0248257

todo_tokenize_3: formatting

9521661

todo_tokenize_4: provide example for subword_tokenize

4246db2

todo_tokenize_3: formatting

dcae38d

todo_tokenize_11: fix docstring format in tcc.py

db7d485

todo_tokenize_5: briefly explain the algorithm of maximum matching (n…

0b35f44

…ewmm)and cite the reference

todo_tokenize_9: briefly explain the algorithm of multicut and cite t…

99c0fcb

…he reference

todo_tokenize_tcc: format module docstring

3c8d3a9

todo_tokenize_tcc: add return type

1907404

todo_tokenize_init: format docstring for refering a library

2ddf640

todo_tokenize_init: format docstring for referring a library

f08af28

todo_tokenize_init: remove typos

705a3bb

todo_tokenize_Tokenizer: add docstring and example

d2c8b9a

todo_tokenize: shows all module in the documentation page

e7073dc

todo_tokenize: briefly explain the algorithm of deepcut, longest, pyi…

099420b

…cu and cite the reference

todo_tokenize_7: briefly explain the algorithm of etcc and cite the r…

190fb1b

…eference

todo_tokenize_pyicu: formatting and explain briefly

418ca12

todo_tokenize_init: format example docstring

43da122

todo_tokenize_init: format example docstring

93104a7

todo_tag_pos_tag: move default engine to the first in the list

5516149

todo_tag: provide list of NER, POS tag

707d77d

todo_tag: fix typo

9771458

PyThaiNLP deleted a comment from pep8speaks Jun 16, 2019

fix pep8 issues

f1ece8c

PyThaiNLP deleted a comment from pep8speaks Jun 16, 2019

fix pep8 issues

ae7dc22

PyThaiNLP deleted a comment from pep8speaks Jun 16, 2019

fix pep8 issues, invalid escape sequence

86f1d3e

PyThaiNLP deleted a comment from pep8speaks Jun 16, 2019

Chakri Lowphansirikul added 7 commits June 16, 2019 23:10

format docstring

f98caee

fix typo

e2321ec

format docstring

bdc535d

add rerferences sectttion for word_vector package

bdbbf2e

format docstring

5753e17

format docstring

59618f8

fix typo

65c7f71

p16i reviewed Jun 20, 2019

View reviewed changes

docs/api/tokenize.rst Outdated Show resolved Hide resolved

Chakri Lowphansirikul added 5 commits June 21, 2019 10:11

Edit the term to "Tokenzation Engines"

256927b

format docstring

82b6bea

format .rst files

9f1c2fd

fix sphinx warning by adding a blank line

e45d556

add function description to pythainlp.tokenize.word_tokenize

8cecb1a

wannaphong approved these changes Jun 23, 2019

View reviewed changes

wannaphong requested a review from bact July 9, 2019 07:14

bact approved these changes Jul 17, 2019

View reviewed changes

wannaphong requested review from cstorm125 and korakot July 17, 2019 16:01

cstorm125 approved these changes Jul 18, 2019

View reviewed changes

korakot approved these changes Jul 27, 2019

View reviewed changes

wannaphong merged commit 0a57ae9 into 2.0 Jul 27, 2019

bact deleted the issue71_add_documentation branch September 7, 2019 22:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

issue_71: add documentation #232

issue_71: add documentation #232

Uh oh!

lalital commented Jun 14, 2019 •

edited by bact

Loading

Uh oh!

Uh oh!

p16i commented Jun 20, 2019

Uh oh!

bact left a comment

Uh oh!

cstorm125 left a comment

Uh oh!

korakot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

issue_71: add documentation #232

issue_71: add documentation #232

Uh oh!

Conversation

lalital commented Jun 14, 2019 • edited by bact Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Fixing issues:

Todos:

Uh oh!

Uh oh!

p16i commented Jun 20, 2019

Uh oh!

bact left a comment

Choose a reason for hiding this comment

Uh oh!

cstorm125 left a comment

Choose a reason for hiding this comment

Uh oh!

korakot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

8 participants

lalital commented Jun 14, 2019 •

edited by bact

Loading