-
Notifications
You must be signed in to change notification settings - Fork 326
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Fixes for [979] and adjusting Levenshtein (#982)
* Initial releases with unit tests and doctests * Added sections and preliminary documentation for: Scansion of Poetry About the use of macrons in poetry HexameterScanner Hexameter ScansionConstants Syllabifier Metrical Validator ScansionFormatter StringUtils module Made minor formatting corrections elsewhere to quiet warnings encountered during transpiling the rst file during testing and verification. * corrected documentation & doctest comments that were causing errors. doctests run with an added command line switch: nosetests --no-skip --with-coverage --cover-package=cltk --with-doctest * fixing broken doctest comment * correcting documentation comment that causes doctest to err * Corrections to make the build pass: 1. added install gensim to travis build script; its absence is causing an error in word2vec.py during the build. 2. Modified transcription.py so that the macronizer is initialized on instantiation of the Transcriber class and not at the module level; the macronizer file is 32MB and this also seems to cause an error with travis as github does not make large files displayable, and so it may not be available for the build. The macronizer object has been made a component of "self." * moved package import inside of main so that it does not prevent the build from completing; soon, we should move to update the dependencies of word2vec; gensim pulls in boto which isn't python3 compliant, there is a boto3 version which we may be able to slot in, but perhaps a larger question is boto necessary? * correcting documentation * corrected documentation & doctest comments that were causing errors. doctests run with an added command line switch: nosetests --no-skip --with-coverage --cover-package=cltk --with-doctest * Added: PentameterScanner, HendecasyllableScanner, more unittests and bug fixes; refactored Hexameter class into Verse class; pulled out VerseScanner, updated documentation * updating contributors * Additional testing and small bug fixes based on integration tests. * Corrections for unittest * Adding additional unit tests; catching errors if an invalid/unconfigured charset is used with Syllabifier * fix for gu[aieou] syllabification * fix for linguā syllabification * correcting implementation of paras in Json corpus reader correcting an error with hexameter scanner; adding additional doctest. Making some matrix corpus functions more efficient by pulling regex compilation outside of looped functions * correcting for test * fixes for #979 #978 * shaving extra lines, adding docstring for init Co-authored-by: Kyle P. Johnson <kyle@kyle-p-johnson.com>
- Loading branch information
1 parent
e7ba658
commit 77ffba7
Showing
7 changed files
with
116 additions
and
93 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.