Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
#854)

* Initial releases with unit tests and doctests
* Added sections and preliminary documentation for:
  Scansion of Poetry
  About the use of macrons in poetry
  HexameterScanner
  Hexameter
  ScansionConstants
  Syllabifier
  Metrical Validator
  ScansionFormatter
  StringUtils module
  Made minor formatting corrections elsewhere to quiet warnings encountered during transpiling the rst file during testing and verification.
* corrected documentation & doctest comments that were causing errors. doctests run with an added command line switch:
  nosetests --no-skip --with-coverage --cover-package=cltk --with-doctest
* fixing broken doctest comment
* correcting documentation comment that causes doctest to err
* Corrections to make the build pass:
  1. added install gensim to travis build script; its absence is causing an error in word2vec.py during the build.
  2. Modified transcription.py so that the macronizer is initialized on instantiation of the Transcriber class and not at the module level; the macronizer file is 32MB and this also seems to cause an error with travis as github does not make large files displayable, and so it may not be available for the build. The macronizer object has been made a component of "self."
* moved package import inside of main so that it does not prevent the build from completing; soon, we should move to update the dependencies of word2vec; gensim pulls in boto which isn't python3 compliant, there is a boto3 version which we may be able to slot in, but perhaps a larger question is boto necessary?
* correcting documentation
* add JsonFile Corpus Reader for Perseus Greek and Latin cltk json corpora
  add better corpus reader documentation
  correct annotations and package naming
  unit tests for JsonFile Corpus Readers
* improved documentation and a fix for tests
* remove unnecessary coerce to int for sorting sections and subsections
* switch print statement to log statement
* corrected JsonFileCorpusReader to work with arbitrary levels of nested dictionaries
* add perseus corpus types file for assemble_corpus functionality
  revise assemble_corpus method to just return a CorpusReader instead of a tuple of CorpusReader and input params
  correct latin library corpus types
  Revised test_corpus.py file to use setUp; removed the download_test_corpora file, changed the travis script
1 parent 79123d9, commit e74a70d
Showing 14 changed files with 829 additions and 270 deletions.