All notable changes to this project will be documented in this file.
- Feature list for ptwiki
- Bumped revscoring to v2.5.1
- Old code examples (
examples/test_model.py
andexamples/train_model.py
)
- Bumped revscoring to v2.4.x
- Added
content_type
param to setup.py - Minor formatting edits in README
- Added features for English Wikipedia's short-format notes.
- Release Criteria document
- svwiki feature lists
- Added ability to do a fast filtering pass before parsing wikitext.
- Added svwiki extractor.
- Added Wikibase item features.
- Added
util
utility helpers. - Added
fetch_labels
utilities. - Added trwiki extractor.
- Added
words_to_watch
count to enwiki feature lists. - Added new features to wikidatawiki - (@glorianY)
- Added basic extraction pattern for item quality model.
- Added Persian Wikipedia features.
- Added glwiki feature lists.
- Adds
item_completes
to wikidatawiki.
- Rename wikiclass to articlequality.
- Bumped revscoring to v2.3.4
- Updated
fetch_text
for newrvslots
API param. - Remove target files when commands error out.
- Replaced filenames with automatic Make variables.
- Update classification examples to revscoring 2.x
- Started using TravisCI for automated builds.
- Use PyTest for testing now.
- Rename pagelevel prediction classes in frwikisource.
- Rename
wp10
->articlequality
. - Change wikidatawiki models to use GradientBoosting.
- Fixed bug in
fetch_item_info
. - Update about.py in wikiclass folder to the right github link.
- Resolved mwxml/mwtypes version conflict.
- Fixed "who" templates in enwiki features.
- Fixed trwiki extractor so that it works for 'baslagıç'.
- Added feature lists for ruwiki.
- Added
extract_scores
utility.
- Implemented modular
about.py
pattern for pkg info. - Bumped revscoring to v1.3.0
- Add HTML comment filtering to Russian extractor
- Added testcase to ruwiki extractor.
- Switched RF for GradientBoosting models in Makefile.
- Cleaned up
extract_from_text
utility.
- Wrong variable name in frwiki extractor.
- Fixed division with modifiers in
wikipedia.article
.
- Added Russian assessment extractor. - @nettrom
- Flexibility for revscoring version requirement.
- Typo in French extractor. - @nettrom
- Added basic counts for cn templates and dict_words/word to frwiki feature list.
- Added tuning reports to Makefile.
- Bumped revscoring requirement to v1.1.0.
- Updated feature extractor for revscoring 1.x
- Updates enwiki and frwiki
feature_lists
for revscoring 1.x
- Using
mwreverts
,mwxml
,mwapi
libraries instead ofmw
lib.
- Bumped revscoring requirement to 0.7.10 and fixed issues this causes.
- Updated requirement for mwtypes >= 0.2.0
- Adds new
templates_that_match
meta feature. - Added
not_an_article
filter. - Added
who
,citation_needed
andmain_article
templates to enwiki.
- Bumped revscoring requirement to 0.7.2
- Switched text extraction to be API-based.
- Added verbose option to
extract_features
. - Parallelization for
extract_features
.
- Minor divide-by-zero errors in enwiki and frwiki features.
- Template list error for frwiki. - @gpaumier
- Remove empty sections from CHANGELOG, they occupy too much space and create too much noise in the file. People will have to assume that the missing sections were intentionally left out because they contained no notable changes.
- Cleanup to feature sets for enwiki and frwiki.
- Spaces to tabs in Makefile
- Pass
page_labeling
toextract_text
as arg.
- Fixed issue with generator requirements in setup.
- README format changed from
.rst
to.md
. - Update functions documentation.
- Minor updates to Makefile and
extract_text
for running on stat3
- Basic API.
- Added tests for all features and datasources.
- Added frwiki extractor
- Added
extract_text
utility.
- Restructured wikiclass to make use of the revscoring package.
- Completed enwiki extractor.
- Added error handling in case mwparserfromhell fails.
- Switches
extract_labelings
to use mwxml library - Remove post '/' stuff from titles during normalization.
- Additional documentation.
- Minor issues in
extract_features.py
script.
- Removed duplicated feature definitions(now part of revscoring).
- Added minimal docs setup.
- Added a LICENSE.
- Moved
add_text
util toscripts/
dir. - Completed basic docs.
- README errors.
- Handle division-by-zero case for articles with no words.
- First release on PyPI.
- Working RFTextModel
- Added
add_text
util. - Basic README.