Skip to content

Latest commit

 

History

History
103 lines (75 loc) · 3.36 KB

CHANGELOG.md

File metadata and controls

103 lines (75 loc) · 3.36 KB

Change Log

[Unreleased]

Added

Changed

Fixed

  • Deserialized words are strings instead of unicode

Removed

[0.4.3] 2015-10-07

Added

Changed

Fixed

  • Correctly serialize hyperparameters
  • Serialize and deserialize vocabulary
  • Prediction doesn't require random number generator

Removed

[0.4.2] - 2015-10-01

Added

  • Add active_dishes method to state

Fixed

  • Bug where too many dishes are created when deserializing
  • Bug fixed where dishes_ is constructed incorrectly when dish ids aren't contiguous

[0.4.1] - 2015-09-30

Added

Changed

Fixed

  • Issue where deserialization sometimes failed due to deleted tables not being pruned from all vectors.

Removed

[0.3.1] - 2015-09-28

Added

  • Docstrings for most relevant/public methods
  • Add serialization of state objects. They are also picklable.
  • Make model_definition picklable.

Changed

  • Random number generator is no longer required for state object constructor

Fixed

  • Bug where perplexity calculation is assumed to be strictly monotone
  • Bugs in explicit initialization. Now fully functional.
  • Bug in sort order for term_relevance_by_topic

[0.3.0] - 2015-09-16

Added

  • Add utility function for getting pyLDAvis data
  • Added utility functions for translating data formats
  • Wrote tests based on those in ariddell's LDA
  • Added term_relevance_by_topic to get terms and relevance values as described by Sievert and Shirley

Changed

  • New README
  • Rename word_distribution method to word_distribution_by_topic
  • Rename document_distribution method to topic_distribution_by_document.

Fixed

  • Multidish initialization actually works (from Python)
  • Word distributions are normalized
  • Word distributions map from word (hashable objects) to probabilities instead of index to probability.

[0.2.2] - 2015-07-23

Fixed

  • Fixed bug where initialize incorrectly converted words (hashable objects) to integers (causing bad sampling issues).

[0.2.1] - 2015-07-22

Added

  • Documents can now be any list-of-list of hashable objects
  • Expose nwords method of state object to Python layer
  • Can initialize state with more than one dish (topic)

Removed

  • Removed biology abstract test script and data

Changed

  • State object rolls up m_k instead of tracking m
  • Reuse create_table and create_dish in state constructor
  • Rename state.t_ji to state.table_doc_word

[0.2.0] - 2015-07-20

Added

  • Initial (alpha) implementation HDP-LDA Posterior sampling in the Chinese restaurant franchise (Section 5.1 in Teh, et al) based on derivations done by Nakatani Shuyo.

Changed

  • Currently uses vector of vector of integers to represent documents (instead of variadic dataview).
  • Several changes to initialization API including hyperparameter setting.

[0.1.0] - 2014-11012

Added

  • Initial attempt at HDP-LDA. Sampler does not work, however provides much of the code infrastructure for model.