Skip to content

Version 0.2

Compare
Choose a tag to compare
@bbengfort bbengfort released this 01 Mar 22:21
· 78 commits to master since this release

This update was a push to get Baleen running on EC2 on an hourly basis in preparation for PyCon. We updated all of Baleen's dependencies to their latest versions, added tests and other important fixtures, and organized the code a bit better. New functionality includes the ability to fetch the post webpage from the link, export the corpus to disk using the command line utility, and run in the background using the schedule library.

Deployed: Tuesday, March 1, 2016
Contributors: Benjamin Bengfort

Changes

  • Refactoring of the code to a more organized structure
  • Added some tests for safety on a number of modules
  • Updated all the dependencies from 2014
  • Added an export command to the CLI
  • Uses requests.py to fetch the full webpage from the link
  • Slightly better logging configuration
  • Use schedule to run every hour
  • Created Upstart configuration for background on Ubuntu