Permalink
Switch branches/tags
Nothing to show
Commits on Nov 22, 2015
  1. Merge pull request #12 from ghostfreeman/master

    #11 Fixing version of Mongo in Gemspec to last known working version
    committed Nov 22, 2015
Commits on Nov 21, 2015
Commits on Jul 10, 2014
  1. fixed issue caused by using negative infinity as log likelihood of wo…

    …rd in class that has never seen it
    committed Jul 10, 2014
  2. clarified smoothing in readme

    committed Jul 10, 2014
Commits on Jul 8, 2014
Commits on Jun 7, 2014
  1. Merge pull request #7 from rurounijones/performance-increases

    Replace check method to see if a string is numeric
    committed Jun 7, 2014
Commits on Jun 3, 2014
  1. Sleep before running tests

    To give time for travis-CI to get the storage services up and running
    sleep for 15 seconds before starting our tests
    
    As documented at:
    http://docs.travis-ci.com/user/database-setup/#MongoDB-may-not-be-immediately-accepting-connections
    rurounijones committed Jun 3, 2014
  2. Replace check method to see if a string is numeric

    The old code used the very correct Float(word) method to see if a string
    was numeric. This works reliable with all sorts of edge-case data but it
    is very slow.
    
    Since we have already parsed out a lot of possibilities during word
    atomisation (e.g. decimal numbers like 123.45 have already been split
    into "123" and "45") we do not need this level of "dealing with edge"
    case surety.
    
    Therefore we can just do a simple regex check to see if the string is
    all numerals or not.
    
    In tests on 1000 emails (Single threaded) the run-time was reduced
    from 2.4 seconds to 1.4 seconds.
    
    Since we have traded edge-case reliability for speed we can no longer
    leave this as a String class monkey-patch so move it into a method that
    will only be called by Ankusa itself
    rurounijones committed Jun 3, 2014
Commits on May 30, 2014
  1. Merge pull request #6 from rurounijones/performance-increases

    Improve performance of Stopword lookup
    committed May 30, 2014
  2. Improve performance of Stopword lookup

    Ankusa::STOPWORDS is created once and then searched for every single
    word that we are classifying.
    
    Change it from an Array with O(n) average time complexity to a Set
    (Hash-Table) with O(1) average time complexity.
    
    In tests on 1000 emails (Single threaded) the run-time was reduced
    from 6.4 seconds to 2.4 seconds.
    rurounijones committed May 30, 2014
Commits on May 22, 2014
  1. Merge pull request #4 from rurounijones/travis-ci-support

    Improved Travis ci support
    committed May 22, 2014
Commits on May 21, 2014
  1. Add the "mongo" development dependency

    We do not want to include storage specific dependencies in production
    however for a developer hacking on Ankusa it is a different matter.
    Add the "mongo" development gem dependency to the gemspec to avoid
    people (and Travis-CI) having to do it themselves
    rurounijones committed May 21, 2014
  2. Combine memory and filesystem tests under Travis-CI

    * Update Rakefile with new Travis-CI task
    * Fix FileSystem tests (had same name as memory ones)
    * Update travis config file to call new task
    rurounijones committed May 21, 2014
  3. Libraries should not include Gemfile.lock

    Libraries should be more liberal with their dependencies than
    applications to allow bundler to select dependencies with the best
    chance of avoiding conflicts
    rurounijones committed May 21, 2014
Commits on Jun 19, 2013
  1. updated rvmrc and ruby version to 1.9.3, replaced iconv with String.e…

    …ncode (only available in 1.9.3), bumped version to 0.1
    committed Jun 19, 2013
Commits on Nov 30, 2012
Commits on Nov 29, 2012
  1. fix for classifying a doc when no training has been done. fixed tests…

    … for hbase, memory, and file system storage, but could'nt get cassandra tests running under ruby 1.9.3
    bborn committed Nov 29, 2012
Commits on Aug 15, 2012
Commits on May 23, 2012
Commits on Apr 16, 2012
  1. Merge pull request #6 from Dreepi/mongodb

    MongoDb Support added
    committed Apr 16, 2012
Commits on Apr 10, 2012
  1. bumped version

    committed Apr 10, 2012
  2. fixed bug in text atomizer

    committed Apr 10, 2012
Commits on Apr 1, 2012
  1. sped up stopword hashing

    committed Apr 1, 2012
Commits on Oct 16, 2011
  1. added mongo specs to readme

    kitop committed Oct 16, 2011
Commits on Jun 20, 2011
  1. added indexes to tables

    kitop committed Jun 20, 2011
Commits on Jun 17, 2011