Skip to content

Latest commit

 

History

History
237 lines (168 loc) · 6.92 KB

README.rst

File metadata and controls

237 lines (168 loc) · 6.92 KB

Jubatus

The Jubatus library is an online machine learning framework which runs in distributed environment.

See http://jubat.us for details.

QuickStart

Jubatus officially supports Ubuntu >= 12.04 , Red Hat Enterprise Linux >= 6.2 . On other platforms, some may work if they have gcc >= 4.4 .

For minimal standalone mode, Jubatus requires pficommon >= 1.3.1 (commit 8fde51454af897cc971bab9033e217ff83b12f78 or later), google-glog , libmsgpack, jubatus-mpio, jubatus-msgpack-rpc, python >= 2.6 . Please install these software with headers before installation.

$ git clone git://github.com/jubatus/jubatus.git
$ cd jubatus
$ ./waf configure --disable-re2
$ ./waf build
$ sudo ./waf install
$ jubaclassifier -f /usr/local/share/jubatus/example/config/classifier/pa.json

Here Jubatus classifier server has started. Enjoy!

License

LGPL 2.1

Update history

Release 0.4.0 2013/1/11

New machine learning service
  • Distributed Anomaly Detection (jubaanomaly)
    • Calculate anomaly measure value
Improvements
  • Change RPC implementation to msgpack-rpc based (#27)
    • [ATTENTION] Requires jubatus-mpio and jubatus-msgpack-rpc to install
    • Add mpidl's output converter 'mpidlconv' (#210)
  • Remove set_config API
    • Standalone mode: specify config file using command option '-f, --configpath'
    • Multinode mode: store config data to Zookeeper using jubaconfig (#164)
  • More flexible configration (#38)
    • Support hyper-parameters (#197)
  • Commonize name of variables and functions (#203)
  • Add or Change command options
    • Add bind IP address option '-b, --listen_addr' (#152)
    • Add bind network interfance option '-B, --listen_if' (#152, #214)
    • Check if logging directory specified by '-l, --logdir' is writeable (#206)
    • Add verbosity of log messages option '-e, --loglevel' (#207)
    • Add configuration file option '-f, --configpath'
    • Change '-d, --tmpdir' to '-d, --datadir' (#198)
  • Improve logging
    • Not separate log file for each log level (#207)
    • Add start save/load logs (#195)
    • Add many debug logs
Bugfix
  • Client symbols may conflict because of using same namespace (#84)
  • Fix shortest_path when target is found before reaching landmark (#177)
  • Commonize RPC return types and values (#193)
  • Segfaults when plugin throws JUBATUS_EXCEPTION (#211)
  • Fix classifier AROW hyper-parameter conversion (#225)
  • #86, #184, #204, #213, #219
Refactoring
  • #186, #187, #191, #192, #199

Release 0.3.4 2012/11/26

Improvements
  • Add --logdir option
  • Add --enable-debug option to configure (#130)
  • Catch bad cast of fv_converter (#170)
Bugfix
  • Plugin basename specification does not work correctly (#57)
  • MIX put_diff/get_diff and update/analyze RPC can be executed on the same time (#171)
  • #121, #131, #136, #150, #155, #156, #157, #160, #163, #165, #167, #172, #179, #180

Release 0.3.3 2012/10/29

Improvements
  • deb package (#14)
  • Jubatus loads plugin from default directory (#57)
  • Add hash_max_size option to learn in fixed-size memory (#67)
  • OS X Homebrew packaging (#116)
  • GCC compilation version <= 4.2 when zookeeper enabled (#60)
  • Experimental support for Clang compilation (#100)
  • Make the timeout smaller in unittest
  • libmecab_splitter works well in multi-thread environment, and now only support mecab ver. 0.99 or later
  • word_splitter::split method is now constant
  • global_id_generator(standalone-mode) for graph, that supports 32 bit environment (#135)
  • Use (document_frequency + 1) to calculate IDF weight to avoid inifinity
Bugfix
  • #94, #104, #106, #108, #110, #111, #113, #114, #117, #118, #124, #126, #129, #133, #138, #139, #146, #148

Release 0.3.2 2012/9/21

Improvements
  • jubavisor/jubactl support jubatus server options (#75)
  • jubadoc: document generator (.idl => .rst) (#88)
  • jenerator: enabled -o option to specify output directory
  • yum (RPM) packaging (#40)
Bugfix
  • #51, #76, #77, #79, #83, #90, #91, #96, #98, #99, #100, #101, #102, #103

Release 0.3.1 2012/7/20

Improvements
  • RPC enhances to many exceptions and provide new error handling interface (#49)
  • JSON interface for set_config APIs (#44)
  • jubavisor close zk connection correctly (#74)
Bugfix
  • #73, #69, #66, #65

Release 0.3.0 2012/6/29

Improvements
  • Distributed Graph Processing
    • Centrality computation
    • Shortest path search
  • New neighbor-search algorithm for recommender: minhash
  • installs pkgconfig file (#42, jubatus.pc)
  • jubavisor handling signal (#53)
  • Exception handling and error outputs (#55)
Bugfix
  • internal API get_storage implemented again (#21)
  • Forgot ignoring signal in keeper #47
  • #54, #45, #15

Release 0.2.3 2012/6/8

Improvements
  • Asynchronous call to multiple servers at once, both keeper and mix - common/mprpc
  • Refactor generator
  • Error message improve
  • Timeout in unittest expanded
Bugfix
  • jubactl doesn't work ver 0.2 and later #13
  • jubavisor doesn't work on daemon mode #5
  • Asynchronous mprpc client critical bug
  • #47, #50, #34, #36, #37, #31, #19, and other small bugfix

Release 0.2.2 2012/4/6

Improvements
  • Simpler interfaces at classifier, regression and recommender
    • Clients are NOT COMPATIBLE with previous releases
  • Now mix works concurrently in multiple threads (except tf-idf counting)
  • Asynchronous RPC to multiple servers at once
  • Add --version option
  • Interface description language changed from C++-like to Annotated MessagePack-IDL
  • Minor error handling
  • A bit more tested than previous releases
Bugfix
  • #30, #29, #22

Release 0.2.1 2012/3/13

Bugfix release: #28

Release 0.2.0 2012/2/16

New Features
  • recommender
    • support fast similar item search, real-time update, distributed data management
    • inverted index : exact result, fast search
    • locality sensitive hash : approximate result, fast search, small working space
  • regression
    • online SVR using passive agressive algorithm
    • as fast as current classifier
  • stat
    • a Key(string)-Value(queue<double>)
    • O(1) cost of getting sum, standard deviation, max, min, statistic moments for each queue
  • server framework
    • less-tightly coupled distributed processing framework with each ML implementation
    • idl & code generator - make it easy to write own jubatus system
    • removed public release of client libraries (so easy to generate!)
    • multiple mix - mutiple data objects can be mixed in one jubatus system
Bugfix
  • duplicate key entry in fv_converter breaks the parameter

Release 0.1.1 2011/11/15

Bugfix release

Release 0.1.0 2011/10/26

Hello Jubatus!

First release: including classifier, and mix operation