A utility to handle multiple CDBs
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.circleci
_example/webapi
cmd/cdbs
data
.gitignore
BSD
LGPL
LICENSE
README.md
out.go
out_test.go
util.go
util_test.go

README.md

cdbs

CircleCI codecov.io Report card GoDoc LGPLv3 BSD

What's this

  • This tool converts the input key and value pairs into several CDB files
  • Input SHOULD be sorted by the key, otherwise you can not look up by using several cdbs with Get(key string)
    • For unsorted keys, BruteGet(key string) can be used. Because this searches all CDB files, the efficiency will be down.
  • Currently works only on 64 bit environments

Usage

Usage of cdbs:
  -i, -input :  Input file name. - or no designation means STDIN.
  -o, -output: Output file name suffix.
  -t, --separator: Separator of keys and values (deault: "\t")
  -z, --compress:   Compress values in gzip format (deault: false)
  --single:     Only output a single CDB file (deault: false)
  --log:        Enable logging (deault: false)

INSTALL

  • To install binary: two options. (select one)
  • To install library
    • go get github.com/shirayu/cdbs

Acknowledgement

I developed this program as a part of the research project "Establishment of Knowledge-Intensive Structural Natural Language Processing and Construction of Knowledge Infrastructure" in Kyoto University supported by CREST, JST.

Licence

  • (c) Yuta Hayashibe 2014
  • Released under any of the following licences
    • Lesser GNU General Public License 3.0 (see the file LGPL)
    • New BSD License (3-clause BSD License) (see the file BSD)