Mitsukeru

Abstract

This is an experiment in full-text indexing. I hope to use it to learn more about what makes a good search engine tick. If at all possible, one day I'd like this to be a usable piece of software.

Right now it builds a very simple term index and allows you to search it using boolean OR queries.

Usage

Build the application like so:

$ cmake . && make

Run it like this:

$ ./bin/mitsukeru ./data/data.json

Enter text to search for!

Todo

This is what I want to do with this, roughly in order:

Refactoring (started)
Persistent indexes (Kyoto Cabinet?)
More flexible indexes (think MongoDB for searching and not written in Java)
Network server mode (REST, probably)

Thanks

I'm reading http://nlp.stanford.edu/IR-book/html/htmledition/irbook.html right now and it's really, really good. I'd like to thank the authors of it for such a great reference and would highly suggest it to anyone wanting to learn about search engines and IR in general!

Also I'm using https://github.com/kazuho/picojson for the JSON parsing part of this, so props to Cybozu for that.

Licensing

All original work is licensed under the standard 3-clause BSD license, a copy of which is available with this code.

picojson uses the 2-clause BSD license.

Contact

Probably easiest to hit me up on one of:

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
bin		bin
data		data
src		src
vendor		vendor
.gitignore		.gitignore
CMakeLists.txt		CMakeLists.txt
LICENSE.md		LICENSE.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mitsukeru

Abstract

Usage

Todo

Thanks

Licensing

Contact

About

Releases

Packages

Languages

License

deoxxa/mitsukeru

Folders and files

Latest commit

History

Repository files navigation

Mitsukeru

Abstract

Usage

Todo

Thanks

Licensing

Contact

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages