Cloud Mining automatically builds exploratory faceted search systems.
JavaScript Python CSS
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.
cloudmining Updated for changes in SimSearch Jun 25, 2013
examples Building the DBLP instance Oct 15, 2013
scraping Building the DBLP instance Oct 15, 2013
tests Minor commit changes Nov 4, 2012
tools first commit Nov 2, 2012
INSTALL.md Minor commit changes Nov 4, 2012
LICENSE first commit Nov 2, 2012
README.md documentation Nov 4, 2012

README.md

Cloud Mining automatically builds exploratory faceted search systems. It leverages Sphinx as a full text retrieval engine and fSphinx for faceted search. SimSearch is used for item based search. The aim is to provide an interface which will encourage nonlinear search and data exploration. The facets support different visualizations such as tag clouds, histogram counts or a rose diagram and can be extended with pluggins.

Create a file called application.py with the following lines:

from cloudmining import CloudMiningApp

# create a new CloudMining web application
app = CloudMiningApp()

# create a FSphinxClient from a configuration file
cl = FSphinxClient.FromConfig('/path/to/config/sphinx_client.py')

# set the fsphinx client of the app
app.set_fsphinx_client(cl)

Execute application.py and aim your browser at http://localhost:8080:

python application.py

On data from IMDb, you obtain the following interface:

Cloud Mining Generic Interface

And after customization, you get:

Cloud Mining Customized Interface

Check out some instances, here and there. Have a look at the api for customization and look into some of the example instances provided.

Thank you to Andy Gott for the logo design, FAMFAMFAM and Fugue for the icons. Rose diagram thanks to RGraph.