Skip to content


Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?

Latest commit


Git stats


Failed to load latest commit information.
Latest commit message
Commit time

Jiminy Modeler

This is the model service for the Jiminy project.


To setup the modeler, we assume that:

  1. A postgreSQL database has been set up, containing a table named ratings which holds the ratings information.
  2. MongoDB is running. This will be used to store the models created by the modeler.

To create the modeler service on OpenShift, first install the oshinko tools with

oc create -f

The modeler can then be launched using the following command:

oc new-app --template oshinko-python-build-dc \
  -p GIT_URI= \
  -e MONGO_URI=mongodb://mongo:mongo@mongodb/models \
  -e DB_HOST=postgresql \
  -e DB_USER=postgres \
  -e DB_PASSWORD=postgres \
  -e DB_DBNAME=postgres \
  -p \

In the command above, we assume that the postgreSQL database has associated username, database name and password postgres. These command line arguments can be amended appropriately to connect to the correct postgreSQL table, where the ratings are held.

The commands given above will launch the modeler. To watch the modeler, run:

oc logs -f s/Modeler/modeler/

Training the model

The command to launch the modeller, which is given above, will use a fast training method, meaning that parameters for the model are preselected. The default values are given by rank = 6, lambda = 0.09 and iteration = 2. You can run the modeler with different parameter values using command line arguments. For example, if you wish to run at rank = 3, lambda = 0.1 and iteration = 4 the following command line arguments should be included in the model launch command:

  -e RANK_VAL = 3
  -e LAMBDA_VAL = 0.1
  -e ITS_VAL = 4

However, if you wish to complete a more robust training of the model, which optimises for model parameters, the following command line argument should be included in the model launch command:

   -e DISABLE_FAST_TRAIN = disable-fast-train

Note that this command will result in the model taking over half an hour to train on the latest MovieLens Dataset.

If fast train is disabled and command line parameter values are given, the command line parameter values will be ignored and the slow train method will be used.


a framework for a spark based model creation service







No releases published


No packages published