Skip to content
Go to file

Latest commit


Git stats


Failed to load latest commit information.
Latest commit message
Commit time


To run SAX-LDA go to view folder and run The main data stream folder can include multiple csv files, each representing one data stream. To re-create the experiments, use folder The context data stream currently only supports one data stream represented by a csv file (see After setting both data streams and optionally adjusting the parameter, the experiment can be started by pressing the "Start" button.

alt text

The overall workflow of SAX-LDA consists of 4 steps as can be seen in the diagram.

alt text

Data Discretisation and Pattern Recognition

Piecewise Aggregate Approximation (PAA) is used to reduce the dimensionality of the data.

alt text

Symbolic Aggregate approXimation (SAX) uses the output to discretise the con- tinuous, numerical data streams and to recognise (lower-level) patterns.

alt text


We have implemented a rule-based engine which uses the SAX patterns and the statistical information of the time frame to produce human understandable labels. The general form of the rules is given in Equation 3 where < p > is the SAX pattern, < l > is the level, < f > is the feature, < mod > a modifier for the pattern movement and < pm > is the pattern movement.

alt text

, and can take the following values:

alt text

# Virtual Document Generation The labels are grouped together from different sources within a certain time-frame toform virtual documents which can be used as input for the LDA model.

#Latent Dirichlet Allocation (LDA) We train and incrementally update an LDA model on the virtual documents that are generated from the data streams to identify and extract relations between the labels. Below is a plate model representation of LDA.

alt text





No releases published


No packages published


You can’t perform that action at this time.