Classification (class label prediction) over two non-stationary data streams, one with labeled data (source) and the other with unlabeled data (target). Covariate shift is assumed between the source and target streams.
The problem is to predict the class labels of data on the target stream using labeled data from the source stream; concept drift can occur in either stream, asynchronously. More details are in the publication at CIKM 2016.
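To make the covariate shift assumption concrete, here is a minimal sketch of instance weighting: each source instance is weighted by an estimate of p_target(x) / p_source(x), so that a weighted classifier trained on source data better reflects the target distribution. The sketch assumes one-dimensional features and fits a single Gaussian to each stream, which is a simplification chosen for illustration; it is not the density ratio estimator used in this repository or in the paper. The names `fit_gaussian`, `gaussian_pdf` and `importance_weights` are hypothetical.

```python
import math


def fit_gaussian(xs):
    """Return (mean, std) of a 1-D sample; std is floored to avoid division by zero."""
    n = float(len(xs))
    mean = sum(xs) / n
    var = sum((x - mean) ** 2 for x in xs) / n
    return mean, max(math.sqrt(var), 1e-6)


def gaussian_pdf(x, mean, std):
    """Density of a normal distribution with the given mean and std at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def importance_weights(source_xs, target_xs):
    """Weight each source instance by the estimated ratio p_target(x) / p_source(x)."""
    s_mean, s_std = fit_gaussian(source_xs)
    t_mean, t_std = fit_gaussian(target_xs)
    return [gaussian_pdf(x, t_mean, t_std) / gaussian_pdf(x, s_mean, s_std)
            for x in source_xs]


# Toy data (made up): the target stream is shifted to the right of the source.
source = [0.0, 0.5, 1.0, 1.5, 2.0]
target = [1.0, 1.5, 2.0, 2.5, 3.0]
weights = importance_weights(source, target)
```

Source instances that lie closer to the target distribution receive larger weights. Weights like these would then be passed, one per training instance, to an instance-weighted learner.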
- The Java code for change point detection is based on this paper.
- We use the instance weighted libSVM code from here.
- The config.properties file specifies the data path and other configurable items.
- Python v2.7
$ python multistream.py <dataset_name>
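For readers unfamiliar with the properties format used by config.properties, the following sketch shows one way such a file could be read from Python. It handles only simple `key=value` lines and comments; the full Java properties format also allows `:` separators and line continuations. The key `data_path` and the function `parse_properties` are hypothetical and used only for illustration; the actual keys are the ones defined in the repository's config.properties.

```python
def parse_properties(text):
    """Parse simple key=value lines, skipping blank lines and # or ! comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line[0] in "#!":
            continue
        key, sep, value = line.partition("=")
        if sep:  # ignore lines that contain no '='
            props[key.strip()] = value.strip()
    return props


# Hypothetical content; the key name is an assumption, not the repository's real key.
example = """
# illustrative only
data_path = /path/to/data/
"""
props = parse_properties(example)
```

In practice the text would come from reading the config.properties file itself rather than from an inline string.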