SSLH (Semi-Supervised Learning with Heterophily)
SSLH on github:
This library implements efficient algorithms in linear algebra for solving various inference and estimation problems in networks with observed heteorphily between classes of nodes (Heterophily: "Opposites attract" vs. Homophily: "Birds of a feather flock together"). The technical framework is that of undirected graphical models (Markov Random Fields or Markov Networks). The key idea is that after applying certain linearization assumptions (that change the semantics) the resulting formulations allow several orders of magnitude speed-up in calculation.
The methods are described in detail in the following papers:
- Linearized and Single-Pass Belief Propagation. Wolfgang Gatterbauer, Stephan Günnemann, Danai Koutra, Christos Faloutsos. PVLDB 8(5): 581-592 (2015). [Paper (PDF)], [Full version (PDF)]
- Semi-Supervised Learning with Heterophily. Wolfgang Gatterbauer [Working paper (PDF)]
Usage & Documentation
The package consists of:
- A directory
sslhthat contains files with the main methods
- A directory
testthat contains the test files, each of which makes use of methods from the corresonding file in the
Thus ideally take a look in the
test directory, run some files and look through the annotations in the files.
The latest version of SSLH can be installed from the master branch using pip:
pip install sslh
pip install git+https://github.com/wolfandthegang/sslh/
Another option is to clone the repository and install SSLH using
python setup.py install or
python setup.py develop.
SSLH is tested on Python 2.7 and depends on NumPy, SciPy, Sklearn, and PyAMG (see setup.py for version information).
Sklearn: includes methods for semi-supervised learning (assuming homophily): http://scikit-learn.org/stable/modules/label_propagation.html
PyPGMc: focusing on directed graphical models https://github.com/kadeng/pypgmc/
Copyright 2015 Wolfgang Gatterbauer
Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License.
Distributed in the hope that it will be useful to other researchers, however, unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0
Questions or comments about
SSLH? Drop me an email at
- Initial Release: Main method 'linBP_undirected' for linearized belief propagation with one single doubly stochastic and symmetric potential as described in "Linearized and Single-pass Belief Propagation"
- Linearized BP for networks with one single aribtrary potentials as described in "Linearization for Pairwise MRFs."