amaunz / last

LAST: LAtent STructure miner

This URL has Read+Write access

amaunz (author)
Tue Nov 03 07:44:17 -0800 2009
commit  4b13d8c00ac922ca2804f20b56a8d29a124fb0a8
tree    897ecbd3f1b885e88a1f32a04f05f22f8da93494
parent  622ffadff8858b90d5b7d908db57c28dc81eb7fb
last /
name age message
file Doxyfile Wed Jul 22 05:09:24 -0700 2009 Initial version of last [amaunz]
file INSTALL Loading commit data...
file LICENSE Wed Jul 22 05:09:24 -0700 2009 Initial version of last [amaunz]
file Mainpage.h Wed Jul 22 05:09:24 -0700 2009 Initial version of last [amaunz]
file Makefile Mon Nov 30 04:00:38 -0800 2009 Bugfix discrete counting adds also weights ins... [amaunz]
file README
file closeleg.cpp Wed Jul 22 05:09:24 -0700 2009 Initial version of last [amaunz]
file closeleg.h Wed Jul 22 05:09:24 -0700 2009 Initial version of last [amaunz]
file constraints.cpp Tue Nov 03 01:21:34 -0800 2009 Fixed bug: activating was set also by upper bou... [amaunz]
file constraints.h Tue Nov 03 01:21:34 -0800 2009 Fixed bug: activating was set also by upper bou... [amaunz]
file database.cpp
file database.h Wed Jul 22 05:09:24 -0700 2009 Initial version of last [amaunz]
file fminer.cpp
file fminer.h
file fminer_wrap.i Wed Jul 22 05:09:24 -0700 2009 Initial version of last [amaunz]
file globals.h
file graphstate.cpp
file graphstate.h
file legoccurrence.cpp Wed Jul 22 05:09:24 -0700 2009 Initial version of last [amaunz]
file legoccurrence.h Wed Jul 22 05:09:24 -0700 2009 Initial version of last [amaunz]
file libfminer.css Wed Jul 22 05:09:24 -0700 2009 Initial version of last [amaunz]
file misc.h
file path-bbc.cpp
file path.cpp
file path.cpp.test
file path.h
file patterntree.bbc.cpp
file patterntree.cpp
file patterntree.h
README
Welcome to LibFminer.

This is the Fminer library, available from http://github.com/amaunz/libfminer/tree/master.
The Fminer application that uses this library is available from http://github.com/amaunz/fminer/tree/master.
The official website with documentation is http://www.maunz.de/libfminer-doc .

For installation and documentation see INSTALL.
For license information see LICENSE.

Abstract:
We present a new approach to large-scale graph mining based on so-called backbone refinement classes.
The method efficiently mines tree-shaped subgraph descriptors under minimum frequency and significance constraints, 
using classes of fragments to reduce feature set size and running times.
The classes are defined in terms of fragments sharing a common backbone.
The method is able to optimize structural inter-feature entropy as opposed to occurrences, which is characteristic for 
open or closed fragment mining.
In the experiments, the proposed method reduces feature set sizes by >90 % and >30 % compared to  complete tree mining 
and open tree mining, respectively.
Evaluation using crossvalidation runs shows that their classification accuracy is similar to the complete set of trees 
but significantly better than that of open trees. 
Compared to open or closed fragment mining, a large part of the search space can be pruned due to an improved 
statistical constraint (dynamic upper bound adjustment), which is also confirmed in the experiments in lower running 
times compared to ordinary (static) upper bound pruning. 
Further analysis using large-scale datasets yields insight into important properties of the proposed descriptors, such 
as the dataset coverage and the class size represented by each descriptor. 
A final cross-validation run confirms that the novel descriptors render large training sets feasible which previously 
might have been intractable.

Andreas Maunz, 2008