A library to deal with tree-structured data in TensorFlow. It provides an efficient implementation of an Encoder and a Decoder mapping trees to and from a flat vector space. Due to the dynamic nature of the computations, TensorFlow Eager mode is employed.
TODO tree encoding/decoding gif
A well-documented example of an autoencoder for tree-structured arithmetic expressions is provided in `examples/simple_expression`.
To run it, install all the dependencies listed in `requirements.txt`, for instance with conda:

```
conda create --name tf_tree --file requirements.txt
```
Then you can run the example with all the default settings:

```
PYTHONPATH=. python examples/simple_expression/exp_autoencoder.py
```
Use `--helpfull` to list all the available flags.
To monitor training progress, launch

```
tensorboard --logdir=/tmp/tree_autoencoder/test
```

and visit http://localhost:6006.
To understand and handle tree-structured data, a few concepts need to be introduced. First of all, we need a way to characterize the trees we want to deal with, so that we can instantiate the proper sub-networks and generate valid trees.
See `examples/simple_expression/exp_definition` for a full example.
A tree is characterized by a `TreeDefinition` object, basically listing the kinds of nodes that can appear in the trees:

```
TreeDefinition(node_types=[NODE_DEF1, NODE_DEF2, NODE_DEF3, ...])
```
Every node is characterized by a `NodeDefinition` object:

- associating a unique string id with the node type
- defining whether such nodes can appear as root nodes
- characterizing their arity (the number of children they have)
- characterizing the value they might have associated

For instance:

```
NodeDefinition("node_type_id", may_root=True, arity=NodeDefinition.FixedArity(0), value_type=VALUE_TYPE_DEF)
```
Every value associated with a node must be characterized by extending the class `NodeDefinition.Value`, implementing:

- `representation_to_abstract_batch` and `abstract_to_representation_batch`: methods to convert values between a human-readable representation and a neural-network-readable one
- `representation_shape`: indicating the size of the representation
- `class_value`: denoting whether the value is a one-hot encoding or a dense embedding (it is used to choose which loss to employ, MSE or Cross Entropy, see #9)
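For instance, a minimal sketch of a value type for scalar numbers might look like the following. The member names come from the list above; the exact signatures (static methods, argument names, tensor shapes) are assumptions:

```python
import tensorflow as tf
from tensorflow_trees.definition import NodeDefinition  # import path is an assumption

class NumValue(NodeDefinition.Value):
    representation_shape = 1   # the NN-readable representation is a single float
    class_value = False        # dense embedding, hence MSE loss (True would mean one-hot / cross entropy)

    @staticmethod
    def abstract_to_representation_batch(values):
        # human-readable -> NN-readable: pack a list of floats into a [batch, 1] tensor
        return tf.reshape(tf.constant(values, dtype=tf.float32), [len(values), 1])

    @staticmethod
    def representation_to_abstract_batch(tensor):
        # NN-readable -> human-readable: unpack a [batch, 1] tensor into floats
        return [float(x) for x in tensor.numpy().flatten()]
```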
Due to the structured nature of the computation, we need some support data structures: `BatchOfTreesForEncoding` and `BatchOfTreesForDecoding`, defined in `tensorflow_trees/batch.py`. They are used to efficiently and incrementally store trees and intermediate values during the computations; they are the only way of interacting with encoders and decoders.
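The intended workflow is therefore roughly the following. The class names are the real ones from `tensorflow_trees/batch.py`, but every constructor argument and call below is hypothetical, shown only to convey the shape of the API:

```python
# Hypothetical usage sketch -- arguments and call signatures are assumptions,
# not the documented API.
batch = BatchOfTreesForEncoding(trees)              # wrap trees and their intermediate values
root_embeddings = encoder(batch)                    # one flat vector per tree
decoded = decoder(BatchOfTreesForDecoding(root_embeddings))  # trees rebuilt from the flat space
```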
The Encoder and Decoder basically work as a dynamic composition of a finite number of sub-network kinds, depending on the sample structure. These sub-networks are built according to the tree definition, in a way defined by means of a `CellsBuilder`, as defined in `tensorflow_trees/encoder.py` and `tensorflow_trees/decoder.py`.
Implementations of some basic sub-network kinds are provided in `tensorflow_trees/encoder_cells.py` and `tensorflow_trees/decoder_cells.py`.
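To give an idea of what "dynamic composition" means, here is a small standalone illustration (not the library's code): each node type gets its own cell, and which cells run, and in what order, depends on the structure of each sample. This data-dependent control flow is exactly what Eager mode makes straightforward:

```python
import tensorflow as tf

class ToyTreeEncoder(tf.keras.Model):
    """Standalone illustration of structure-driven composition, not the library's encoder."""

    def __init__(self, node_type_ids, embedding_size=32):
        super().__init__()
        # one small dense "cell" per node type (the library's real cells live in encoder_cells.py)
        self.cells = {t: tf.keras.layers.Dense(embedding_size, activation="tanh")
                      for t in node_type_ids}
        self.leaf_input = tf.Variable(tf.zeros([1, embedding_size]))

    def encode(self, node):
        # node = (type_id, [children]); recurse bottom-up, then apply this
        # node type's cell to the concatenated child encodings
        type_id, children = node
        if children:
            inputs = tf.concat([self.encode(c) for c in children], axis=-1)
        else:
            inputs = self.leaf_input
        return self.cells[type_id](inputs)

toy_encoder = ToyTreeEncoder(["add", "num"])
tree = ("add", [("num", []), ("add", [("num", []), ("num", [])])])
flat = toy_encoder.encode(tree)   # a single flat vector encoding the whole tree
```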
For a more detailed and in-depth discussion you can refer to my master thesis on conditional variational autoencoders for tree-structured data, of which this library is a by-product. Although some details have changed during the library refactoring (hopefully for the better), most of it is still relevant.
TODO