
Sequential API for Model Construction #925

Merged: 18 commits into deepchem:master on Dec 21, 2017

Conversation

@rbharath (Member) commented Nov 6, 2017

This PR adds a new API for constructing sequential models (those that are a linear stack of layers), based on the Keras Sequential API. For simple networks, this new API lets users skip explicitly specifying Feature and Label placeholders, losses, outputs, and in_layers wiring. For example, here's a simple classifier in Sequential:

X = np.random.rand(20, 2)                     
y = [[0, 1] for x in range(20)]
dataset = dc.data.NumpyDataset(X, y)                              
model = dc.models.Sequential(learning_rate=0.01)                  
model.add(Dense(out_channels=2))                                  
model.add(SoftMax())
model.fit(dataset, loss="binary_crossentropy", nb_epoch=1000)     
prediction = np.squeeze(model.predict_on_batch(X)) 

For comparison, here's the same model in the TensorGraph API:

X = np.random.rand(20, 2)                     
y = [[0, 1] for x in range(20)]                        
dataset = NumpyDataset(X, y)                                      
features = Feature(shape=(None, 2))
dense = Dense(out_channels=2, in_layers=[features])               
output = SoftMax(in_layers=[dense])
label = Label(shape=(None, 2))                                    
smce = SoftMaxCrossEntropy(in_layers=[label, dense])              
loss = ReduceMean(in_layers=[smce])                               
tg = dc.models.TensorGraph(learning_rate=0.01)                    
tg.add_output(output)
tg.set_loss(loss) 
tg.fit(dataset, nb_epoch=1000)                                    
prediction = np.squeeze(tg.predict_on_batch(X))                   

Under the hood, Sequential inherits from TensorGraph and simply constructs the explicit TensorGraph as needed. For now, losses are specified as strings passed into Sequential.fit(); the initial implementation will support the same string arguments as the Keras Sequential API. The PR also removes an old (non-functional) version of Sequential that was not based on TensorGraph.

This PR isn't quite ready to merge (will need to add in losses beyond binary_crossentropy and mse), but I wanted to put it out for feedback on the API design.
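The deferred-construction pattern described above (collect layers with add(), wire each layer to the previous one, and only build the graph with its loss once fit() sees the data) can be sketched roughly as follows. The class and method names here are illustrative toys, not the actual deepchem implementation:

```python
# Toy sketch of the Sequential pattern: layers are collected by add(),
# and the "graph" (here just the imputed input shape and chosen loss)
# is only assembled when fit() sees the dataset.

class ToyLayer:
    def __init__(self, name):
        self.name = name
        self.in_layers = []

class ToySequential:
    SUPPORTED_LOSSES = {"binary_crossentropy", "mse"}

    def __init__(self):
        self._layers = []
        self._built = False
        self.input_shape = None
        self.loss = None

    def add(self, layer):
        # Sequential owns the wiring, so user-specified in_layers are an error.
        if layer.in_layers:
            raise ValueError("Cannot specify in_layers for Sequential.")
        if self._layers:
            layer.in_layers.append(self._layers[-1])
        self._layers.append(layer)

    def _build(self, n_features, loss):
        if loss not in self.SUPPORTED_LOSSES:
            raise ValueError("Unsupported loss.")
        # In the real API this is where Feature/Label placeholders and the
        # loss subgraph would be created, now that n_features is known.
        self.input_shape = (None, n_features)
        self.loss = loss
        self._built = True

    def fit(self, X, y, loss):
        if not self._built:
            self._build(len(X[0]), loss)
        return self  # actual training elided in this sketch

model = ToySequential()
model.add(ToyLayer("dense"))
model.add(ToyLayer("softmax"))
model.fit([[0.1, 0.2]], [[0, 1]], loss="binary_crossentropy")
print(model.input_shape)  # -> (None, 2)
```

The key design point is that the input placeholder shape is imputed from the dataset at fit() time, which is also the root of the restore() issue discussed later in this thread.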

@coveralls commented Nov 6, 2017

Coverage increased (+0.08%) to 80.499% when pulling 5246f2c on rbharath:sequential into 401d669 on deepchem:master.

@rbharath rbharath changed the title [WIP] Sequential API for Model Construction Sequential API for Model Construction Nov 8, 2017
@coveralls commented Nov 8, 2017

Coverage increased (+0.08%) to 80.499% when pulling 7453a90 on rbharath:sequential into 401d669 on deepchem:master.

if not len(layer.in_layers) == 0:
  raise ValueError("Cannot specify in_layers for Sequential.")
layer.in_layers += [prev_layer]
self._add_layer(layer)
Reviewer (Member): Do you need this? When you set the loss, this should go through the tree and be called already?

@rbharath (Member Author): Good point, will remove.


if loss == "binary_crossentropy":
  smce = SoftMaxCrossEntropy(in_layers=[labels, prev_layer])
  self._add_layer(smce)
Reviewer (Member): Ditto for this _add_layer.

@rbharath (Member Author): Will remove.

else:
  # TODO(rbharath): Add in support for additional losses.
  raise ValueError("Unsupported loss.")
self._built = True
Reviewer (Member): Why is this needed? super().fit() should call build and will also install the queue for faster training.

@rbharath (Member Author): Good point. Will remove.

@peastman (Contributor) commented Nov 8, 2017

This class is kind of strange, in that the model isn't fully defined until you call fit(). But what if it never gets called? What if instead the user calls restore() to reload a previously fit model? It won't work.

@rbharath (Member Author) commented Dec 21, 2017

@peastman You raise a good point about restore(). For now, I raise a ValueError if restore() is called. In a future PR, we should figure out a more general mechanism. The issue is that Sequential allows users to avoid specifying placeholders and imputes placeholder shapes from the provided dataset in fit(). This means that the full TensorFlow graph can't be built until fit() is called.

A more general solution would probably be to write placeholder shapes in metadata and add placeholders if necessary in restore(). Will do this in a future PR.
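The interim guard described here (fail loudly on restore() until the graph exists, since placeholder shapes are only imputed during fitting) can be sketched as follows. These are illustrative names, not the actual deepchem code:

```python
# Toy sketch of the restore() guard: because Sequential defers graph
# construction to fit(), restore() cannot rebuild the graph before a fit.
# A future version could persist placeholder shapes as metadata instead,
# as suggested above.

class ToySequentialModel:
    def __init__(self):
        self._built = False
        self.input_shape = None

    def fit(self, X):
        # Placeholder shapes are imputed from the dataset here.
        self.input_shape = (None, len(X[0]))
        self._built = True

    def restore(self):
        if not self._built:
            raise ValueError(
                "Cannot restore a Sequential model before fit(): placeholder "
                "shapes are only imputed from the dataset during fitting.")
        return self.input_shape
```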

@rbharath rbharath mentioned this pull request Dec 21, 2017
@coveralls commented

Coverage increased (+0.01%) to 80.749% when pulling c6b7c64 on rbharath:sequential into fbf8b79 on deepchem:master.

@coveralls commented Dec 21, 2017

Coverage increased (+0.07%) to 80.806% when pulling 046c8e8 on rbharath:sequential into f6e75cf on deepchem:master.

@rbharath (Member Author): Going to go ahead and merge this PR. Will address any outstanding issues in a follow-on PR.

LGTM

@rbharath rbharath merged commit ae75476 into deepchem:master Dec 21, 2017