Add a helper function for sequential models. #20
Conversation
Many deep learning models are composed of sequential layers stacked one on
top of each other. It can be relatively tedious to write out the explicit
`applied(to:)` function because it's fairly repetitive and the underlying
intent is relatively obscured. (It can be especially bothersome because
it's the 2nd (or 3rd) time you're writing out all the layers: the first time
is to declare all the instance variables, and the second time (if necessary)
is in the initializer.)
Fortunately, with a single helper function, we can make everything type
safe as well as convenient, expressible, and readable!
This commit adds a family of `Sequential` functions that take in a context, an
input, and a variable number of layers. It chains the output of each
layer into the input of the next.
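For illustration, here is the shape of the two-layer variant (its signature appears verbatim in the diff below); the body is a sketch that assumes `Layer` exposes an `applied(to:in:)` method:

```swift
@differentiable(wrt: (input, l1, l2))
public func Sequential<L1: Layer, L2: Layer>(
    in context: Context, from input: L1.Input, _ l1: L1, _ l2: L2
) -> L2.Output where L1.Output == L2.Input {
    // Feed the input through each layer in order.
    return l2.applied(to: l1.applied(to: input, in: context), in: context)
}
```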
This API approach has a number of advantages:
1. It avoids introducing new symbolic operators, which can be very confusing
to new users.
2. It works with today's AutoDiff implementation. (Yay!)
3. It is very readable and clean.
4. It avoids users "getting stuck". Concretely, if someone implemented a model
using my previously proposed `>>>` operator, if they wanted to add a
residual (or skip) connection, they would have to basically rewrite their
whole model using a struct, etc. With this API structure, only "local"
changes are required. (E.g., if only one skip connection is needed, they
can split the sequential chain into two pieces, as sketched below.)
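A hypothetical sketch of that split; the layer names, types, and matching shapes here are illustrative, not from this PR:

```swift
struct ResidualBlock: Layer {
    // Hypothetical layers; names and shapes are illustrative only.
    var conv1, conv2, conv3, conv4: Conv2D<Float>

    func applied(to input: Tensor<Float>, in context: Context) -> Tensor<Float> {
        // First half of the chain, up to the skip connection.
        let branch = Sequential(in: context, from: input, conv1, conv2)
        // Add the skip connection (shapes assumed compatible), then continue.
        return Sequential(in: context, from: branch + input, conv3, conv4)
    }
}
```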
Downsides of this approach:
1. It doesn't DRY-out the types required to define a model. (I have some
thoughts here, but there isn't enough room in this
margin^H^H^H^H^H^Hcommit message.)
2. We should think hard about how things should look when we have loops.
3. I'm sure there's a better way to code-gen out all the different Sequential
arities; a rough sketch follows below. (I got bored hand-writing them out
after 4...) Suggestions welcome!
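On point 3, a reviewer alludes below to gyb (the template generator used by the Swift standard library). Purely as a hypothetical sketch, not part of this PR, a template along these lines could stamp out the arities:

```
% for n in range(2, 7):
%   generics = ', '.join('L%d: Layer' % i for i in range(1, n + 1))
%   params = ', '.join('_ l%d: L%d' % (i, i) for i in range(1, n + 1))
%   wrt = ', '.join('l%d' % i for i in range(1, n + 1))
%   constraints = ', '.join('L%d.Output == L%d.Input' % (i, i + 1) for i in range(1, n))
%   body = 'input'
%   for i in range(1, n + 1):
%     body = 'l%d.applied(to: %s, in: context)' % (i, body)
%   end
@differentiable(wrt: (input, ${wrt}))
public func Sequential<${generics}>(
    in context: Context, from input: L1.Input, ${params}
) -> L${n}.Output where ${constraints} {
    return ${body}
}

% end
```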
I like this proposed direction!
(Feedback below is copied from internal chat, but I'll continue that in the open.)
There are always tradeoffs between sugar operators and methods with clear names. Swift standard library APIs (and Swift libraries in general) tend to discourage functional sugar. That guideline isn't necessarily the right one for every domain-specific library, but we should start with clearly named APIs and carefully consider the consequences of each proposed sugar.
I have the following concerns:
- Naming. Swift function names should use camel case. Moreover, function names should be clear about the operation: `sequential(in:from:...)` does not do a good job of indicating that this function performs a chain of function applications, because it reads as though it returns a sequence.
  - Usually a gerund is better in Swift. However, the argument label `from:` is still unclear.

    ```swift
    func sequencing<...>(in context: Context, from input: Input, _ layer1: Layer1, ...) -> LayerN.Output
    ```

  - We can use `through:` to make it read better, but it's overly verbose. So, tradeoffs.

    ```swift
    func sequencing<...>(in context: Context, from input: Input, through layer1: Layer1, ...) -> LayerN.Output
    ```

- Free functions should be introduced with extra care. They are generally discouraged because they 1) mess up code completion and 2) are not quite idiomatic unless they have lots of common precedents, e.g. `sin(_:)` and `matmul(_:_:)`. In this case, I do not think it should be a top-level function. Instead, I'd very much prefer to define this as a protocol extension method in `Differentiable`:

  ```swift
  extension Differentiable {
      func sequenced<...>(in context: Context, through layer1: Layer1, _ layer2: Layer2, ...) -> LayerN.Output
  }
  ```

  This approach would make the API easy to discover without messing up top-level code completion, and it conforms to the Swift API design guidelines. It also keeps the input from being juxtaposed with the layers, emphasizing that `input` is the source of what's being sequenced. Of course, it would look much, much better without the context argument, but we don't support implicit arguments yet.

  ```swift
  func applied(to input: Input) -> Output {
      return input.sequenced(in: context, through: layer1, layer2, layer3, ...)
  }
  ```
Sources/DeepLearning/Layer.swift (Outdated)

```swift
@differentiable(wrt: (input, l1, l2))
public func Sequential<L1: Layer, L2: Layer>(in context: Context, from input: L1.Input, _ l1: L1, _ l2: L2) -> L2.Output where L1.Output == L2.Input {
```
Function names should always use camel case.
…ntax and avoiding polluting the global function namespace. Also switch to camelCase.
rxwei left a comment
Love it!
Sources/DeepLearning/Layer.swift (Outdated)

```swift
extension Differentiable {
```
Use `public extension` so you can drop `public` from the individual methods.
Thanks; fixed!
```swift
@differentiable(wrt: (self, l1, l2))
public func sequenced<L1: Layer, L2: Layer>(
    in context: Context, through l1: L1, _ l2: L2)
    -> L2.Output
```
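(The declaration is truncated in the diff; presumably it continues with constraints analogous to the free function's. A sketch of the full form, again assuming `Layer` exposes `applied(to:in:)`:)

```swift
@differentiable(wrt: (self, l1, l2))
public func sequenced<L1: Layer, L2: Layer>(
    in context: Context, through l1: L1, _ l2: L2)
    -> L2.Output
    where L1.Input == Self, L1.Output == L2.Input {
    // Feed `self` through each layer in order.
    return l2.applied(to: l1.applied(to: self, in: context), in: context)
}
```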
I was about to leave comments about indentation, but I thought "it's gonna be gyb'd anyway" :)
Yeah, I don't have my editor set up to do the indentation automatically for me. :-( Sorry!