[AutoDiff] lift samefile derivative constriant #28790

marcrasi · 2019-12-14T01:31:24Z

Lifts the same-file derivative constraint and adds many test cases for cross-file and cross-module @derivative attributes.

The tests revealed many broken things. I filed TODOs for some of the broken things, and I fixed others in this PR.

The things that are fixed in this PR are:

- If there is no @differentiable attribute, then the differentiability witness must have the linkage of the derivative function. (Otherwise, when you define a @derivative of a function in a separate file or module, the differentiability witness definition gets external linkage because the original function has external linkage, and you're not supposed to give external linkage to definitions.)
- The derivative's visibility must be checked to be less than or equal to the original function's visibility, because a derivative that is more visible than its original function doesn't make sense.
- TBDGen needed to be changed back to operate on attributes rather than on AFD->getDerivativeFunctionConfigurations(), because a @derivative of a function in a different module puts a configuration on the function in the different module, and the TBDGenVisitor doesn't visit functions in other modules!
- Typechecking needed to be modified to eagerly check all the @derivative attributes in the module (rather than just the @derivative attributes in the primary file), so that the differentiation pass sees configurations arising from @derivative attributes in other files. This approach is loosely inspired by bindExtensions.
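
The visibility rule in the list above can be sketched as a small standalone check. This is only a toy model: the `Visibility` enum and both function names are hypothetical and are not the types or helpers used in TypeCheckAttr.cpp.

```cpp
#include <cassert>
#include <string>

// Toy access levels, ordered from least to most visible.
// Hypothetical; not the compiler's own access-level type.
enum class Visibility { Private = 0, Internal = 1, Public = 2 };

// Returns true when a derivative with visibility `derivative` may be
// registered for an original function with visibility `original`.
bool isDerivativeVisibilityValid(Visibility original, Visibility derivative) {
  return static_cast<int>(derivative) <= static_cast<int>(original);
}

// Returns a diagnostic-style message, or an empty string if the pair is valid.
std::string checkDerivativeVisibility(Visibility original,
                                      Visibility derivative) {
  if (isDerivativeVisibilityValid(original, derivative))
    return "";
  return "derivative is more visible than the original function";
}
```

Under this model, an internal derivative of a public function passes the check, while a public derivative of an internal function is rejected.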

rxwei · 2019-12-14T07:07:01Z

test/AutoDiff/Inputs/derivative_registration/module2/module2.swift

+
+extension FunctionInModule1_InternalDerivatives {
+  // TODO(TF-XXXX): This causes duplicate symbol linker errors.
+  // TODO(TF-XXXX): Why is @usableFromInline necessary?


@usableFromInline isn't necessary if we are registering an internal derivative, but is necessary if we are registering a public derivative.

marcrasi · 2019-12-18T05:16:31Z

Ok, this is ready for review now.

I'm running swift-apis and swift-models tests on this too because it does some pretty dramatic things.

dan-zheng · 2019-12-18T06:04:12Z

lib/TBDGen/TBDGen.cpp

@@ -67,6 +67,8 @@ void TBDGenVisitor::addSymbol(StringRef name, SymbolKind kind) {
  if (StringSymbols && kind == SymbolKind::GlobalSymbol) {
    auto isNewValue = StringSymbols->insert(mangled).second;
    (void)isNewValue;
+    if (!isNewValue)


remove debug print statement

lib/Sema/TypeCheckAttr.cpp

rxwei · 2019-12-18T06:25:56Z

For a public original function, how do we currently differentiate (no pun) between an internal derivative and a public derivative? Is that by the @usableFromInline attribute?

dan-zheng · 2019-12-18T14:22:25Z

lib/Sema/TypeCheckAttr.cpp

@@ -2919,8 +2920,7 @@ static bool checkFunctionSignature(
 static IndexSubset *computeDifferentiationParameters(
    ArrayRef<ParsedAutoDiffParameter> parsedWrtParams,
    AbstractFunctionDecl *function, GenericEnvironment *derivativeGenEnv,
-    StringRef attrName, SourceLoc attrLoc
-) {
+    StringRef attrName, SourceLoc attrLoc, bool diagnoseErrors = true) {


Rather than adding an invasive diagnoseErrors flag, can you try using DiagnosticTransaction for diagnoseErrors = false users instead?

DiagnosticTransaction collects all diagnostics and provides a way to cancel them:

DiagnosticTransaction transaction(Ctx.Diags);
SWIFT_DEFER { transaction.abort(); };

See #28717 for an example.

I'll try this.
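
For readers unfamiliar with the pattern, here is a minimal self-contained model of the suggestion. It is a sketch, not the compiler's implementation: `ToyDiagnosticEngine`, `ToyDiagnosticTransaction`, and both checker functions are hypothetical stand-ins. It only shows how a checker can always diagnose while a caller that wants silence discards the output.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Toy diagnostic sink: a list of emitted messages. Hypothetical type.
struct ToyDiagnosticEngine {
  std::vector<std::string> emitted;
  void diagnose(const std::string &message) { emitted.push_back(message); }
};

// Toy transaction: remembers how many diagnostics existed when it was opened.
// abort() discards everything emitted since then; otherwise they are kept.
class ToyDiagnosticTransaction {
  ToyDiagnosticEngine &engine;
  std::size_t startCount;

public:
  explicit ToyDiagnosticTransaction(ToyDiagnosticEngine &e)
      : engine(e), startCount(e.emitted.size()) {}
  void abort() { engine.emitted.resize(startCount); }
};

// A checker that always diagnoses on failure; it takes no diagnoseErrors flag.
// Returns true on error.
bool checkParameterIndex(ToyDiagnosticEngine &engine, int index,
                         int paramCount) {
  if (index >= 0 && index < paramCount)
    return false;
  engine.diagnose("parameter index out of range");
  return true;
}

// A caller that wants a silent check wraps the call in a transaction
// and aborts it, so the checker itself needs no extra parameter.
bool checkParameterIndexSilently(ToyDiagnosticEngine &engine, int index,
                                 int paramCount) {
  ToyDiagnosticTransaction transaction(engine);
  bool failed = checkParameterIndex(engine, index, paramCount);
  transaction.abort();
  return failed;
}
```

The design point is that suppression becomes the caller's concern, which keeps the checker's signature unchanged.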

marcrasi · 2019-12-18T16:30:55Z

> For a public original function, how do we currently differentiate (no pun) between an internal derivative and a public derivative? Is that by the @usableFromInline attribute?

Here are some current working behaviors:

// public derivative
@differentiable
public func f1(...) {...}
@derivative(of: f1)
internal func df1(...) {...}

// public derivative
public func f2(...) {...}
@derivative(of: f2)
public func df2(...) {...}

// internal derivative
public func f3(...) {...}
@derivative(of: f3)
internal func df3(...) {...}
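
One way to read the three cases above is as a single rule. The C++ sketch below encodes that reading; the rule is an inference from these examples and from the linkage note in the PR description, not something the compiler is confirmed to implement in this form, and the `Access` enum and function name are hypothetical.

```cpp
#include <cassert>

// Toy access levels; hypothetical, not the compiler's types.
enum class Access { Internal, Public };

// Inferred rule (an assumption drawn from the three examples):
// - if the original function is marked @differentiable, the registered
//   derivative follows the original function's access level;
// - otherwise it follows the access level of the derivative function itself.
Access registeredDerivativeAccess(Access original,
                                  bool originalHasDifferentiableAttr,
                                  Access derivativeFunction) {
  return originalHasDifferentiableAttr ? original : derivativeFunction;
}
```

With this reading, f1 and f2 yield a public derivative and f3 yields an internal one, matching the comments in the examples.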

dan-zheng · 2019-12-18T17:10:37Z

lib/Sema/TypeCheckAttr.cpp

+/// - Stores the attribute in the `ASTContext` list of derivative attributes.
+/// - Stores the derivative configuration in the original function's list of
+///   derivative configurations.
+static void typeCheckDerivativeAttr(ASTContext &Ctx, AttributeChecker *AC,


I believe AttributeChecker is just a thin wrapper around an ASTContext and we can call ASTContext::diagnose instead. How about removing the AttributeChecker * argument?

If it's possible to use DiagnosticTransaction in callers to abort diagnostics instead of conditionally emitting diagnostics based on a flag, that would be preferable too!

good idea, done

marcrasi

I've addressed all comments and uploaded a new commit.

I still want to add more tests, and I'll do that soon.

marcrasi · 2019-12-19T02:15:34Z

Actually, I think that testing and polishing this PR is going to take a lot more time and I would rather spend my last day before vacation doing some more immediately useful things. Therefore, I'm going to abandon this until I come back on Jan 2, 2020.

I will do one thing though: This PR requires some changes in upstreamed TypeCheckAttr.cpp code, so I'll send a master PR that makes those changes. Hopefully these changes will get downstreamed into tensorflow through the regular merge processes before I resume work on this PR :)

dan-zheng · 2019-12-19T20:32:58Z

lib/Sema/TypeCheckAttr.cpp

@@ -3790,6 +3812,12 @@ static FuncDecl *findAutoDiffDerivativeFunction(
  return funcDecl;
 }

+void AttributeChecker::visitDerivativeAttr(DerivativeAttr *attr) {
+  if (typeCheckDerivativeAttr(Ctx, D, attr))


Are @derivative attributes in the primary file already type-checked (e.g. parameter indices are resolved) after TypeChecker::typeCheckDerivativeAttrs is called in TypeCheckSourceFileRequest::evaluate?

If so, is AttributeChecker::visitDerivativeAttr doing redundant work (e.g. recomputing parameter indices)?

If so, can AttributeChecker::visitDerivativeAttr be changed to assert that attr is already type-checked, and to avoid redundant work by only checking for duplicate @derivative attributes?

Yes, there is redundant work happening. I can't deduplicate the work by doing exactly what you say because the second typechecking uses some of the intermediate calculations to make diagnostic messages (e.g. original function not found messages). So I need to either redo the computation or find some way of passing the intermediate calculations from the first time it typechecks to the second time it typechecks.

Here's an idea that could work:

static bool typeCheckDerivativeAttr(...) {
  if (!attr->getOriginalFunction() || !attr->getParameterIndices()) {
    // Do typechecking
    // If successful, set original function and parameter indices
    // If failure, diagnose and return true.
  }
  // Insert attr into Ctx.DerivativeAttrs
  // Diagnose duplicates
}

That way, if the first round of typechecking is successful, then we don't duplicate the work on the second round. If the first round fails, we duplicate the work during the second round, but speed is less important in the failure case so this isn't terrible.

> I can't deduplicate the work by doing exactly what you say because the second typechecking uses some of the intermediate calculations to make diagnostic messages (e.g. original function not found messages).

Aha, that makes sense.

> Here's an idea that could work:
>
> static bool typeCheckDerivativeAttr(...) {
>   if (!attr->getOriginalFunction() || !attr->getParameterIndices()) {
>     // Do typechecking
>     // If successful, set original function and parameter indices
>     // If failure, diagnose and return true.
>   }
>   // Insert attr into Ctx.DerivativeAttrs
>   // Diagnose duplicates
> }
>
> That way, if the first round of typechecking is successful, then we don't duplicate the work on the second round. If the first round fails, we duplicate the work during the second round, but speed is less important in the failure case so this isn't terrible.

This idea sounds great! Duplicate work only for bad attributes seems very acceptable.
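
The idea agreed on in this thread can be modeled in a few lines of standalone C++. Everything here is a hypothetical stand-in (`ToyDerivativeAttr`, `ToyContext`, the string-keyed lookup, the fixed parameter index); it only illustrates the control flow of "resolve once, re-resolve only after a failure", not the real type checker.

```cpp
#include <cassert>
#include <map>
#include <set>
#include <string>
#include <vector>

// Toy stand-in for a @derivative attribute. Hypothetical type.
struct ToyDerivativeAttr {
  std::string originalName;           // name written in `of:`
  bool isResolved;                    // set once type checking succeeds
  std::vector<int> parameterIndices;  // filled in on success
};

// Toy stand-in for the context that owns registrations and diagnostics.
struct ToyContext {
  std::set<std::string> knownFunctions;
  std::map<std::string, const ToyDerivativeAttr *> registered;
  std::vector<std::string> diagnostics;
  int resolutionCount = 0;  // how many times the expensive step ran
};

// Returns true on error, following the convention in the pseudocode.
bool typeCheckDerivativeAttr(ToyContext &ctx, ToyDerivativeAttr &attr) {
  if (!attr.isResolved) {
    // Expensive step: runs again only if an earlier round failed.
    ++ctx.resolutionCount;
    if (!ctx.knownFunctions.count(attr.originalName)) {
      ctx.diagnostics.push_back("original function not found: " +
                                attr.originalName);
      return true;
    }
    attr.isResolved = true;
    attr.parameterIndices = {0};  // placeholder result for the sketch
  }
  // Register the attribute; a different attribute already registered
  // for the same original function is a duplicate.
  auto inserted = ctx.registered.emplace(attr.originalName, &attr);
  if (!inserted.second && inserted.first->second != &attr) {
    ctx.diagnostics.push_back("duplicate derivative for " + attr.originalName);
    return true;
  }
  return false;
}
```

Calling the function twice on a valid attribute runs the resolution step once; calling it twice on an invalid attribute runs it twice and diagnoses twice, which is the trade-off accepted above.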

Since cross-file derivative registration requires a lot of distinct fixes and tests (some of which are drafted in #28790), it would be nice to be able to develop it on `tensorflow` under a flag. This PR adds a flag and a very tiny testcase -- the only situation that I'm aware of that currently works without any of the fixes.

marcrasi · 2020-01-08T21:47:12Z

Closing because I'm going to do this as separate incremental pieces gated behind the flag (#28891) instead of one large PR.

marcrasi requested review from rxwei and dan-zheng December 14, 2019 01:31

rxwei reviewed Dec 14, 2019

[AutoDiff] draft of lifting samefile derivative constriant

8dfd83b

marcrasi force-pushed the marcrasi-retrodiff-lift-samefile branch 2 times, most recently from 43fac5f to e6b39f7 on December 18, 2019 05:06

marcrasi changed the title ~~[AutoDiff] draft of lifting samefile derivative constriant~~ [AutoDiff] lift samefile derivative constriant Dec 18, 2019

marcrasi marked this pull request as ready for review December 18, 2019 05:16

dan-zheng reviewed Dec 18, 2019

typecheck all derivative attributes in the module

569b24c

marcrasi force-pushed the marcrasi-retrodiff-lift-samefile branch from e6b39f7 to 569b24c on December 19, 2019 00:47

marcrasi commented Dec 19, 2019

Merge branch 'tensorflow' of github.com:apple/swift into marcrasi-retrodiff-lift-samefile

5fc02b2

marcrasi mentioned this pull request Dec 19, 2019

[AutoDiff] factor derivative typechecking helper out of AttributeChecker #28879

Merged

dan-zheng reviewed Dec 19, 2019

marcrasi mentioned this pull request Dec 20, 2019

[AutoDiff] flag for cross-file derivative registration #28891

Merged

dan-zheng mentioned this pull request Jan 6, 2020

[AutoDiff] Enable derivative registration for imported C functions. #29016

Merged

marcrasi closed this Jan 8, 2020

dan-zheng mentioned this pull request Apr 23, 2020

[AutoDiff] Enable cross-file derivative registration by default. #31249

Merged
