Revise tagging mechanism. Add the notion of @ProvidedTags. #123

chumer · 2016-03-14T01:32:45Z

Since there was bad feedback on the SourceSection#hasTag solution, I tried to come up with a more flexible approach to tagging. I changed to a protected method Node#isTaggedWith(String). So its now up to the language how tagging is implemented.

Also new and noteworthy:

ProvidedTags to specify for a language which tags are provided by a language. (isTaggedWith is never called with other tags than that)
RequiredTags to specify which tags are required by an instrument. One cannot create queries for tags they do not require anymore.
PolyglotEngine#Instrument#isCompatibleWith(Language) to check wheter an instrument is compatible to a language. (contains all tags)
Instrumenter#isNodeTaggedWith(Node, String) to query other nodes not instrumented for a specific tag.

Since the behavior for provided and required tags is incompatible with the original version, I've introduced a temporary compatibility mode flag -Dtruffle.instrumentation.compatibility=true. For a language to verify that it is compatible to the new required/provided tags schema you need to run all you tests with compatibility set to false.

Also please note that in compatibility mode this change is fully compatible also SourceSection#hasTag is still supported.

@mickjordan @chrisseaton @woess please review for use in language
@mlvdv @jtulach please review for everything else.

@mlvdv
As you requested: here is a sketch to display all tags of a node:

    public static void displayAllTags(PolyglotEngine engine, Instrumenter instrumenter, Node node) {
        Set<String> allTags = new HashSet<>();
        for (Instrument instrument : engine.getInstruments().values()) {
            allTags.addAll(instrument.getRequiredTags());
        }
        for (Language language : engine.getLanguages().values()) {
            allTags.addAll(language.getProvidedTags());
        }

        for (String tag : allTags) {
            if (instrumenter.isNodeTaggedWith(node, tag)) {
                System.out.println(String.format("Node %s is tagged with %s.", node, tag));
            }
        }
    }

jtulach · 2016-03-14T10:01:04Z

I don't understand why one should be able to display all tags of a node.

lukasstadler · 2016-03-14T10:15:43Z

truffle/com.oracle.truffle.sl/src/com/oracle/truffle/sl/nodes/SLStatementNode.java

+    @Override
+    protected boolean isTaggedWith(String tag) {
+        switch (tag) {
+            case Debugger.HALT_TAG:


couldn't this be solved much simpler now, by just checking whether the parent is a sequence node?

I would need to check for the parent beeing a wrapper. Also isTaggedWith is defined as unchanged after AST creation, its hard to gurantee that if the parent is changing. Also its not true that each statement of a block is tagged halt in SL. For conditionals you want to step before the condition and not the full if expression. So the boolean solution is a lot easier to fine-tune.

smarr · 2016-03-14T10:24:46Z

In responds to @lukasstadler question about 'just checking a parent' for something.
This brings up a problem, which might or might not have been discussed before, don't remember:

How much support is needed/desired to 'just check' a node, a parent, or child for something in the potential presence of wrapper nodes?

Do wrapper nodes delegate questions about tags to their wrapped nodes? When asking a parent node, this could give very confusing results. Do you expect every usage side to guard with node instanceof WrapperNode? I got quite a number of these checks in SOMns, and I am not sure I put all in that are necessary.

chumer · 2016-03-14T10:50:42Z

@smarr Yes this is the downside of using wrappers. The alternative would be to make each leaf node class instrumentable individually without wrappers.

Do wrapper nodes delegate questions about tags to their wrapped nodes?

Yes since wrappers derive the automatically inherit the tags behavior. But wrapper nodes are never asked for tags by the instrumentation framework.

Do you expect every usage side to guard with node instanceof WrapperNode?

I don't think its good practice to rely on the parent/childs type. If you do, eg for pattern matching, you can work with helpers that filter wrapper nodes.

chumer · 2016-03-14T11:04:25Z

@jtulach @mlvdv wants to have that for instrument debugging.

woess · 2016-03-14T11:26:15Z

I believe relying on a parent node is fragile and thus should be avoided.

lukasstadler · 2016-03-14T13:38:36Z

I believe relying on a parent node is fragile and thus should be avoided.

Why? as long as there's a way to identify the next "real" parent, it should be fine.
Are you thinking about situations where the parent of still-executing code is changed to null?

smarr · 2016-03-14T13:49:23Z

Method-level recursion and AST-level self-optimization can lead to very surprising results when relying on certain invariants wrt. parent nodes.

This bit me because some information I need for tagging only becomes available during execution, i.e., after self-optimization in dynamic languages. So, the tags-are-stable-after-AST creation is very optimistic.

chumer · 2016-03-14T16:43:36Z

@lukasstadler no parent do ever become null after adoption. this is necessary also for multithreading.
I agree with @smarr that the parent is always that stable is optimistic. It might lead to hard to debug errors and its hard to verify. Wheras having a simple state in the node is easy to verify unchanged after AST creation, but it needs state. But for FastR it might make sense to do that. The good thing is that each language can now choose how to do it.

woess · 2016-03-14T17:22:45Z

@lukasstadler Simply because there's no real guarantee that your parent is always going to be what you think it will be. You're right, it can work if you can identify the "real" parent. I'm thinking more about potential refactoring hazards: someone might later change the AST structure a bit or introduce a new parent node and forget to update the tag detection. (e.g., you might skip the SequenceNode for a single statement and suddenly the statement tagging is gone.)
Obviously, these aren't obstacles, and it's totally up to the guest language implementation.

chrisseaton · 2016-03-14T18:33:45Z

Works for JRuby jruby/jruby@truffle-head...truffle-new-tagging

woess · 2016-03-16T01:14:35Z

It would have been good to split this into 2 separate changes (which I believe are mostly independent):

isTaggedWith
ProvidedTags, RequiredTags
I like it and I think this is the right approach.
doesn't really allow for optional tags, does it? I'm not sure all these restrictions are really necessary and if not implementing one tag should already mean that the instrument is incompatible.

Please also add a changelog entry. Thanks!

chumer · 2016-03-16T10:06:48Z

@woess ad 2: Its never going to get as easy as now to integrate this. I think adding @OptionalTags as well could solve most of our problems, right?

Ad changelog. Will add it.

woess · 2016-03-16T13:43:25Z

@chumer I'm more concerned that we're introducing something that we don't actually need. That said I don't object to the annotations.

ghost · 2016-03-16T14:59:32Z

In FastR we have some utililty functions to "unwrap" both down and up that takes care of wrapper nodes.

jtulach · 2016-03-18T17:13:49Z

Objections:

instrumentation presents itself as an optional feature - it used to be once (in early January) - but since then it sneaked into core API and now it even has its central method in Node. Something is rotten in the kingdom of Denmark!
while the list of supported tags per language/instrument Require/Provide tags is nice, we don't need it for anything right now. E.g. there is some truth on Andreas @woess comment that this could be separated into two changes - and only one of them is incompatible -e.g. needs to be done in somoe hurry.
discussion about parent is completely out of scope of this change as that aspect isn't influenced at all.
listing all tags is a non-goal - e.g. Instrument.getRequiredTags shouldn't be public. If we really want to use that for verification of compatibility between language/instrument, such methods should be at most protected.

chumer · 2016-03-18T17:46:39Z

Its still optional in the sense that you don't need to implement anything at all. In the current state its sneaking into the SourceSection API which is an even more central API. Do you have any suggestions on how to fix that? I think its a good compromise. This solution comes with no overhead (as opposed to having a SourceSection#tags field)
Will separate into separate PR. Will reuse this PR for the tagging mechanism. I don't agree that any change in here is incompatible to the previous Truffle version. Since we introduce an all new instrumentation framework its now that we can introduce such an api like require/provide very easily. Later its going to be more difficult.
Its not at all out of scope as the languages want to know how they should implement isTaggedWith. How getParent() behaves is very relevant to this question.
Agreed. Will remove Instrument#getRequiredTags from the API.

woess · 2016-03-18T20:27:24Z

(1.) Yes, it's better than the SourceSection solution. But we could extract it into an interface.
(4.) Agreed. There's no reason to get all tags.

chumer · 2016-03-20T13:05:35Z

@woess
(1.) So you suggest an interface InstrumentableNode? We had this in the first prototype. @jtulach is that better in your opinion as well?

jtulach · 2016-03-21T10:33:48Z

It is really interesting to see how the design walks in spiral (hopefully it is a spiral, not a circle).

In December we had relatively working design based on annotations. Then we found it too constraining (Ruby had a node which might or might not be a statement) and switched to string tags to give us flexibility. Now we see that strings are bit too flexible, and we need a way to find out if language/instrument can agree on a set of tags. Thus we build a new protocol (provides/requires annotation) to find out if the tags match.

This whole round trip makes me ask: What was the core reason for rejecting annotations? My answer: the inflexibility shown by a Ruby wanting to represent statement/non-statement by a single class.

Well, that can easily be fixed by Node.isTaggedWith or similar. Then the dynamism is there, yet we can keep using standard Java meta programming facility - e.g. annotations. Here is my rewrite of the API: jtulach@7b30201 - it is not fully polished, but following works:

truffle$ mx clean && mx build && mx unittest sl.test && mx javadoc

The test execution includes SLDebugTest which makes me believe it is working correctly for the debugger usecase and adopting languages to this API will be straightforward.

If we switch to the annotations again, we'll get static typing between instrument and a language and ability to perform static analysis during compilation - for example find out which annotations a language nodes support automatically. We'll be able to use standard IDE tools as home made protocols like provides/requires tags will no longer be needed or will operate on typed annotations.

chrisseaton · 2016-03-21T11:24:11Z

My answer: the inflexibility shown by a Ruby wanting to represent statement/non-statement by a single class.

The problem isn't that the Ruby team is just stubborn about adding a new class to represent statements. I can still do that if it's wanted. I would just wrap each expression that is also a statement with a new StatementNode, which could be annotated with the right tags.

The problem is that this new node is essentially a wrapper node - a node that just takes up space and slows down partial evaluation - and this is what the original rewrite of instrumentation was supposed to remove. That was my only objection.

If we want to go back to wrapper nodes and annotations on nodes, that's fine I can do that any time.

smarr · 2016-03-21T12:02:40Z

From my experience with the dynamic metrics tool, annotations alone are not sufficient. You don't want to restructure your AST nodes just because you want to support a new tool. Also, it is unclear that tools are not going to have conflicting requirements when it comes to tags, so, we need more flexibility there.

woess · 2016-03-21T12:28:17Z

@chumer: It's a possibility if @jtulach thinks it should not be part of the Node API. However, I don't think it has to be extracted, now that it's just one method querying a tag.

lukasstadler · 2016-03-21T12:34:50Z

From the FastR perspective, what constitutes a statement cannot be tied to specific node types, because it depends on where something is, not what it is.
Whether the tag itself is a string or an annotation type should be decided based upon whether the tighter coupling of class references as opposed to matching stings is desired and/or possible.
The main difference is that you actually have to compile against the source of whoever provides the tags, right?

jtulach · 2016-03-21T12:48:53Z

Hello @chrisseaton, no I don't want to re-introduce wrappers into Ruby. The system that I am proposing should have the same level of flexibility. See https://github.com/jtulach/jruby/commit/8aaed63d02eb5407a46e9cae19f0701bd4d4b724

Good point, @lukasstadler. The compilation makes the difference. Please note that in both systems (e.g. @ProvidedTags vs. annotations) the list of tags supported by a language is hardcoded during compilation of the language binary. Thus I think @ProvidedTags and annotations have the same flexibility.

chrisseaton · 2016-03-21T12:53:11Z

isAnnotationPresent is a new method - this isn't related to Java's Class#isAnnotationPresent is it? And I guess there's a default that does the obvious thing (presumably using Class#isAnnotationPresent)?

Seems fine to me.

woess · 2016-03-21T14:14:29Z

@jtulach: seems like a good compromise, although it's a bit of an abuse of annotations.

chumer · 2016-03-25T14:46:05Z

@chrisseaton think of it as: you need to be able to run code that uses local variables at that location. If you would do that before the prolog your expression might not be evaluatable.

That is actually true for any @Instrumentable source location.

lukasstadler · 2016-03-25T14:59:51Z

tags can be defined by what the debugger and other tools do with this information, or at an abstract language level, e.g., create a definition for what a statement is.
the former requires tool writers to collaborate on creating a set of tags, while the latter additionally requires involvement form the language writers.

I'd go with the former - what difference does it make whether something is a statement, call or expression, other than what tools do with it?

+1 though on discussion this as a separate issue.

chumer · 2016-03-25T15:16:27Z

I'd go with the former - what difference does it make whether something is a statement, call or expression, other than what tools do with it?

Are you opting for no tag sharing between tools? We want a standard set of tags shared between the languages independent of tools. There is/was consensus about that in the meetings. How exactly the tags look like, we don't know. But we can evolve them easily. So it might turn out that we have a DEBUGGER_HALT standard tag in the future. Lets wait and see.

+1 though on discussion this as a separate issue.

Yes we should continue the discussion. For this release I will go with this set of tags (or maybe some minor modifications) because we have use-cases. If it turns out they are wrong we deprecate them. I do this now to fix concerns raised by @mickjordan .

smarr · 2016-03-25T15:30:35Z

Was there a specific reason not to introduce a common super class for tags? If I am not mistaken someone else also asked for it. Could make finding tags easier, I think. (and I think typing was brought up as another benefit.)

woess · 2016-03-25T15:31:19Z

Note that statement is not necessarily the same as debugger-halt (i.e., you might want to halt on something that's not a statement (in the language) and not halt on some specific statements). Can this be a problem for our set of standard tags?

woess · 2016-03-25T15:33:30Z

Was there a specific reason not to introduce a common super class for tags? If I am not mistaken someone else also asked for it. Could make finding tags easier, I think. (and I think typing was brought up as another benefit.)

One reason not to have a common superclass could be to allow annotation classes to be used as tags.

smarr · 2016-03-25T15:34:35Z

@woess, I think it is more of an indication that tags will likely be rather tool specific. For my tool, I see only the root node as useful.

wrt. superclass: ok, I see.

lukasstadler · 2016-03-25T15:37:45Z

Are you opting for no tag sharing between tools?

Quite the opposite. My point is that it doesn't make sense for the languages to add tags and hope that they are useful. The set of tags should result from discussions on what would be useful for tools, and individual tags should be defined in terms of the expected behavior (as opposed to names of programming language concepts, which can be interpreted in wildly different ways by different languages).

woess · 2016-03-25T15:40:47Z

@smarr Regarding superclass/interface: Not sure if that's relevant to users, though. The standard tags aren't annotation classes, so these can't be used as annotations right now.

mlvdv · 2016-03-28T04:44:18Z

This is a good discussion, but sometimes the most important requirements get lost in a sea of implementation detail. This is a big deal because tags are the single most important functional thing about Truffle Instrumentation (runtime overhead seems well under control now, and everything else is secondary).

Here is the first of three comments about the actual requirements for tags in the Instrumentation Framework

First: the usefulness of Truffle Instrumentation depends almost completely on their flexibility.

Here’s why. Tags provide the essential level of indirection between language engineering and tool behavior. Tags make it possible to configure tool behavior with the least possible impact on language engineering. Their concerns are orthogonal:

Language engineering is strongly constrained by correctness and mainly motivated by performance.
Tool behavior, on the other hand, is mainly sensitive to user expectations, part of an evolving tool set, and subject to frequent fine-tuning.

So, do not restrict how or when language implementors apply tags any more than absolutely necessary for fast path performance. Every bit of coupling between tool behavior and language engineering reduces the usefulness of tags, and this includes anything that relates to AST node types. This is true for the standard tools we are building now, and it will becomes more true for tools not yet built.

Other than fast path concerns, flexibility is the only thing that matters. A tag gets bound to one or more source locations; the rest is implementation detail.

mlvdv · 2016-03-28T04:49:32Z

A second comment on the real requirements for tags.

The critical functional requirement for Truffle tags is flexibility, but this discussion is also about how to use them. Instrumentation/tools will only get adopted in practice if we get the tags right. Our strategy for adoption is:

provide workable tools to both language implementors and end users
for very little effort
that can be incrementally fine-tuned and improved as needed.

This strategy creates two hard requirements:

The platform must define standard tags that configure a collection of standard tools (debugger, profiler, etc.), and it must be as easy as possible for language implementors to apply tags and to adjust them frequently.
The platform must provide standard tools that are useful (if not optimal) for every language implementation that applies standard tags, and it must be as easy as possible to fine-tune tool behavior using tool-specific tags.

Adoption depends on this positive feedback loop. A language implementor adds a few standard tags, and the Truffle platform rewards her/him immediately with tools that do something useful. Since they are useful, the language implementor and colleagues use tools during development and notice rough edges. This motivates him/her to improve tool behavior, possibly in collaboration with tool builders, by adjusting standard tags and fine-tuning tool-specific tags.

mlvdv · 2016-03-28T04:56:32Z

A third comment on the real requirements for tags.

As recent discussion shows, we are still stuck on how to define standard tags. The only way forward is to keep working with tags collaboratively, tool builders together with language implementors. We have to do this, even if this costs some API stability.

@chumer want to return to tags as objects, with some new (to me) features, and detach them from SourceSection. This is the right thing to do (except for not enough tagging flexibility!).

In order to move forward and get some experience, we should start now with an absolutely minimal set of standard tags. For Debugger only STATEMENT and CALL. I know that CALL can be tricky, so let’s get started working through those issues with Debugger and Profiler as clients. There seems to be an understood use case for ROOT, so that could also be included. All other tags can be postponed.

Then we can redirect most of this discussion, based on very little experience, back to working with the actual tags, languages, and tools. We'll all learn a lot.

Can we manage this within our API constraints? I’d be happy to keep a new set of standard tags out of the published API during this experimental period, if that’s workable; otherwise we just have to take some stability hit so we can work out issues between language implementations and tools.

chumer · 2016-03-29T07:09:56Z

@mlvdv don't my latest changes exactly do what you want? Introducing a minimal StandardTags class? I want to add the tags to the published API. No problem to deprecate and remove them later. Could you comment on the proposed set of StandardTags (with javadoc)?

Also another small correction: we don't move to objects as tags but classes as tags which is somewhat different.

jtulach · 2016-03-29T08:28:46Z

I did a quick mapping of @mlvdv high-level goals to the actual code included in this pull request and as far as I can tell, the essentials are part of @chumer changes. The flexibility is there, type-safety has been increased, and now we even converged on a minimal set of general purpose tags!

That is great and I am looking forward to offer these changes to public as part of Truffle 0.12 release.

chumer · 2016-03-29T08:43:32Z

Will merge after the check is through.

…LE.COM/truffle:bug/empty-source-section to master Including test that now passes. * commit 'f827720a2922962c3e869b17b9438a6324cf62a1': SourceSection: include a test that now passes

chumer added feature oracle-emp labels Mar 14, 2016

chumer assigned jtulach Mar 14, 2016

lukasstadler reviewed Mar 14, 2016
View reviewed changes

chumer added 6 commits March 25, 2016 16:23

Revise tagging mechanism. Add the notion of @ProvidedTags.

2aa2eff

Merge fixes in REPLServer.

424177a

Update Node javadoc to use snippets.

46b89dc

Fix eclipseformat.

d462ce0

Make findbugs (unwritten field) happy with the StatementNode example.

fce3b98

Introduce a set of standard tags.

ae4094f

chumer force-pushed the improved_tagging branch from ee1ec64 to ae4094f Compare March 25, 2016 15:27

SLStatementNode needs to preserve tags on copy.

0372a1f

jtulach added accept and removed do not merge labels Mar 29, 2016

chumer added 2 commits March 29, 2016 10:30

Merge branch 'master' into improved_tagging

dc29600

Fix merge problems.

09d7384

chumer merged commit 162e68a into oracle:master Mar 29, 2016

brunoborges unassigned jtulach Jan 8, 2018

Revise tagging mechanism. Add the notion of @ProvidedTags. #123

Revise tagging mechanism. Add the notion of @ProvidedTags. #123

Conversation

chumer commented Mar 14, 2016 • edited by pitr-ch

jtulach commented Mar 14, 2016

lukasstadler Mar 14, 2016

Choose a reason for hiding this comment

chumer Mar 14, 2016

Choose a reason for hiding this comment

smarr commented Mar 14, 2016

chumer commented Mar 14, 2016

chumer commented Mar 14, 2016

woess commented Mar 14, 2016

lukasstadler commented Mar 14, 2016

smarr commented Mar 14, 2016

chumer commented Mar 14, 2016

woess commented Mar 14, 2016

chrisseaton commented Mar 14, 2016

woess commented Mar 16, 2016

chumer commented Mar 16, 2016

woess commented Mar 16, 2016

ghost commented Mar 16, 2016

jtulach commented Mar 18, 2016

chumer commented Mar 18, 2016

woess commented Mar 18, 2016

chumer commented Mar 20, 2016

jtulach commented Mar 21, 2016

chrisseaton commented Mar 21, 2016

smarr commented Mar 21, 2016

woess commented Mar 21, 2016

lukasstadler commented Mar 21, 2016

jtulach commented Mar 21, 2016

chrisseaton commented Mar 21, 2016

woess commented Mar 21, 2016

chumer commented Mar 25, 2016

lukasstadler commented Mar 25, 2016

chumer commented Mar 25, 2016

smarr commented Mar 25, 2016

woess commented Mar 25, 2016

woess commented Mar 25, 2016

smarr commented Mar 25, 2016

lukasstadler commented Mar 25, 2016

woess commented Mar 25, 2016

mlvdv commented Mar 28, 2016

mlvdv commented Mar 28, 2016

mlvdv commented Mar 28, 2016

chumer commented Mar 29, 2016

jtulach commented Mar 29, 2016

chumer commented Mar 29, 2016

chumer commented Mar 14, 2016 •

edited by pitr-ch