Add other output formats for TreeStructureVisitor #753

matozoid · 2017-02-02T14:05:56Z

Try...

DOT
XML
HTML
YAML
JSON

matozoid · 2017-02-02T20:32:10Z

<root type='MethodCallExpr'>
    <arguments type='EnclosedExpr'>
        <inner type='EnclosedExpr'>
            <inner type='BinaryExpr' operator='PLUS'>
                <left type='IntegerLiteralExpr' value='1'/>
                <right type='IntegerLiteralExpr' value='1'/>
            </inner>
        </inner>
    </arguments>
    <name type='SimpleName' identifier='println'/>
    <scope type='FieldAccessExpr'>
        <name type='SimpleName' identifier='out'/>
        <scope type='NameExpr'>
            <name type='SimpleName' identifier='System'/>
        </scope>
    </scope>
</root>

matozoid · 2017-02-02T20:35:16Z

root (BinaryExpr)
	operator: PLUS
	left (IntegerLiteralExpr)
		value: 1
	right (IntegerLiteralExpr)
		value: 1

ryan-beckett · 2017-02-23T01:41:31Z

@matozoid: Is there any rhyme or reason to the XML format you've created, or is it just an example? Before I work on this, I'd like to agree on a syntax. I'll post a sample format that covers most of the node types, which we can review before I implement it.

ryan-beckett · 2017-02-23T01:43:11Z

@matozoid: Also, would it suffice to add these formatters as XMLStructureVisitor, DOTStructureVisitor, etc?

matozoid · 2017-02-23T09:53:44Z

Okay, the full story: I was working on this before, trying to get a basic framework together that would make it easy to build these things. I generated TreeStructureVisitor for that. By now I think I should have used the TreeVisitor or a custom simple recursive one. I think the user wants to specify how to output a single node, so it should have a method that receives that node, and you can do anything there. Maybe also a context or a stack for each level. The whole idea (read #754 too) is that people can quickly get an overview of what the node structure looks like. Since the output isn't meant to be processed further, the format can be arbitrary.

Anyway, the current TreeStructureVisitor should be trashed.

My plan was to create a new TreeStructureVisitor and give it a callback that will create the output. Pass different callbacks, get a different output format. The current one relies on extension (subclassing) instead.
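
A minimal sketch of that callback idea, for illustration only. `SketchNode` and `CallbackWalker` are hypothetical stand-ins and not JavaParser classes; the walker knows nothing about the output format, and the callback receives each node together with its depth:

```java
import java.util.Arrays;
import java.util.List;
import java.util.function.BiConsumer;

// Hypothetical stand-in for an AST node; this is not JavaParser's Node class.
final class SketchNode {
    final String type;
    final List<SketchNode> children;

    SketchNode(String type, SketchNode... children) {
        this.type = type;
        this.children = Arrays.asList(children);
    }
}

final class CallbackWalker {
    // Walks the tree depth-first and hands each node, with its depth, to the callback.
    static void walk(SketchNode node, int depth, BiConsumer<SketchNode, Integer> callback) {
        callback.accept(node, depth);
        for (SketchNode child : node.children) {
            walk(child, depth + 1, callback);
        }
    }

    // One possible callback: an indented plain-text outline.
    static String outline(SketchNode root) {
        StringBuilder out = new StringBuilder();
        walk(root, 0, (node, depth) -> {
            for (int i = 0; i < depth; i++) {
                out.append('\t');
            }
            out.append(node.type).append('\n');
        });
        return out.toString();
    }
}
```

Swapping the lambda in `outline` for another one would give a different format without touching `walk`.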

Naming: "tree structure visitor" says "I visit the structure of a tree." "XML structure visitor" says "I visit the structure of XML" which it does not - so I guess it needs the word "output" or so in there somewhere?

Output formats: I think the ones above are reasonably nice. TreeStructureVisitor does not deal correctly with lists so those are not in there yet. I say: try generating a bit in a few formats, see what you think is best, and post it here. Keep in mind that we're trying to visualize the tree to human eyes :)

Last note: if one of these needs a new maven dependency, don't do it.

ryan-beckett · 2017-02-24T02:02:35Z

@matozoid: Thanks for the clarification. I'll start looking into applicable patterns and get back to you.

ryan-beckett · 2017-02-25T16:20:15Z

@matozoid, on further thought I feel like this feature can get messy quickly if we try to come up with a way to allow the user to specify how to output a node. I think it'll suffice to simply supply a few outputs out of the box, following what you did in TreeStructureVisitor. Perhaps I could create a factory design atop the TreeStructureVisitor, so as not to break the API. Does this sound OK?

Suppose you did try to use TreeVisitor.visitPreOrder(), which would visit the nodes in the correct order for an output feature. How would you then, for instance, output the closing XML tag for a node? Once process() has executed, there's no going back.
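
One way around the closing-tag problem, sketched under assumptions: instead of a single pre-order hook, a recursive walk can call one hook before a node's children and another after them. `TagNode`, `EnterExitHandler`, and `EnterExitWalker` are hypothetical names for illustration and do not exist in JavaParser:

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for an AST node; this is not JavaParser's Node class.
final class TagNode {
    final String name;
    final List<TagNode> children;

    TagNode(String name, TagNode... children) {
        this.name = name;
        this.children = Arrays.asList(children);
    }
}

// Two hooks per node: one before its children are visited, one after.
interface EnterExitHandler {
    void enter(TagNode node, int depth);

    void exit(TagNode node, int depth);
}

final class EnterExitWalker {
    static void walk(TagNode node, int depth, EnterExitHandler handler) {
        handler.enter(node, depth);
        for (TagNode child : node.children) {
            walk(child, depth + 1, handler);
        }
        // Runs once the whole subtree is done, which is where a closing tag belongs.
        handler.exit(node, depth);
    }

    static String toXml(TagNode root) {
        StringBuilder out = new StringBuilder();
        walk(root, 0, new EnterExitHandler() {
            @Override
            public void enter(TagNode node, int depth) {
                indent(out, depth).append('<').append(node.name).append(">\n");
            }

            @Override
            public void exit(TagNode node, int depth) {
                indent(out, depth).append("</").append(node.name).append(">\n");
            }
        });
        return out.toString();
    }

    private static StringBuilder indent(StringBuilder out, int depth) {
        for (int i = 0; i < depth; i++) {
            out.append("    ");
        }
        return out;
    }
}
```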

matozoid · 2017-02-26T12:49:22Z

Okay, I don't think the current TreeStructureVisitor is the way to go. We don't care which node we're printing, so we don't need a method for each node type, so we could use a much simpler tree walking algorithm. If you try adding nodelist support to the visitor you'll also notice that there's no nice way to do so. It's okay to break the API here, check the changelog.

Here's something I was playing with a while ago: https://bitbucket.org/matozoid/serious - a library that introspects an object tree and prints it as you want. Sounds like what we want, except we don't need to track circular references, and we can use the metamodel instead of introspection. Oh, did you take a look at the metamodel yet?

Damn, maybe I shouldn't have tagged this as easy :-D

ryan-beckett · 2017-02-26T14:37:37Z

I'll take a look at the library and the model and get back to you. Thanks.

matozoid · 2017-02-27T20:15:28Z

@ryan-beckett here's the branch that outputs the test stuff seen above: https://github.com/matozoid/javaparser/commits/issue_754_more_structure_output

ryan-beckett · 2017-02-28T02:16:14Z

@matozoid, thanks. That's helpful. Your idea about the framework "hit" last night. I understand a bit better now. I haven't looked at the serious library yet, but I was already thinking that using reflection it should be as simple as writing an API that accepts a mapping of node type to string output, i.e.

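// A TreeStructureVisitor constructor that takes a map is the proposed API here, not an existing one.
Map<Class<? extends Node>, String[]> tags = new HashMap<>();
tags.put(MethodDeclaration.class, new String[] {"<MethodDeclaration>", "</MethodDeclaration>"});
tags.put(AnnotationDeclaration.class, new String[] {"<AnnotationDeclaration>", "</AnnotationDeclaration>"});
// ... one entry per remaining node type ...
new TreeStructureVisitor(tags);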

All the client would need to do is define a mapping for the output language instead of creating a full visitor implementation. Is this what you had in mind?

matozoid · 2017-02-28T09:12:51Z

Hmmm, what I was thinking is that for showing structure, it doesn't matter what node we're looking at. It matters what it is named, what its fields are, and what kind of fields they are (simple type, node, or nodelist.)

Anyway, I think I'm trying to steer you too much. Try implementing XML and simple text output, and put the reusable stuff in a generic class, then use that to implement some others. Keep the goal in mind: letting people study the AST. Good luck :)
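
A rough sketch of that node-agnostic view, assuming a hypothetical `FieldNode` model instead of JavaParser's metamodel. Each node is just a type name plus named fields, and each field is one of the three kinds mentioned above: a simple value, a node, or a node list. The printer never checks which node type it is looking at:

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical generic node: a type name plus named fields, kept in insertion order.
final class FieldNode {
    final String type;
    // Each value is a String (simple), a FieldNode (node), or a List of FieldNode (node list).
    final Map<String, Object> fields = new LinkedHashMap<>();

    FieldNode(String type) {
        this.type = type;
    }

    FieldNode with(String name, Object value) {
        fields.put(name, value);
        return this;
    }
}

final class StructureText {
    static String print(String name, FieldNode node) {
        StringBuilder out = new StringBuilder();
        print(name, node, 0, out);
        return out.toString();
    }

    private static void print(String name, FieldNode node, int depth, StringBuilder out) {
        indent(out, depth).append(name).append(" (").append(node.type).append(")\n");
        for (Map.Entry<String, Object> field : node.fields.entrySet()) {
            Object value = field.getValue();
            if (value instanceof FieldNode) {
                print(field.getKey(), (FieldNode) value, depth + 1, out);
            } else if (value instanceof List) {
                // Node list: print every element under the field name plus its index.
                List<?> items = (List<?>) value;
                for (int i = 0; i < items.size(); i++) {
                    print(field.getKey() + "[" + i + "]", (FieldNode) items.get(i), depth + 1, out);
                }
            } else {
                indent(out, depth + 1).append(field.getKey()).append(": ").append(value).append('\n');
            }
        }
    }

    private static StringBuilder indent(StringBuilder out, int depth) {
        for (int i = 0; i < depth; i++) {
            out.append('\t');
        }
        return out;
    }
}
```

Built by hand for `1 + 1`, this produces the same shape as the plain-text sample near the top of the thread; the `name[index]` notation for list elements is only one possible choice.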

matozoid · 2017-04-06T19:52:01Z

@ryan-beckett hey Ryan, are you still around?

ryan-beckett · 2017-04-06T19:56:43Z

@matozoid: Yup. Just needed a few weeks to clear my head and try to learn this code base. It's ironic you've messaged, considering I just pulled the project up in my browser. Makes me think you're alerted to who views the project. Anyways, I had planned on taking a stab at implementing the DOT format.

matozoid · 2017-04-06T20:19:38Z

Cool! Don't tell anyone about the hidden webcam streamer I put in JavaParser okay?

ryan-beckett · 2017-04-06T20:22:00Z

@matozoid: Lol, I won't.

ryan-beckett · 2017-06-17T00:45:06Z

@matozoid: I'm going to implement the others in the way you did the XXXDump.java files. It's the most straightforward way. IMO, patterning or abstracting any more is just overkill. It'll be:

...\ast\printer\XMLPrinter.java
...\ast\printer\JSONPrinter.java
.
.
.

matozoid · 2017-06-17T11:07:41Z

Okay, #964 was updated

ryan-beckett · 2017-06-18T02:30:12Z

Two questions:

Should the output be formatted, or left without unnecessary whitespace? I.e.:

<outer>
     <inner>
     </inner>
</outer>

vs

<outer><inner></inner></outer>

I know there are tools to format the output, but would it be such a bad idea for it to be done automatically?
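
For illustration, both layouts can come from the same recursion with a single flag. This is only a sketch under assumptions: `LayoutNode` and `XmlLayout` are hypothetical names, and the four-space indent is an arbitrary choice:

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for an AST node; this is not JavaParser's Node class.
final class LayoutNode {
    final String name;
    final List<LayoutNode> children;

    LayoutNode(String name, LayoutNode... children) {
        this.name = name;
        this.children = Arrays.asList(children);
    }
}

final class XmlLayout {
    // pretty == true gives one tag per line with indentation; false gives a single line.
    static String render(LayoutNode root, boolean pretty) {
        StringBuilder out = new StringBuilder();
        render(root, 0, pretty, out);
        return out.toString();
    }

    private static void render(LayoutNode node, int depth, boolean pretty, StringBuilder out) {
        String pad = pretty ? padding(depth) : "";
        String eol = pretty ? "\n" : "";
        out.append(pad).append('<').append(node.name).append('>').append(eol);
        for (LayoutNode child : node.children) {
            render(child, depth + 1, pretty, out);
        }
        out.append(pad).append("</").append(node.name).append('>').append(eol);
    }

    private static String padding(int depth) {
        StringBuilder pad = new StringBuilder();
        for (int i = 0; i < depth; i++) {
            pad.append("    ");
        }
        return pad.toString();
    }
}
```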

Which example will suffice in terms of output? I think the second is verbose, but are there cases where that verbosity is necessary? I feel like someone manually processing the XML might appreciate a simpler representation; on the other hand, I feel like I may be taking away helpful data. Better to have more and waste it than to not have enough.

<CompilationUnit>
	<ClassOrInterfaceDeclaration isInterface='false'>
		<SimpleName identifier='A'>
		</SimpleName>
		<MethodDeclaration>
			<BlockStmt>
				<ExpressionStmt>
					<VariableDeclarationExpr>
						<VariableDeclarator>
							<SimpleName identifier='a'>
							</SimpleName>
							<PrimitiveType type='INT'>
							</PrimitiveType>
						</VariableDeclarator>
						<VariableDeclarator>
							<SimpleName identifier='b'>
							</SimpleName>
							<ArrayType>
								<PrimitiveType type='INT'>
								</PrimitiveType>
							</ArrayType>
						</VariableDeclarator>
					</VariableDeclarationExpr>
				</ExpressionStmt>
			</BlockStmt>
			<VoidType>
			</VoidType>
			<SimpleName identifier='foo'>
			</SimpleName>
		</MethodDeclaration>
	</ClassOrInterfaceDeclaration>
</CompilationUnit>

<root type='CompilationUnit'>
	<types>
		<type type='ClassOrInterfaceDeclaration' isInterface='false'>
			<name type='SimpleName' identifier='A'>
			</name>
			<members>
				<member type='MethodDeclaration'>
					<body type='BlockStmt'>
						<statements>
							<statement type='ExpressionStmt'>
								<expression type='VariableDeclarationExpr'>
									<variables>
										<variable type='VariableDeclarator'>
											<name type='SimpleName' identifier='a'>
											</name>
											<type type='PrimitiveType' type='INT'>
											</type>
										</variable>
										<variable type='VariableDeclarator'>
											<name type='SimpleName' identifier='b'>
											</name>
											<type type='ArrayType'>
												<componentType type='PrimitiveType' type='INT'>
												</componentType>
											</type>
										</variable>
									</variables>
								</expression>
							</statement>
						</statements>
					</body>
					<type type='VoidType'>
					</type>
					<name type='SimpleName' identifier='foo'>
					</name>
				</member>
			</members>
		</type>
	</types>
</root>

matozoid · 2017-06-18T09:28:17Z

People request this to get an overview of the AST, just to examine it by eye. So formatting would make sense.
Phew, maybe read through the original issues to see what they were after. Both are nice (although the first is missing the property names, which would make it less uhhh... informative?)

ryan-beckett · 2017-06-18T13:37:49Z

Yeah, essentially the first is stripped of metadata. We'll go with the second then.

matozoid · 2017-06-18T14:32:02Z

I've put a preliminary XmlPrinter and JsonPrinter on master so people have something to play with, you should see them when you merge master.

ryan-beckett · 2017-06-18T18:20:23Z

Cool. I altered your code a little so that it prints indented content. I'll submit pull requests when I finish.

matozoid · 2017-09-10T12:50:17Z

Hey @ryan-beckett - is this one still assigned to you? It's been more than half a year now with not much progress...

ryan-beckett · 2017-09-10T16:29:52Z

You've done XML and JSON. Are you referring to the others?

matozoid · 2017-09-10T19:59:02Z

There's an ASCII art printer that's not done, plus the others. Your last message suggested you wanted to do more work on it? I pushed it into master because people wanted to use it, not because I thought it was finished.

ryan-beckett · 2017-09-10T21:55:26Z

If the XML and JSON versions are fine then I'm going to leave them be and work on the others. What exactly is an ASCII art printer?

matozoid · 2017-09-11T11:00:11Z

Alright, I'll mark them final then. The ASCII art one is something like this: https://cmatskas.com/generate-ascii-folder-structures-for-windows-with-tree/

If you think XML and JSON are all we need we can also close this issue.
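
For readers who have not seen that style, here is a small sketch of an ASCII tree renderer in the spirit of the Windows `tree` command. `AsciiNode` and `AsciiTree` are hypothetical names, and the branch characters are just one plain-ASCII choice:

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for an AST node; this is not JavaParser's Node class.
final class AsciiNode {
    final String label;
    final List<AsciiNode> children;

    AsciiNode(String label, AsciiNode... children) {
        this.label = label;
        this.children = Arrays.asList(children);
    }
}

final class AsciiTree {
    static String render(AsciiNode root) {
        StringBuilder out = new StringBuilder();
        out.append(root.label).append('\n');
        renderChildren(root, "", out);
        return out.toString();
    }

    private static void renderChildren(AsciiNode node, String prefix, StringBuilder out) {
        for (int i = 0; i < node.children.size(); i++) {
            AsciiNode child = node.children.get(i);
            boolean last = i == node.children.size() - 1;
            // The last child gets a corner; earlier children get a tee.
            out.append(prefix).append(last ? "\\-- " : "+-- ").append(child.label).append('\n');
            // Below a non-last child, keep a vertical bar so later siblings stay connected.
            renderChildren(child, prefix + (last ? "    " : "|   "), out);
        }
    }
}
```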

ryan-beckett · 2017-09-11T12:16:44Z

You can leave them open a little while longer. I don't mind taking a stab at them, but I had started working on 'pointing a Problem to a node'.

ryan-beckett · 2017-09-17T05:51:40Z

@matozoid: I feel like the sky is the limit with HTML output. What exactly are we trying to accomplish with it? I was thinking something like this -- a simple collapsible tree view. However, we need to use JavaScript. What are your thoughts? If we're going to do the HTML version, we might as well make it worthwhile, or just scrap it. I'm in favor of scrapping it honestly, because I really don't see a use case for it that XML/JSON/YAML doesn't already cover. Same goes for the ASCII art. Has someone actually requested this? Lol. The DOT format is worthwhile because it provides a diagramming aspect.
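
To make the diagramming point concrete, here is a minimal sketch of emitting Graphviz DOT from a tree. `DotNode` and `DotSketch` are hypothetical names, not the printer that ended up in JavaParser; every node gets a generated id and one edge to each child:

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for an AST node; this is not JavaParser's Node class.
final class DotNode {
    final String label;
    final List<DotNode> children;

    DotNode(String label, DotNode... children) {
        this.label = label;
        this.children = Arrays.asList(children);
    }
}

final class DotSketch {
    static String toDot(DotNode root) {
        StringBuilder out = new StringBuilder("digraph {\n");
        emit(root, new int[] {0}, out);
        return out.append("}\n").toString();
    }

    // Emits this node and its subtree; returns the id assigned to this node.
    private static int emit(DotNode node, int[] nextId, StringBuilder out) {
        int id = nextId[0]++;
        out.append("    n").append(id)
                .append(" [label=\"").append(escape(node.label)).append("\"];\n");
        for (DotNode child : node.children) {
            int childId = emit(child, nextId, out);
            out.append("    n").append(id).append(" -> n").append(childId).append(";\n");
        }
        return id;
    }

    // Escapes backslashes and double quotes so labels stay valid DOT strings.
    private static String escape(String text) {
        return text.replace("\\", "\\\\").replace("\"", "\\\"");
    }
}
```

The resulting text can be saved to a file and rendered with Graphviz, for example `dot -Tpng ast.dot -o ast.png`.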

matozoid · 2017-09-17T10:08:11Z

Skip the HTML then, this use case has more than enough code by now :-)

ASCII art was suggested by myself because it could be a quick way to output the tree in your console - but I think YAML should take care of that now, right?

ryan-beckett · 2017-09-17T15:30:27Z

Cool. I'm moving on.

matozoid · 2017-09-17T17:05:46Z

For completeness, I was also considering that #725 exists.

Not a hint to go work on that, by the way, it's not a core thing :-)

matozoid added the Improvement and Easy labels Feb 2, 2017

matozoid self-assigned this Feb 2, 2017

This was referenced Feb 3, 2017

Investigate code from @ducmle for TreeStructureVisitor #754

Closed

Add pre-in-post-order visiting to TreeVisitor #751

Closed

matozoid removed their assignment Feb 24, 2017

matozoid mentioned this issue Mar 5, 2017

[DRAFT] issue753: New formatting framework #811

Closed

matozoid removed the Easy label Mar 5, 2017

This was referenced Jun 15, 2017

GenericVisitorAdapter: print tree? #960

Closed

Issue 754 more structure output #964

Merged

matozoid added this to the next release milestone Sep 17, 2017

matozoid mentioned this issue Sep 17, 2017

Add some javadoc and remove deprecation #1144

Merged

matozoid closed this as completed in #1144 Sep 17, 2017
