Runtime parser result debugger with frontend support #219

MF42-DZH · 2023-11-06T15:24:33Z

Summary

Adds a separate optional library to Parsley, parsley-debug. This library contains a runtime and public APIs for attaching a higher-level debugger for Parsley, to complement its lower-level debugger found in parsley.debug.

Public API

The public API is found (and documented) in parsley.debugger.combinator, where the basis method of this debugger (attachDebugger) lives. This method traverses the internal LazyParsley tree of a parser and attaches debugger nodes to every single node (even arbitrary dynamically-produced nodes from flatMap).

This produces a pair of a generator function for a parse tree (a DebugTree; a tree representing the path of execution the parser took, along with relevant results and success statuses), and a debugged parser, which needs to be run before the generator function is called.

The tree itself has a fixed set of methods for querying information, found in parsley.debugger.DebugTree.

For the convenience of the user, utilities are also provided in parsley.debugger.util.Collector, where parser names can be automatically collected from the field names found in a class (only for fields, not methods; use parsley.debugger.combinator.named for non-field parsers). However, that uses reflection via scala-reflect (!), which may need to be reworked (to use something else more cross-version compatible) or removed entirely.

An interface, parsley.debugger.frontend.DebugFrontend is also provided for defining custom processing frontends for the aforementioned DebugTree. One can traverse the tree there and do whatever they want to the tree (convert it to JSON, print it to console, display it in a GUI, etc.) as long as the entire tree is traversed. These can also be passed automatically (explicitly or via implicits) into the debugger attachment process, which can automate the processing of a tree from a parser as it runs.

Known Issues or Missing Features

Parsers that employ iteration do not collate individual iterations.
Creating new internal parsers may require additional methods in parsley.internal.deepembedding.frontend.debugger.DebugInjectingVisitorM.
As above, finding parser names automatically requires reflection. Perhaps we might want to consider an alternative for this (other than manually renaming the parsers) or remove it entirely if it becomes too much to maintain.
- Related to automatic parser name collection: def-generated parsers must be manually renamed.
Using the debugger accrues a large memory usage and speed penalty for Parsley (perhaps unavoidably).
This debugger requires an extra line added to a user's build.sbt, with parsley-debug as the artifact name, with the same version as parsley (perhaps this is not an issue).

MF42-DZH · 2023-11-06T18:19:06Z

Critical roadblock at the moment is the usage of WeakHashMap to prevent memory leaks for flatMap-based parsers, as they are not available in ScalaJS (and possibly not in Scala Native either).

MF42-DZH · 2023-11-06T20:29:48Z

Added XWeakMap for platform-dependent unified weak hash map implementation, but the JS implementation is hand-rolled, and may contain mistakes. WeakMap from ECMA 6 would be nice to have instead of this hand-rolled implementation.

MF42-DZH · 2023-11-06T22:37:52Z

TODO

Factor out the concrete implementations of XWeakMap to reduce code duplication.
- While doing this, also increase the amount of times stale entries are expunged on the JS implementation.
- While we're at it, merge the dummy XCollector implementations too.
Re-merge the 2.13 and 3 implementations again at some point for XWeakMap.
Complete documentation for public API.

Currently implemented as a rose tree, with a mutable version and a frozen immutable version for safer analysis.

…g and testing

-- TODO -- > hook up the parse-time internals to the tree, so it can be populated when the parser runs > clean up the names of symbolic classes such as <|>

-- TODO -- > Fix residual check when parser being debugged fails. > Fix the lack of tree objects caused by reconstruct not fully traversing the tree of parsers.

-- TODO -- > Fix residual check when parser being debugged fails. > Fix the lack of tree objects caused by reconstruct not fully traversing the tree of parsers. > Remove handler from handler stack if parser being debugged fails.

-- TODO -- > Fix the lack of tree objects caused by something unknown?

children, broke entire parser stack flow. -- TODO -- > Investigate the failure chain and figure out a way to stop the check stack and debug context stack from being emptied out. > Fix the lack of tree objects caused by something unknown?

This reverts commit 84fb89c.

is built and can be mostly traversed -- TODO -- > Fix the fact that successful parses leave handlers on the handler stack. > Fix the lack of tree objects caused by lack of attached Debugged combinators.

tree is now fully populated -- TODO -- > Test deeper, more complex parsers to ensure that nothing actually gets lost. > Test recursive parsers to see if the infinite-recursion-stopping mechanics function as intended. > Make a simple printout view for the debug tree into the console for when the user does not want to use any of the GUI options.

order in the parser trees -- TODO -- > Test deeper, more complex parsers to ensure that nothing actually gets lost. > Test recursive parsers to see if the infinite-recursion-stopping mechanics function as intended. > Make a simple printout view for the debug tree into the console for when the user does not want to use any of the GUI options.

-- TODO -- > Test deeper, more complex parsers to ensure that nothing actually gets lost. > Test recursive parsers to see if the infinite-recursion-stopping mechanics function as intended. > Make a simple printout view for the debug tree into the console for when the user does not want to use any of the GUI options.

input for a parse attempt -- TODO -- > Make a simple printout view for the debug tree into the console for when the user does not want to use any of the GUI options.

to make looking at the tree simpler -- TODO -- > Make a simple printout view for the debug tree into the console for when the user does not want to use any of the GUI options.

finder just in case -- TODO -- > Re-use our debugged parsers when the parser is cyclic. > Make a simple printout view for the debug tree into the console for when the user does not want to use any of the GUI options.

cyclic to allow better renaming of parsers during debugger runtime -- TODO -- > Make a simple printout view for the debug tree into the console for when the user does not want to use any of the GUI options.

fix: GUI now executes correctly without redundant ops

- flattened packages to make the public API smoother - renamed some packages to remove pluralities for consistency - improved documentation for public API, especially for the debug combinators - cleared context with each parser run so debugged parsers can be re-used freely as long as the trees are saved or discarded appropriately between runs

attachDebuggerGUI now match - also fixed instruction generator to more closely match an iterative parser's generated instructions

to avoid reversing the ListMaps used

case it is needed -- TODO -- > Give the parse attempts the position info for renderers to take advantage of.

origin info instead of reconstructed info -- TODO -- > Give the parse attempts the position info for renderers to take advantage of.

-- TODO -- > Give the parse attempts the position info for renderers to take advantage of.

remaining issues: - `ParseAttempt` object public or private - `Lexer` private objects with parsers - possibly reintroduce `XMap` or similar mechanism

- prevents potential future binary incompatibilities

- Future versions of the JVM will restrict `setAccessible`'s abilities, and will render this collector eventually unusable.

- remove stale comment from AddAttemptAndLeave - remove types where obvious

- MapAddAll is internal only now - apply uses partial since code paths are the same - addNames now has a separate overload for IterableOnce of pairs of parsers and names

Scala 2.12 does not have IterableOnce ???

- Added zipWith and zipWith3 to `ContOps` to facilitate

armanbilge · 2023-11-27T20:36:22Z

WeakMap from ECMA 6 would be nice to have instead of this hand-rolled implementation.

If that's all you need, it is really only a few lines of code.

import scala.scalajs.js
import scala.scalajs.js.annotation.JSGlobal

@js.native
@JSGlobal
class WeakMap[K, V] extends js.Object {
  def delete(key: K): Boolean = js.native
  def get(key: K): js.UndefOr[V] = js.native
  def has(key: K): Boolean = js.native
  def set(key: K, value: V): this.type = js.native
}

On the other hand, if you need more APIs than that then it's more work. For example an IterableWeakMap is available here:
https://github.com/typelevel/cats-effect/blob/31ad01e798133f9bc62e043aac2e7709d4ba017d/core/js/src/main/scala/cats/effect/unsafe/IterableWeakMap.scala

j-mie6 · 2023-11-27T20:50:27Z

@armanbilge raises another interesting point. Other than the lack of scala-js implementation, we could use the java API instead of the scala one and sidestep the cross-version collections API problem (and just suffer the java API instead)...

MF42-DZH · 2023-11-27T21:21:08Z

WeakMap from ECMA 6 would be nice to have instead of this hand-rolled implementation.

If that's all you need, it is really only a few lines of code.
import scala.scalajs.js
import scala.scalajs.js.annotation.JSGlobal

@js.native
@JSGlobal
class WeakMap[K, V] extends js.Object {
  def delete(key: K): Boolean = js.native
  def get(key: K): js.UndefOr[V] = js.native
  def has(key: K): Boolean = js.native
  def set(key: K, value: V): this.type = js.native
}
On the other hand, if you need more APIs than that then it's more work. For example an IterableWeakMap is available here: https://github.com/typelevel/cats-effect/blob/31ad01e798133f9bc62e043aac2e7709d4ba017d/core/js/src/main/scala/cats/effect/unsafe/IterableWeakMap.scala

For an implementation as simple as this is, you have to wonder why the ScalaJS team didn't decide to just add this to the standard library for it (even if it does not conform directly to Map[K, V])...

(For what it's worth, this API is sufficient for the task at hand. The ability to iterate over the map isn't used in the internal logic that needs the weak map.)

armanbilge · 2023-11-27T21:29:21Z

you have to wonder why the ScalaJS team didn't decide to just add this to the standard library for it

Because nobody got around to it yet? :) I don't see any issues open about it, and they accepted a similar PR recently in scala-js/scala-js#4366.

MF42-DZH · 2023-11-27T21:30:43Z

Because nobody got around to it yet? :) I don't see any issues open about it, and they accepted a similar PR recently in scala-js/scala-js#4366.

I think I recall seeing this PR when looking for WeakMap specifically, all those weeks ago

Source: j-mie6#219 (comment)

- Finally unified the Scala 2.12, 2.13 and 3.3 maps at last.

- same reasoning as in debugger, stops ephemeral parsers from flooding the heap

MF42-DZH · 2023-11-27T22:26:19Z

@armanbilge raises another interesting point. Other than the lack of scala-js implementation, we could use the java API instead of the scala one and sidestep the cross-version collections API problem (and just suffer the java API instead)...

This is a possibility if the interfaces are exposed to ScalaJS (Scala Native probably has them, honestly).

(At least they'd be stable, the one saving grace about Java.)

j-mie6 · 2023-11-27T22:28:38Z

When scala-js implements the java one, it'll just work out for us (and Arman notes this may be more efficient than the scala implementation if optimisations are applied per platform). In the mean time we just need a java API facade over the top of the actual java API and make XMap... Javaish

MF42-DZH · 2023-11-28T07:53:53Z

When scala-js implements the java one, it'll just work out for us (and Arman notes this may be more efficient than the scala implementation if optimisations are applied per platform). In the mean time we just need a java API facade over the top of the actual java API and make XMap... Javaish

It seems to compile and run (using java.util.Map and java.util.AbstractMap) without throwing an error at runtime (like with some other Java classes), too.

MF42-DZH · 2023-11-29T09:28:40Z

For Java-ifying XMap, I have this so far on a separate branch: coffeeification:debugger/internal/facade/XJavaMap.scala, but it currently fails to compile (compiler states no ClassTag found for a lot of the generic variables).

The direct translations of Java's <? extends K, ? extends V> type parameters are frankly not great to deal with ([_ <: K, _ <: V]).

There is also the issue that the DebugTree trait expects nodeChildren to be a Scala Map, and not a Java map, so that API will end up being affected by this as well (see comment in file about Scala's collection converters):

parsley/parsley-debug/shared/src/main/scala/parsley/debugger/DebugTree.scala

Line 54 in 4cb81b5

def nodeChildren: Map[String, DebugTree]

MF42-DZH · 2023-11-30T13:20:14Z

There is one avenue to be explored, and that is manually defining a simpler Java API for some kind of "invariant map" in raw Java (that is then implemented within Scala instead).

Now the issue is, would that be cross-platform?

* feat: made `X...Exception`s package private, added `XUnsupportedOperationException` * style: add license headers

j-mie6 · 2023-12-21T14:28:49Z

Ok are we ready to merge this for now? This will give us the debugger backends right, but we still need to figure out where to put the frontends (or rather, transfer them under my domain and get them ready for publishing and deploy)

MF42-DZH · 2023-12-21T15:00:36Z

It should be ready to merge. It'll give just the backend (and whatever changes to Parsley were necessary for it), the frontends still live in a separate repository on my account.

MF42-DZH and others added 27 commits November 7, 2023 17:58

feat: created parsleyDebug project and skeleton packages

5266615

feat: added debug tree

6606d6f

Currently implemented as a rose tree, with a mutable version and a frozen immutable version for safer analysis.

fix: added parsleyDebug to root aggregate project for easier compilin…

760e355

…g and testing

feat: added basic downwards traversal of parsers to construct tree

8dcd536

-- TODO -- > hook up the parse-time internals to the tree, so it can be populated when the parser runs > clean up the names of symbolic classes such as <|>

license: added copyright declaration comments

420fde1

feat: added simple name translation for symbolic operator names

b897f8c

feat: added barely working debug internals

dda62eb

-- TODO -- > Fix residual check when parser being debugged fails. > Fix the lack of tree objects caused by reconstruct not fully traversing the tree of parsers.

temp: removed debug print from rebuildMasterTree

8fface9

-- TODO -- > Fix residual check when parser being debugged fails. > Fix the lack of tree objects caused by reconstruct not fully traversing the tree of parsers. > Remove handler from handler stack if parser being debugged fails.

fix: LeaveParser now correctly handles failure cases for parsers.

acf106c

-- TODO -- > Fix the lack of tree objects caused by something unknown?

breaking: Attempted to fix the lack of tree

8da8dba

children, broke entire parser stack flow. -- TODO -- > Investigate the failure chain and figure out a way to stop the check stack and debug context stack from being emptied out. > Fix the lack of tree objects caused by something unknown?

Revert "breaking: Attempted to fix the lack of tree"

3bcb63a

This reverts commit 84fb89c.

fix: revert to partially working version, now tree

3d443b2

is built and can be mostly traversed -- TODO -- > Fix the fact that successful parses leave handlers on the handler stack. > Fix the lack of tree objects caused by lack of attached Debugged combinators.

fix: successful parsers no longer request for too much

0b5b8c4

input for a parse attempt -- TODO -- > Make a simple printout view for the debug tree into the console for when the user does not want to use any of the GUI options.

feat: added simple renaming of parsers in the output

e5f411b

to make looking at the tree simpler -- TODO -- > Make a simple printout view for the debug tree into the console for when the user does not want to use any of the GUI options.

fix: Added Parsley as a type check in the field / method

d866b38

finder just in case -- TODO -- > Re-use our debugged parsers when the parser is cyclic. > Make a simple printout view for the debug tree into the console for when the user does not want to use any of the GUI options.

fix: Added the ability to re-use parsers even if

48b07e3

cyclic to allow better renaming of parsers during debugger runtime -- TODO -- > Make a simple printout view for the debug tree into the console for when the user does not want to use any of the GUI options.

feat: added console pretty-printing

d65e4da

fix: GUI now executes correctly without redundant ops

fix: trees produced by attachDebugger and

b4a4886

attachDebuggerGUI now match - also fixed instruction generator to more closely match an iterative parser's generated instructions

fix: used foldRight instead of foldLeft

f0d8742

to avoid reversing the ListMaps used

refactor: swapped ConsoleGUI for a singleton

f199612

feat: gave the renderer method the full input in

001a2dc

case it is needed -- TODO -- > Give the parse attempts the position info for renderers to take advantage of.

fix: debugged wrapper now correctly returns

3a9bdbb

origin info instead of reconstructed info -- TODO -- > Give the parse attempts the position info for renderers to take advantage of.

test: modified rename case slightly

63f9523

-- TODO -- > Give the parse attempts the position info for renderers to take advantage of.

MF42-DZH added 5 commits November 26, 2023 16:45

fix: fix basic issues from recent review

b507b78

remaining issues: - `ParseAttempt` object public or private - `Lexer` private objects with parsers - possibly reintroduce `XMap` or similar mechanism

fix: ParseAttempt's object now private to parsley

c43752c

- prevents potential future binary incompatibilities

delete: XCollectorJava removed, as reflection is too fragile

a578fd2

- Future versions of the JVM will restrict `setAccessible`'s abilities, and will render this collector eventually unusable.

fix: asserts converted to XAssert.assert where applicable

a44e3c8

feat: re-add XMap, turns out it was needed

b045dc8

j-mie6 approved these changes Nov 26, 2023

View reviewed changes

MF42-DZH added 4 commits November 26, 2023 19:35

style: typing and comments

f8b8b2d

- remove stale comment from AddAttemptAndLeave - remove types where obvious

refactor: Rename changes

aa328ce

- MapAddAll is internal only now - apply uses partial since code paths are the same - addNames now has a separate overload for IterableOnce of pairs of parsers and names

fix: IterableOnce -> Iterable

b776c8d

Scala 2.12 does not have IterableOnce ???

optimise: replace for...yield with zipWith and friends

32072a5

- Added zipWith and zipWith3 to `ContOps` to facilitate

MF42-DZH added 5 commits November 27, 2023 21:53

feat: replace XWeakMapImpl with native facade

daa5229

Source: j-mie6#219 (comment)

feat: introduce XMutMap for mutable map cross-compatibility

7c43b11

- Finally unified the Scala 2.12, 2.13 and 3.3 maps at last.

feat: replace HashMap with XWeakMap in rename

8aea5ad

- same reasoning as in debugger, stops ephemeral parsers from flooding the heap

license: add license link for WeakMap facade in comment

586c7a5

workflow: when will I stop forgetting to run prePR

4cb81b5

Unified exceptions into one file per platform. (#8)

6cd46d7

* feat: made `X...Exception`s package private, added `XUnsupportedOperationException` * style: add license headers

Merge branch 'master' into dev

2539d6e

j-mie6 merged commit 0961dab into j-mie6:master Dec 21, 2023
15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Runtime parser result debugger with frontend support #219

Runtime parser result debugger with frontend support #219

MF42-DZH commented Nov 6, 2023 •

edited

Loading

MF42-DZH commented Nov 6, 2023

MF42-DZH commented Nov 6, 2023

MF42-DZH commented Nov 6, 2023 •

edited

Loading

armanbilge commented Nov 27, 2023

j-mie6 commented Nov 27, 2023

MF42-DZH commented Nov 27, 2023 •

edited

Loading

armanbilge commented Nov 27, 2023

MF42-DZH commented Nov 27, 2023

MF42-DZH commented Nov 27, 2023

j-mie6 commented Nov 27, 2023

MF42-DZH commented Nov 28, 2023

MF42-DZH commented Nov 29, 2023 •

edited

Loading

MF42-DZH commented Nov 30, 2023

j-mie6 commented Dec 21, 2023

MF42-DZH commented Dec 21, 2023

Runtime parser result debugger with frontend support #219

Runtime parser result debugger with frontend support #219

Conversation

MF42-DZH commented Nov 6, 2023 • edited Loading

Summary

Public API

Known Issues or Missing Features

MF42-DZH commented Nov 6, 2023

MF42-DZH commented Nov 6, 2023

MF42-DZH commented Nov 6, 2023 • edited Loading

TODO

armanbilge commented Nov 27, 2023

j-mie6 commented Nov 27, 2023

MF42-DZH commented Nov 27, 2023 • edited Loading

armanbilge commented Nov 27, 2023

MF42-DZH commented Nov 27, 2023

MF42-DZH commented Nov 27, 2023

j-mie6 commented Nov 27, 2023

MF42-DZH commented Nov 28, 2023

MF42-DZH commented Nov 29, 2023 • edited Loading

MF42-DZH commented Nov 30, 2023

j-mie6 commented Dec 21, 2023

MF42-DZH commented Dec 21, 2023

MF42-DZH commented Nov 6, 2023 •

edited

Loading

MF42-DZH commented Nov 6, 2023 •

edited

Loading

MF42-DZH commented Nov 27, 2023 •

edited

Loading

MF42-DZH commented Nov 29, 2023 •

edited

Loading