Clean up Resolve.scala and related code to improve rigor and error reporting #2453

lihaoyi · 2023-04-23T05:48:48Z

This PR overhauls Resolve.scala and Resolve{Tasks,Metadata,Segments}.scala, which is some of the oldest and most poorly organized code in the repo. This gives us several things:

Improving maintainability and understandability: I can explain how the resolve logic works now, where I couldn't before. The way it called back and forth between RunScripts and Resolve* sub-classes and the Resolve superclass was crazy
Make the resolve behavior more predictable: the previous implementation had a lot of copy-pasty code, both within Resolve.scala and between Resolve{Tasks,Metadata,Segments}.scala, with subtle changes between them especially in error reporting. e.g. the code to handle _ and __ was copy-pasted 4-5 times. Now the implementations share much more code, without weird copy-pasty divergences, and should behave much more consistently
Fix a lot of old bugs: e.g. when resolution failed, the error message sometimes (but not always) had the segments in the backwards order. Passing in a query to Mill with a parse error would sometimes (but not always) just truncate it at the parse error. Brace expansion would fail to expand empty sections e.g. {,foo,bar}qux => qux fooqux barqux. These are now fixed, along with many other issues mostly around edge cases or error reporting
Greatly improve error reporting in the case of module initialization failure. These previously returned massive stacktraces from un-caught exceptions, and now return truncated short(ish) stacktraces with mostly the stack frames the user cares about it. The fact that the exceptions are caught also means we can properly unit test and integration test their contents
Improve the laziness of the module initialization process. Previously, we would initialize many more modules than we needed to while performing target resolution. Now we resolve a much tighter set:
1. When some portions of the module tree fail to initialize, you can still run queries and builds on the other parts
2. If you have a very large build.sc that takes a non-trivial amount of time to initialize all the modules, now you only initialize what you need to initialize depending on what command you are running

Major Changes

RunScript.scala, Resolve.scala, and Resolve{Tasks,Metadata,Segments}.scala have been mostly rewritten. Now there are three main components:
1. ResolveCore.scala, that performs resolution of everything-that-can-be-resolved, wrapped in ResolveCore.Resolved objects
2. ResolveNonEmpty.scala, which wraps ResolveCore.scala and standardizes the error reporting logic when resolution fails
3. Resolve.scala, which contains ResolveTasks, ResolveMetadata, and ResolveSegments that take the ResolveCore.Resolved objects and perform the minimal processing necessary for each use case
Improved the laziness of ResolveCore.scala.
1. A lot of previous calls to millModuleDirectChildren that previously would force all children to be instantiated have been replaced by more targeted APIs, e.g. .millModuleDirectChildren.collect { case b: RootModule => b } is now .reflectNestedObjects[RootModule](), .millModuleDirectChildren.find(_.millOuterCtx.segment == Segment.Label(singleLabel)) is now .reflectNestedObjects0[Module](namePred: String => Boolean), etc.
2. A bunch of the helpers in Module.scala were refactored to allow reflecting on members without invoking them, or specifying the name of the thing you want to reflect on to select it directly rather than having to list out and instantiate everything
3. Cross modules also had some tweaks in order to return MapViews instead of Maps, to allow instantiation of the individual Cross.Module instances to happen separately and on-demand
Greatly improved the strictness of error reporting of ResolveCore.scala. In particular, the places which can throw exceptions should be mostly wrapped in catchReflectException or catchNormalException and turned into Either[String, T]s

Minor Changes

I broke out ExpandBraces.scala and ExpandBracesTests.scala, so they're no longer mixed in with ParseArgs.scala. Mill's brace expansion happens textually in a totally separate phase before argument parsing runs, following how the Bash shell performs brace expansion, and so neither query parsing nor brace expansion need to know about each other.
Segments API was tweaked a bit to better suite the common usage patterns

Testing

MainTests was renamed ResolveTests, and now exercises the ResolveMetadata code path in addition to ResolveTasks
ResolveTests has a new section moduleInitError, which goes through a number of scenarios - three standalone modules, two dependent modules, partially and fully-broken cross module - and checks that module initialization failures in various places are properly caught and handled with truncated stack traces shown to users
Added a simple integration test integration/failure/module-init-error to exercise the error handling during module initialization end-to-end and make sure that truncated stack traces are shown to users
Added some more tests to ResolveTests.scala, covering the end-to-end handling of brace expansion, along with error reporting for double-nested modules (to reproduce a previous bug)

Notes

The API to task resolution used by most code internal and external should be mostly the same, the only minor change is from RunScript.resolveTasks(mill.main.ResolveFoo, ...) to ResolveFoo.resolve(...) due to the removal of one layer of indirection
I might have missed some spots where we need to catch exceptions to truncate/convert-to-either; the nature of exceptions means there's no way to statically guarantee you caught them all. But the tests should cover most use cases, and if we later notice some code paths still throwing uncaught exceptions it'll be easy to wrap them then
The laziness of module initialization can probably be improved further, e.g. we don't need to initialize modules at all for mill resolve unless there are cross modules whose keys we need to compute. Leaving that for future work

lefou

This is a great change! 👏 I myself got lost multiple times in the resolver code. Looks good to me.

lihaoyi added 30 commits April 23, 2023 13:03

add module-init-error test

3abfa6b

flesh out ModuleInitErrorTests

5cc0e35

fix

9794eaf

wip try to collapse Resolve into one impl

f06baed

ResolveTasks compiles again with new implementation, incomplete

00f0178

try to re-introduce resolve error reporting

718be72

wip

092120d

re-implement not-a-module error

d5a1d2e

wip

dfc4328

iwp

1d4b411

fixes

363e482

all MainTests pass

d35f58b

add doubleNestedModule tests, enable more tests

cd9c0ca

try and re-enable other uses of resolve

add13e1

all main.test passes

4b95cc4

break out ResolveNonEmpty

184dfcb

fix-compile

c07f8b0

scalafmt

fa085b8

cleanup

9e7a40b

cleanup

35cfef0

wip

4862477

split out ExpandBracesTests

b274839

cleanup

3e186c8

Merge branch 'main' into module-init-error

cea7c6c

fix

424a1a9

fixes

4669340

resolveTasks -> resolve

a229e4d

things compile again after trying to make reflection return an Either

7b74968

first few module initialization error unit tests pass

1655c51

comments

9881f0d

lihaoyi added 6 commits April 25, 2023 17:17

fix-compile

6c7bc42

cleanup

a4dfa70

cleanup

096e6c4

add dependency test cases, assert stack traces are shortish

a1fdd47

ensure output of resolution gets sorted

17fd69b

.

2e8bb12

lihaoyi changed the title ~~[WIP] Try to clean up Resolve.scala and related code~~ [WIP] Clean up Resolve.scala and related code to improve rigor and error reporting Apr 25, 2023

lihaoyi added 12 commits April 25, 2023 19:16

fox

c3b100d

cleanup

5071c2c

.

540ddf7

.

12c844f

cleanup

78f62b5

.

53b1914

Merge branch 'main' into module-init-error

695280a

try to fix getSimpleName on JDK8

09e7484

.

d9ca2f8

.

18d7a6d

.

6079a36

.

5f174bf

lihaoyi changed the title ~~[WIP] Clean up Resolve.scala and related code to improve rigor and error reporting~~ Clean up Resolve.scala and related code to improve rigor and error reporting Apr 26, 2023

lihaoyi requested review from lefou and lolgab April 26, 2023 00:47

lihaoyi marked this pull request as ready for review April 26, 2023 00:47

lefou approved these changes Apr 26, 2023

View reviewed changes

lihaoyi added 3 commits April 26, 2023 20:23

tweaks

fc6b17f

Merge branch 'main' into module-init-error

4df0166

.

6b5f726

lihaoyi merged commit 135d0fd into com-lihaoyi:main Apr 26, 2023

lefou added this to the 0.11.0-M9 milestone Apr 27, 2023

lefou mentioned this pull request Jun 6, 2023

millModuleDirectChildren is not respected when resolving #2573

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clean up Resolve.scala and related code to improve rigor and error reporting #2453

Clean up Resolve.scala and related code to improve rigor and error reporting #2453

lihaoyi commented Apr 23, 2023 •

edited

Loading

lefou left a comment

Clean up Resolve.scala and related code to improve rigor and error reporting #2453

Clean up Resolve.scala and related code to improve rigor and error reporting #2453

Conversation

lihaoyi commented Apr 23, 2023 • edited Loading

Major Changes

Minor Changes

Testing

Notes

lefou left a comment

Choose a reason for hiding this comment

lihaoyi commented Apr 23, 2023 •

edited

Loading