Code golf scoring #866

kostmo · 2022-11-15T01:22:06Z

It would be fun to keep track of the "best" solution in terms of code size.

The code size might be determined by summing all of the REPL executions plus all transitive inclusions of code (include or run).

To discourage short variable names, it may be best to use "symbol count" rather than "character count" as the metric.

The text was updated successfully, but these errors were encountered:

byorgey · 2022-11-15T01:36:45Z

symbol count

Or number of AST nodes.

byorgey · 2022-11-15T01:38:38Z

Personally, I enjoy trying to optimize character count, but tastes vary.

kostmo · 2022-11-15T01:49:01Z

trying to optimize character count

That can be fun, but I think there's more challenge in crafting a parsimonious program structure than shrinking variable names. I recognize appeal of finding out in which circumstances whitespace or semicolons are optional as well.

Perhaps we can track both AST node count and character count as metrics?

byorgey · 2022-11-15T02:28:07Z

Perhaps we can track both AST node count and character count as metrics?

Sure, why not? 😄 I agree, both are fun for different reasons. One is about finding elegant program structures, as you say, and one is about pushing a language to its limits.

xsebek · 2022-11-15T07:08:34Z

This sounds fun and should be easy to add to current best score. 👍

I think it needs an “automated mode” where you can not cancel your solution and start writing in REPL. 🤔
Once that computation finishes you have either won or failed (new state) and can leave or continue experimenting, now with REPL.

byorgey · 2022-11-15T10:50:25Z

I think it needs an “automated mode” where you can not cancel your solution and start writing in REPL.

I don't think I understand - what's the motivation?

xsebek · 2022-11-15T17:31:27Z

@byorgey well if you could continue in REPL, then an empty solution would be always shortest and then just finish it in REPL.

byorgey · 2022-11-15T17:51:23Z

I thought the idea was to count all things typed at the REPL plus anything run.

xsebek · 2022-11-15T18:00:08Z

But that way its always more efficient to type in REPL and step through.

For example the sheep solution would not have to wait for wool to drop, if the player waits for it.

I.e. the “solution” would not have to be full, which is OK, but different from other code golf challenges.

xsebek · 2022-11-15T18:03:51Z

Also if you wanted to count REPL, you would have to do it at each command because of run.

Counting once is much simpler and would not litter the codebase.

byorgey · 2022-11-15T20:40:03Z

Ah, I think I see your point now. It's also harder to share short solutions if they require typing multiple things at the REPL. I agree, to keep things simple and consistent we should probably just score a single .sw file (+ transitive imports) when counting length.

kostmo · 2022-11-16T01:41:35Z

Perhaps for now, one shall only be qualified for code golf scoring when using the --run CLI option. Any repl commands entered before the goal is reached shall disqualify the attempt.

xsebek · 2022-11-16T08:31:42Z

Maybe we could allow either —run or one REPL entry. 🤔

Then we could have states NoEntry, EntryWithLength and MoreEntries. If you finish the game with second state, it’s considered for shortest solution.

towards #866

kostmo · 2023-01-10T01:13:19Z

I wonder if it might be interesting to have constraints on the "stored code size" of a robot for a challenge. Perhaps by default all built robots have an "infinite memory bank", but we could define a "finite memory bank" entity. Perhaps possessing many of these entities avails the robot to some multiple of AST nodes.

towards #866

towards #866 NOTE: #1116 should be merged first so that the schema change of save files is less disruptive. ## Examples Different criteria can have their own best score: ![image](https://user-images.githubusercontent.com/261693/219904496-fcd23ca0-b208-43e1-afc6-188acfe327cf.png) All criteria share the same single best score: ![image](https://user-images.githubusercontent.com/261693/219904553-abe3011c-41b0-469c-b34d-95d84b91697a.png) ## Behavior notes * As currently designed, the code size will only be scored if the the player has specified their code **before** the scenario begins. Furthermore, any input into the REPL will invalidate code size scoring for the current game. * Because of this, the only way to score code so far is with a command-line argument of `--run` or `--autoplay`. However, #1010 shall implement code size scoring when a scenario is launched from the UI. * In the "best scores" display, if multiple "best score" criteria were all from the same game, they will be consolidated. If all criteria are for the same game, the criteria labels will be omitted. * The code size metrics will not be displayed if a "best score" was not obtained via `--run`. ## Caveats ### `run` command Currently, the code entailed in a `run "somescript.sw"` command is not transitively included, so using `run` make the code size score meaningless. ## Testing ### Unit tests Run the subset of unit tests: scripts/run-tests.sh --test-arguments '--pattern "Tests.Precedence"' ### Manual integration tests First, reset the score: rm -f ~/.local/share/swarm/saves/Tutorials_grab.yaml Saving the following to `grab-soln.sw`: ``` move; move; grab; turn back; move; turn back; move; move; grab; turn back; move; turn back; move; move; grab; turn back; move; turn back; move; move; grab; turn back; move; turn back; move; move; grab; turn back; move; turn back; move; move; grab; ``` Run as follows: scripts/play.sh --scenario Tutorials/grab.yaml --run grab-soln.sw This should establish a record for code size. Then, play the Grab tutorial and immediately paste and run this in the REPL: move; move; grab; move; grab; move; grab; move; grab; move; grab; move; grab; This solution is faster in terms of time, but should not displace the code-length record, since no code length should be recorded from a REPL solution.

kostmo added the Z-Feature A new feature to be added to the game. label Nov 15, 2022

kostmo added C-Moderate Effort Should take a moderate amount of time to address. S-Moderate The fix or feature would substantially improve user experience. G-Design An issue having to do with game design. labels Dec 10, 2022

kostmo mentioned this issue Jan 6, 2023

Challenge scenario requiring the use of a stack #932

Open

kostmo added a commit that referenced this issue Jan 7, 2023

Measure size of AST

3b82e3c

towards #866

kostmo mentioned this issue Jan 7, 2023

Record best code size #974

Merged

kostmo added a commit that referenced this issue Jan 7, 2023

Measure size of AST

7e00f61

towards #866

kostmo added a commit that referenced this issue Jan 7, 2023

Measure size of AST

4c74d91

towards #866

kostmo mentioned this issue Jan 10, 2023

Solution verification with random seeds #967

Open

kostmo added a commit that referenced this issue Feb 4, 2023

Measure size of AST

6c730df

towards #866

kostmo added a commit that referenced this issue Feb 5, 2023

Record best code size

9408fcd

towards #866

kostmo added a commit that referenced this issue Feb 6, 2023

Record best code size

48a6d29

towards #866

kostmo added a commit that referenced this issue Feb 7, 2023

Record best code size

04c483d

towards #866

kostmo added a commit that referenced this issue Feb 8, 2023

Record best code size

6278b99

towards #866

kostmo added a commit that referenced this issue Feb 11, 2023

Record best code size

6cf2d53

towards #866

kostmo added a commit that referenced this issue Feb 13, 2023

Record best code size

41658e9

towards #866

kostmo added the G-Scoring Keeping track of high scores label Feb 17, 2023

kostmo added a commit that referenced this issue Feb 18, 2023

Record best code size

1d1df51

towards #866

kostmo mentioned this issue Feb 25, 2023

Add timestamps to REPL input for deterministic replayability #1131

Open

kostmo added a commit that referenced this issue Feb 25, 2023

Record best code size

5da1dea

towards #866

kostmo mentioned this issue Apr 5, 2023

RPG scenarios #1197

Open

kostmo added a commit that referenced this issue Apr 25, 2023

Record best code size

2123802

towards #866

kostmo added a commit that referenced this issue May 2, 2023

Record best code size

5d9b9d1

towards #866

kostmo mentioned this issue Jun 6, 2023

Scenario launch options selection #1010

Merged

mergify bot closed this as completed in #1010 Jun 9, 2023

mergify bot closed this as completed in b382494 Jun 9, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Code golf scoring #866

Code golf scoring #866

kostmo commented Nov 15, 2022

byorgey commented Nov 15, 2022

byorgey commented Nov 15, 2022

kostmo commented Nov 15, 2022

byorgey commented Nov 15, 2022

xsebek commented Nov 15, 2022

byorgey commented Nov 15, 2022

xsebek commented Nov 15, 2022

byorgey commented Nov 15, 2022

xsebek commented Nov 15, 2022

xsebek commented Nov 15, 2022

byorgey commented Nov 15, 2022

kostmo commented Nov 16, 2022

xsebek commented Nov 16, 2022

kostmo commented Jan 10, 2023

Code golf scoring #866

Code golf scoring #866

Comments

kostmo commented Nov 15, 2022

byorgey commented Nov 15, 2022

byorgey commented Nov 15, 2022

kostmo commented Nov 15, 2022

byorgey commented Nov 15, 2022

xsebek commented Nov 15, 2022

byorgey commented Nov 15, 2022

xsebek commented Nov 15, 2022

byorgey commented Nov 15, 2022

xsebek commented Nov 15, 2022

xsebek commented Nov 15, 2022

byorgey commented Nov 15, 2022

kostmo commented Nov 16, 2022

xsebek commented Nov 16, 2022

kostmo commented Jan 10, 2023