
Refactor: Separate RESP Command Parsing and Execution #164

Merged · 30 commits into microsoft:main from lmaas/parsing-refactor · Apr 2, 2024

Conversation

@lmaas (Contributor) commented Mar 27, 2024

This PR is a first step towards a more maintainable RESP parser design.

Overview

This PR tries to separate command parsing and execution by making the following changes:

  • All parsing has been unified in a single ParseCommand() function with clear semantics. If parsing is successful, the function returns the RespCommand ID, optionally a subcommand ID (if one was parsed), and the number of remaining elements in the RESP array (i.e., the number of command arguments).
  • Every command has been assigned an ID in the RespCommand enum.

Additionally, the parsing code now enforces the following semantics for the parsing function and all operators (a minimal sketch of the resulting shape follows the list):

  1. Parsing advances the read head to the first string after the parsed command and (optional) subcommand.
  2. All operators read from the read head, removing the previous inconsistency between reading from raw pointers and reading from the read head.
  3. count represents the number of arguments remaining in the current RESP package.
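
A minimal, illustrative sketch of that shape is shown below. The trimmed-down enum, the method name, and the span-based signature are assumptions for illustration only, not the actual Garnet code, and input validation is omitted:

```csharp
using System;

// Hypothetical subset of command IDs; Garnet's actual RespCommand enum covers every command.
public enum RespCommand : byte { NONE, PING, GET, SET }

public static class RespParserSketch
{
    // Parses the start of a RESP array ("*<n>\r\n$<len>\r\n<NAME>\r\n...") far enough
    // to identify the command. Returns the command ID; 'count' is the number of
    // arguments remaining after the command name, and 'consumed' is how far the
    // read head advanced (i.e., it now points at the first argument).
    public static RespCommand ParseCommand(ReadOnlySpan<byte> buffer, out int count, out int consumed)
    {
        count = 0; consumed = 0;
        if (buffer.Length == 0 || buffer[0] != (byte)'*') return RespCommand.NONE;

        // Array header: number of elements in this RESP array.
        int i = 1, elements = 0;
        while (buffer[i] != (byte)'\r') elements = elements * 10 + (buffer[i++] - '0');
        i += 2; // skip \r\n

        // First bulk string: the command name.
        i++; // skip '$'
        int len = 0;
        while (buffer[i] != (byte)'\r') len = len * 10 + (buffer[i++] - '0');
        i += 2; // skip \r\n
        var name = buffer.Slice(i, len);

        consumed = i + len + 2;   // read head is now at the first argument
        count = elements - 1;     // arguments remaining in the current RESP array

        if (name.SequenceEqual("PING"u8)) return RespCommand.PING;
        if (name.SequenceEqual("GET"u8)) return RespCommand.GET;
        if (name.SequenceEqual("SET"u8)) return RespCommand.SET;
        return RespCommand.NONE;
    }
}
```

Subcommand lookup and partial-message handling are omitted here; the real parser also returns a subcommand byte and advances the session's read head rather than reporting a consumed count.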

Separating parsing from command execution will allow easier integration with components such as ACLs and make it easier to replace the parsing logic. In addition, it allows us to properly identify potential bottlenecks in the current parser.

Known Caveats

  • Currently all parsing functions return (RespCommand, byte), but most commands still do their own subcommand parsing inside the command executors. This will change in future parser updates (see the dispatch sketch after this list).
  • There are inconsistencies in the interfaces of the different command executors. Eventually we will need to converge on a standardized interface.
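
As a companion to the parsing sketch above, here is a hedged illustration of the dispatch side, reusing the hypothetical RespCommand values from that sketch. The executor methods and the ACL hook are placeholders, not Garnet's actual executor interface; the point is that with parsing finished up front, a check such as an ACL lookup can run on the command ID before any executor touches the arguments:

```csharp
using System;

public static class RespDispatchSketch
{
    // Hypothetical ACL hook: permission checks can run on the parsed command ID
    // before any executor code is invoked.
    static bool AclAllows(RespCommand cmd, byte subCmd) => cmd != RespCommand.NONE;

    // Hypothetical executors with a uniform (args, count) shape; as noted above,
    // the real executors do not yet share one interface.
    static bool ExecutePing(ReadOnlySpan<byte> args, int count) { Console.WriteLine("+PONG"); return true; }
    static bool ExecuteGet(ReadOnlySpan<byte> args, int count)  { Console.WriteLine("$-1");   return true; }

    public static bool ProcessCommand(RespCommand cmd, byte subCmd, ReadOnlySpan<byte> args, int count)
    {
        if (!AclAllows(cmd, subCmd))
        {
            Console.WriteLine("-NOPERM");
            return true;
        }
        return cmd switch
        {
            RespCommand.PING => ExecutePing(args, count),
            RespCommand.GET  => ExecuteGet(args, count),
            _ => false, // unknown or unhandled command
        };
    }
}
```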

@lmaas lmaas requested review from yrajas, badrishc, TalZaccai and vazois and removed request for badrishc, TalZaccai and vazois March 28, 2024 00:30
@aromaa (Contributor) commented Mar 28, 2024

When working on #119 I ran into the problem that the heavy usage of MemoryMarshal.Read would hit the inlining budget, where the JIT refuses to inline any more methods no matter what (it even starts to ignore AggressiveInlining) toward the end of the method. I worked around this by implementing a simpler (and more unsafe) version of the method to reduce the IL size and the folding the JIT needs to do. These limits are increased on .NET 8 compared to .NET 6. Did you happen to check the assembly to confirm that the JIT actually inlines all these calls?
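
For readers unfamiliar with the pattern, the following is a hedged illustration of the kind of substitution described; it is not the actual change from #119, and the type and method names are invented. MemoryMarshal.Read validates the span length at every call site, whereas a raw unaligned read over a caller-guaranteed span keeps the IL smaller and cheaper for the inliner:

```csharp
using System;
using System.Runtime.CompilerServices;
using System.Runtime.InteropServices;

public static class FixedSizeReadSketch
{
    // Safe read: includes a length check, which adds IL at every call site and
    // counts against the caller's inlining budget.
    public static ulong ReadChecked(ReadOnlySpan<byte> src)
        => MemoryMarshal.Read<ulong>(src);

    // Leaner read: the caller must guarantee at least 8 readable bytes; the
    // length check disappears and the method body stays tiny.
    [MethodImpl(MethodImplOptions.AggressiveInlining)]
    public static ulong ReadUnchecked(ReadOnlySpan<byte> src)
        => Unsafe.ReadUnaligned<ulong>(ref MemoryMarshal.GetReference(src));
}
```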

@badrishc (Contributor) commented:

Hi @aromaa, we really appreciate your expertise with JIT and inlining to help make this work more optimized, thanks!

@lmaas (Contributor, Author) commented Mar 28, 2024

> When working on #119 I ran into the problem that the heavy usage of MemoryMarshal.Read would hit the inlining budget, where the JIT refuses to inline any more methods no matter what (it even starts to ignore AggressiveInlining) toward the end of the method. I worked around this by implementing a simpler (and more unsafe) version of the method to reduce the IL size and the folding the JIT needs to do. These limits are increased on .NET 8 compared to .NET 6. Did you happen to check the assembly to confirm that the JIT actually inlines all these calls?

This is great insight! We have not checked yet if the inlining is happening correctly. This should definitely go on our to-do list.

I think we'll need a few optimization passes on this, especially on the core loop. So far, we have not tuned the parsing loop much and I am sure there is a lot of room for improvement. We'd definitely appreciate your help with this!

@badrishc (Contributor) commented:

> When working on #119 I ran into the problem that the heavy usage of MemoryMarshal.Read would hit the inlining budget, where the JIT refuses to inline any more methods no matter what (it even starts to ignore AggressiveInlining) toward the end of the method. I worked around this by implementing a simpler (and more unsafe) version of the method to reduce the IL size and the folding the JIT needs to do. These limits are increased on .NET 8 compared to .NET 6. Did you happen to check the assembly to confirm that the JIT actually inlines all these calls?

How do you check the assembly for this? Any pointers?

@PaulusParssinen (Contributor) commented Mar 28, 2024

> How do you check the assembly for this? Any pointers?

I recommend using the excellent VS extension Disasmo, written by a member of the .NET JIT team. Warning: it's pretty addicting.

(attached video: garnet_disasmo.mp4)

I believe you need to clone and build the .NET runtime locally in order to use the print-inlinees feature to see the JIT's inlining decisions. You can also toggle the JitDump option to see every stage of JIT compilation; it's pretty fascinating.

If VS is not available to you, you can follow the .NET runtime guide to get the basic JitDisasm output manually, using dotnet run with the DOTNET_JitDisasm=MethodName environment variable. However, to get the FullOpts output (like in Disasmo), you have to wait for Tier 1 to kick in.

As JIT developers often say, a developer should mark methods with MethodImpl.AggressiveInlining only rarely and for good reason, as the JIT should ideally be doing the right thing™️ already with its heuristics. By using AggressiveInlining, you eat into the JIT's inlining and other optimization budgets.

It's very hard to find the right balance between JIT throughput, code size, and the resulting inlined assembly, which depends very much on call sites and other factors. That is probably why people experienced in optimizing low-level code like this keep telling you to measure rather than assume something is faster.

Here's a random example (there are more) where removing AggressiveInlining made things much faster 😅: dotnet/runtime#81565

Resolved review threads:
  • libs/server/Resp/RespServerSession.cs (outdated)
  • libs/server/Resp/RespCommand.cs
  • libs/server/Resp/RespCommand.cs (outdated)
  • libs/server/Resp/RespServerSession.cs
  • libs/server/Resp/AdminCommands.cs (outdated)
  • libs/server/Resp/AdminCommands.cs
@lmaas lmaas marked this pull request as draft March 30, 2024 21:37
@lmaas lmaas marked this pull request as ready for review April 1, 2024 20:55
@TalZaccai TalZaccai merged commit 4cba6ff into microsoft:main Apr 2, 2024
21 checks passed
@lmaas lmaas deleted the lumaas/parsing-refactor branch April 2, 2024 19:30
@github-actions github-actions bot locked and limited conversation to collaborators Jun 2, 2024