Parse cookie pairs without a regex #81

devinivy · 2022-04-18T01:50:54Z

Using a simple parser rather than a regex is a more direct, durable, and performant way to collect cookie name-value pairs. I think that writing this logic out also makes it a bit clearer how we treat bad/invalid cookies. To confirm that the behavior remains identical to the regex-based parser, I added tests for a couple of additional edge cases.
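
For readers following along, here is a rough sketch of what parsing cookie pairs without a regex can look like. It is not the code merged in this PR: `parsePairs` is a hypothetical helper, and its choices (trimming names and values, stripping one pair of surrounding double quotes, skipping segments that have no `=` or an empty name) are assumptions made for illustration, not statehood's actual rules.

```javascript
'use strict';

// Hypothetical helper for illustration only, not statehood's parser.
// Walks the header once: split on ';', then on the first '=' in each segment.
const parsePairs = (header) => {

    const pairs = [];
    let start = 0;

    while (start < header.length) {

        // The current segment ends at the next ';' or at the end of the string
        let end = header.indexOf(';', start);
        if (end === -1) {
            end = header.length;
        }

        const segment = header.slice(start, end);
        const eq = segment.indexOf('=');

        // Assumption: segments without '=' or with an empty name are skipped
        if (eq > 0) {
            const name = segment.slice(0, eq).trim();
            let value = segment.slice(eq + 1).trim();

            // Assumption: strip quotes only when both the opening and closing quote are present
            if (value.length >= 2 && value.startsWith('"') && value.endsWith('"')) {
                value = value.slice(1, -1);
            }

            if (name) {
                pairs.push([name, value]);
            }
        }

        start = end + 1;
    }

    return pairs;
};

console.log(parsePairs('a=1; b="2"; c=3;'));
// [ [ 'a', '1' ], [ 'b', '2' ], [ 'c', '3' ] ]
```

Because each character is visited a bounded number of times, a loop like this has no backtracking to worry about, and the handling of malformed segments is visible in the code rather than implied by a pattern.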

kanongil

Just a few quick comments.

I like that this logic allows value pairs inside a double-quoted string to be found, like
a="b; c=d; e=". Yours will find a="b, c=d, e=", while the regex just finds a=b; c=d; e=.

Did you verify that it is actually faster?


devinivy · 2022-04-18T19:09:02Z

Thanks for the review @kanongil.

I agree with all your notes re: trim() and the ; SP delimiter, but these are more substantial behavioral changes to statehood than I'm hoping to make now. Our existing parser is "loose" in certain ways that I was trying to maintain, which is one reason it doesn't look identical to the parsers you'd find in express, fastify, or tough-cookie. I also have some questions about just how close to the spec we should keep. For example, it seems common in parsers across the node ecosystem to ignore a trailing semicolon, which wouldn't jibe with the ; SP parsing. These are good conversations that we should have, but I am hoping to defer them to future work and just maintain the behavior of the existing parser, within reason, as part of this work.
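
To make the trailing-semicolon point concrete, here is a small illustration using plain string splitting. It is only a sketch of the two delimiter policies, not statehood's code, and the variable names are made up for the example.

```javascript
'use strict';

const header = 'a=1; b=2;';

// Strict policy: split only on the exact "; " (semicolon, space) delimiter.
// The trailing ';' stays attached to the last pair.
const strict = header.split('; ');
console.log(strict);    // [ 'a=1', 'b=2;' ]

// Loose policy: split on ';', trim whitespace, and drop empty segments.
// The trailing ';' is ignored.
const loose = header.split(';')
    .map((segment) => segment.trim())
    .filter((segment) => segment.length > 0);
console.log(loose);     // [ 'a=1', 'b=2' ]
```

Under the strict policy the last value would come out as `2;` unless the parser rejects or separately handles the trailing semicolon, which is the tension described above.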

I did confirm it was faster, too! Here's a quick peek at the time for 100 random runs (x is a run with a given random cookie string, y is time).

Expand graph:

│                             •                                                                    
│                                                                                                  
│                                                                           •                      
│                                                                       •                          
│                                                                               •                  
│                                                                                                  
│                                                                                                  
│                                                                                                  
│                                                                                                  
│                                                                                                  
│                                                                                                  
│                                                                                •                 
│     •                                                                                            
│ •                                                                      •                         
│                     •                          ••                •         ••                    
│•     •     •                                                                                     
│                                                                                                  
│  •        •       •    •     ••         •   •                                                  • 
│   •   •         •     •        •  •  •   •                                                       
│    •   •      •      •    •           ••          •                      •                •     •
│         ••  •• • •      ••      •• ••     •  •                 •             •                   •
│                    •       •               •       • •  ••  • •     •              • •           
│                                               •  •  •     •  •  • •     •       •   • ••     ••  
│                                                       ••   •       •             ••     ••  •    
│                                                                      •                     •     
│                                                                                                  
│                                                                                                  
│                                                                                                  
│                                                                                                  
│*        *                                                                                        
│                                                                                                  
│          *                *                                                                      
│ *                          *                                                                     
│  ***   *  *    *                                                                                 
│     ***     *           **          *           *                   *                            
│            * ** ********    ********   * **   ** **  *****  * *      **  **    * * *  **  * **** 
│                                      ** *  ***     **     ** * *****   **  **** * * **  ** *    **
│                                                                                                  
┼──────────────────────────────────────────────────────────────────────────────────────────────────▶
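
A timing run along these lines could be reproduced with something like the sketch below. The script used for the graph above was not posted, so everything here is an assumption: `randomCookieHeader`, `timeRuns`, and the stand-in `splitParse` are hypothetical names, and the random header shape is arbitrary.

```javascript
'use strict';

const { performance } = require('perf_hooks');

// Hypothetical generator: a header of `count` random name=value pairs
const randomCookieHeader = (count) => {

    const pairs = [];
    for (let i = 0; i < count; ++i) {
        const value = Math.random().toString(36).slice(2);
        pairs.push(`k${i}=${value}`);
    }

    return pairs.join('; ');
};

// Times `parse` once on each of `runs` random headers; returns milliseconds per run
const timeRuns = (parse, runs) => {

    const timings = [];
    for (let i = 0; i < runs; ++i) {
        const header = randomCookieHeader(1 + Math.floor(Math.random() * 50));
        const before = performance.now();
        parse(header);
        timings.push(performance.now() - before);
    }

    return timings;
};

// Stand-in parser so the sketch runs on its own; swap in the parsers being compared
const splitParse = (header) => header.split(';').map((pair) => pair.trim().split('='));

const timings = timeRuns(splitParse, 100);
console.log(`runs: ${timings.length}, slowest: ${Math.max(...timings).toFixed(4)}ms`);
```

Single-call timings like these are noisy, so a fairer comparison would feed the same headers to both parsers and repeat each measurement several times.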

kanongil

Loose parsing is probably for the best.

The main behavioural change is the one I mentioned: new value pairs are exposed when a '"' is mismatched, which I would consider a bug fix. Maybe you can add a test for when they match as well? E.g. for a="; b=2; "; c=3. Here the regex returns a=; b=2; & c=3, while your logic would return something like a=", b=2 & "; c=3.

devinivy · 2022-04-18T21:18:58Z

Sounds good, I will take a look at that and at minimum add a test 👍

devinivy · 2022-04-20T02:37:37Z

Thank you to @watson, @jportner, and AlanBugz in conjunction with Kibana's bug bounty program for disclosing a susceptibility to ReDoS in statehood's cookie parser, now fixed by this patch and available on npm.

devinivy added 2 commits April 17, 2022 18:21

Parse cookie pairs directly without a regex

19ecc44

Tidy parser, fix lone quote case

ed58e1b

kanongil reviewed Apr 18, 2022

kanongil approved these changes Apr 18, 2022

Test parsing mismatching, paired quotes

4c440eb

devinivy added this to the 7.0.4 milestone Apr 20, 2022

devinivy self-assigned this Apr 20, 2022

devinivy merged commit 61937fa into master Apr 20, 2022

devinivy deleted the parse-sans-regex branch April 20, 2022 02:13

devinivy added the security (Issue with security impact) and bug (Bug or defect) labels Apr 20, 2022

kanongil mentioned this pull request Oct 18, 2022

Cookie without Value Parsing Regression in v7.0.4 #83

Open
