SafeRe is vulnerable to ReDoS #2757

gqgs · 2021-07-13T02:03:19Z

Step 1: Please describe your environment

ZeroNet version: 0.7.2 (4555)

Step 2: Describe the problem:

"To avoid the ReDoS algorithmic complexity attack" the function bellow is used to validate user defined regular expressions.

ZeroNet/src/util/SafeRe.py

Lines 10 to 22 in 454c0b2

    
           def isSafePattern(pattern): 
        
               if len(pattern) > 255: 
        
                   raise UnsafePatternError("Pattern too long: %s characters in %s" % (len(pattern), pattern)) 
        
               unsafe_pattern_match = re.search(r"[^\.][\*\{\+]", pattern)  # Always should be "." before "*{+" characters to avoid ReDoS 
        
               if unsafe_pattern_match: 
        
                   raise UnsafePatternError("Potentially unsafe part of the pattern: %s in %s" % (unsafe_pattern_match.group(0), pattern)) 
        
               repetitions = re.findall(r"\.[\*\{\+]", pattern) 
        
               if len(repetitions) >= 10: 
        
                   raise UnsafePatternError("More than 10 repetitions of %s in %s" % (repetitions[0], pattern)) 
        
               return True

This function fails to identify regular expressions that can require exponential time complexity to match user inputs.

Steps to reproduce:

>>> from SafeRe import isSafePattern, match
>>> p = "a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?a?aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa"
>>> isSafePattern(p)
True
>>> match(p, "aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa")

Observed Results:

match hangs and the execution never completes.

Expected Results:

isSafePattern should properly detect that the pattern is unsafe.
Alternatively, match should use an algorithm with guaranteed linear time complexity to compile and match inputs (e.g. Thompson NFA).

The text was updated successfully, but these errors were encountered:

rllola · 2021-07-26T08:29:29Z

We could replace this by the RE2 (https://github.com/google/re2). There is python bindings available (https://pypi.org/project/google-re2/).

wandrien · 2021-10-20T11:45:47Z

@rllola

Many zites make use of (?!...) and RE2 doesn't seem to support it. (https://github.com/google/re2/wiki/Syntax)
The problem is we neither check for formal allowed regexp syntax, nor have the formal definition at all. Our regexp syntax is implicitly python re syntax.

Not sure if it is possible to move to RE2 in a backward compatible way.

wandrien · 2021-10-20T12:15:41Z

https://github.com/zeronet-enhanced/ZeroNet/commit/2a25d61b968a21aa98c6db2ca9d64f1bbdc54773

In my fork, I (temporarily) fixed this by treating ?s in the same ways as other "repetitions", so the total number of repetition markers cannot exceed 9.

Not sure if it is a proper or a complete solution. I'm not familiar with the ReDoS type of attack and regexp implementation details.

caryoscelus pushed a commit to caryoscelus/zeronet-conservancy that referenced this issue Jul 31, 2023

Fix HelloZeroNet#2757

30db5a4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SafeRe is vulnerable to ReDoS #2757

SafeRe is vulnerable to ReDoS #2757

gqgs commented Jul 13, 2021 •

edited

Loading

rllola commented Jul 26, 2021

wandrien commented Oct 20, 2021

wandrien commented Oct 20, 2021

SafeRe is vulnerable to ReDoS #2757

SafeRe is vulnerable to ReDoS #2757

Comments

gqgs commented Jul 13, 2021 • edited Loading

Step 1: Please describe your environment

Step 2: Describe the problem:

Steps to reproduce:

Observed Results:

Expected Results:

rllola commented Jul 26, 2021

wandrien commented Oct 20, 2021

wandrien commented Oct 20, 2021

gqgs commented Jul 13, 2021 •

edited

Loading