enhancement(filters): use a stricter bot token regex by onerandomusername · Pull Request #2006 · python-discord/bot

onerandomusername · 2021-12-12T02:16:27Z

Approval: https://canary.discord.com/channels/267624335836053506/635950537262759947/919203386660884560

Enhances the token remover's regex to match the one Discord itself uses, with a slight modification: the MFA section was removed, though it may be implemented later depending on an updated #1421. The sections were also wrapped in groups so the regex keeps working with the current code.

I kept the existing validation to keep false positives to a minimum. The current code checks that the user resolves, that the timestamp is valid, and that the last section has at least 3 different characters.
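As a rough illustration of the three checks described above (not the bot's actual code), a minimal sketch might look like the following. The function names and the `min_timestamp` parameter are hypothetical, and the sketch assumes the first section is a base64-encoded numeric ID and the second a base64-encoded big-endian integer. Resolving the ID to a real user is left out, since that needs a live Discord connection.

```python
import base64
import binascii
from typing import Optional


def pad_base64(data: str) -> str:
    """Add the '=' padding that urlsafe_b64decode expects."""
    return data + "=" * (-len(data) % 4)


def decode_user_id(first: str) -> Optional[int]:
    """Return the numeric ID encoded in the first section, or None."""
    try:
        text = base64.urlsafe_b64decode(pad_base64(first)).decode("ascii")
    except (binascii.Error, ValueError):
        return None
    # An empty or non-numeric payload cannot be an ID.
    return int(text) if text.isdigit() else None


def decode_timestamp(second: str) -> Optional[int]:
    """Return the integer encoded in the second section, or None."""
    try:
        raw = base64.urlsafe_b64decode(pad_base64(second))
    except (binascii.Error, ValueError):
        return None
    return int.from_bytes(raw, byteorder="big")


def passes_validation(first: str, second: str, third: str, min_timestamp: int = 0) -> bool:
    """Apply the three checks; min_timestamp is an assumed lower bound."""
    user_id = decode_user_id(first)
    timestamp = decode_timestamp(second)
    return (
        user_id is not None
        and timestamp is not None
        and timestamp >= min_timestamp
        and len(set(third)) >= 3  # at least 3 different characters
    )
```

Because these checks run after the regex matches, the regex alone does not decide what gets flagged.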

As per @jb3, to implement:

~~MFA User Token filter #1421~~ now irrelevant.

Rejected:

also check the upper bound of a token timestamp, to ensure it was in the past.

onerandomusername · 2021-12-12T16:33:27Z

I'll be fixing this commit history tonight

Bluenix2

Works as usual - I can't get access to an MFA token to test though. Thanks for this, I just have one comment that won't affect my review.

Akarys42

LGTM, just a small style change

wookie184

Just a couple of things: could you also make the message match the other token message a bit more closely, particularly:

- having the user as a clickable mention and the ID in a codeblock afterwards
- having the channel as a clickable mention
- putting the censored token in a codeblock
- adding the profile picture of the user that sent the message as a thumbnail

wookie184 · 2022-09-21T21:54:59Z

This has been inactive for a while so I'll put it up for grabs.

gustavwilliam · 2022-10-08T17:34:47Z

> This has been inactive for a while so I'll put it up for grabs.

What's left to do? Addressing your comment and fixing the merge conflict?

mbaruh · 2022-10-08T19:52:37Z

> > This has been inactive for a while so I'll put it up for grabs.
>
> What's left to do? Addressing your comment and fixing the merge conflict?

Yeah, merge main, match the output embed to the existing one, and fix the bug

onerandomusername · 2022-10-09T05:20:44Z

As far as I can tell the mfa code is no longer necessary as discord doesn't have mfa tokens like that now.

onerandomusername · 2022-10-09T05:23:40Z

There's also a bit of the matter that this regex is now out of date. I don't know what the current token length is.

wookie184 · 2022-10-09T11:19:58Z

> There's also a bit of the matter that this regex is now out of date. I don't know what the current token length is.

I can't recall any problems with false positives, even with the current regex, so we can afford to be very general with what we match. Could just check that the first and last parts are >= 10 and middle is >= 5 characters in length or something.

Akarys42 · 2022-10-09T18:46:57Z

I agree with wookie. I think permissive matching, even just for the sake of future-proofing, is okay, considering that additional checks are performed afterwards.

onerandomusername · 2022-10-10T01:31:53Z

The only inaccurate part is the last section is too short as it is, but because of how the parsing works, that isn't an issue, i suppose

wookie184 · 2022-10-10T10:26:11Z

> The only inaccurate part is the last section is too short as it is, but because of how the parsing works, that isn't an issue, i suppose

If that's the case I think we should just have no upper limit on the length of the last section. Otherwise the code is somewhat misleading in how it works.

mbaruh · 2022-10-14T12:31:47Z

Hey @onerandomusername, thanks for your work so far. We needed to get this merged so went ahead and implemented the comments. Sorry the MFA part didn't end up being used 😅

Requested changes were related to the removed MFA part.

onerandomusername · 2022-10-14T16:22:52Z

 # Each part only matches base64 URL-safe characters.
 # These regexes were taken from discord-developers, which are used by the client itself.
-TOKEN_RE = re.compile(r"([a-z0-9_-]{23,28})\.([a-z0-9_-]{6,7})\.([a-z0-9_-]{27})", re.IGNORECASE)
+TOKEN_RE = re.compile(r"([\w_-]{10,})\.([\w_-]{5,})\.([\w_-]{10,})", re.IGNORECASE)


Way too lenient: this lets clearly impossible tokens match, and preventing that was the entire point of this PR.

HassanAbouelela · 2022-10-14

Way too lenient is not defined by what it can match, but by what it will match in practice, which so far has not been an issue. This PR made the regex a little more flexible in terms of false positives, without increasing the possibility of false negatives or creating something that has to be modified in the future.
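To make the trade-off in this thread concrete, here is a small sketch that runs both patterns from the diff above against made-up strings (none of them are real tokens). The constant names `STRICT_TOKEN_RE` and `LENIENT_TOKEN_RE` and the sample strings are assumptions for illustration only.

```python
import re

# The two patterns from the diff, under hypothetical names.
STRICT_TOKEN_RE = re.compile(
    r"([a-z0-9_-]{23,28})\.([a-z0-9_-]{6,7})\.([a-z0-9_-]{27})", re.IGNORECASE
)
LENIENT_TOKEN_RE = re.compile(
    r"([\w_-]{10,})\.([\w_-]{5,})\.([\w_-]{10,})", re.IGNORECASE
)

# Made-up samples: section lengths are chosen to exercise each pattern.
token_shaped = "A" * 24 + "." + "B" * 6 + "." + "C" * 27
long_last_section = "A" * 24 + "." + "B" * 6 + "." + "C" * 38
dotted_words = "configuration.settings.development"

for sample in (token_shaped, long_last_section, dotted_words):
    strict = STRICT_TOKEN_RE.search(sample)
    lenient = LENIENT_TOKEN_RE.search(sample)
    print(
        sample[:20] + "...",
        "strict last section:", len(strict.group(3)) if strict else None,
        "lenient last section:", len(lenient.group(3)) if lenient else None,
    )
```

With a 38-character last section, the strict pattern still matches but captures only the first 27 characters, while the lenient pattern captures all 38. The lenient pattern also matches ordinary dotted words, which is why the follow-up validation matters.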

onerandomusername requested review from jb3 and mbaruh as code owners December 12, 2021 02:16

onerandomusername mentioned this pull request Dec 12, 2021

MFA User Token filter #1421

Closed

onerandomusername force-pushed the enhance-token-regex branch 2 times, most recently from c676f1a to b5f1305 on December 12, 2021 06:02

onerandomusername added 2 commits December 12, 2021 19:42

enhancement: use a stricter bot token regex

3acdaf2

chore: don't use a second listener

941ea8c

onerandomusername force-pushed the enhance-token-regex branch from 033ca53 to 941ea8c on December 13, 2021 00:42

Xithrius requested review from Bluenix2 and MarkKoz December 15, 2021 14:03

Bluenix2 approved these changes Dec 16, 2021

Comment thread bot/exts/filters/token_remover.py

Akarys42 approved these changes Dec 17, 2021

Comment thread bot/exts/filters/token_remover.py

wookie184 previously requested changes Jan 15, 2022

Comment thread bot/exts/filters/token_remover.py Outdated

Comment thread bot/exts/filters/token_remover.py Outdated

fix: use the proper number of X's

8fc5033

Co-authored-by: wookie184 <wookie1840@gmail.com>

ToxicKidz added the "s: waiting for author" label and removed the "s: needs review" label Feb 19, 2022

wookie184 added the "up for grabs" label and removed the "s: waiting for author" label Sep 21, 2022

Update token_remover.py

e9671c1

onerandomusername added 2 commits October 9, 2022 01:25

Update token_remover.py

7126a4d

Merge branch 'main' into enhance-token-regex

00a1fe3

onerandomusername marked this pull request as draft October 9, 2022 05:26

fix the applications link

8f99065

onerandomusername force-pushed the enhance-token-regex branch from 3234e2a to 8f99065 on October 9, 2022 05:30

onerandomusername marked this pull request as ready for review October 10, 2022 01:31

mbaruh removed the "up for grabs" label Oct 10, 2022

HassanAbouelela added 4 commits October 14, 2022 15:11

Use More Lenient Length Validation

4b08a08

Update token_remover.py

66e0bda

Merge branch 'main' into enhance-token-regex

b88dc97

Update token_remover.py

232fe6b

mbaruh approved these changes Oct 14, 2022

mbaruh merged commit cdb9183 into python-discord:main Oct 14, 2022

onerandomusername deleted the enhance-token-regex branch October 14, 2022 16:22

9 participants
