Convert bcrypt to use OpenBSD code #68

reaperhulk · 2016-06-21T23:41:20Z

This allows us to add bcrypt_pbkdf to support OpenSSH keys much more easily.

Some review things:

Verify that the files added are the OpenBSD files (with a few changes like removing static on a definition, adding a header, and removing unused functions.)
Verify that the changes to sha2.c to use be64toh, htobe64, and le32toh are correct.
Carefully check portable_endian.h to verify it does what we expect on our supported platforms.
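
As an aid for the sha2.c check above, here is a minimal Python sketch of what the three conversions are expected to compute. It works on byte strings so the result does not depend on the host's byte order. The helper names are hypothetical and are not part of bcrypt or portable_endian.h; the real macros operate on integers in C.

```python
import struct


def htobe64_bytes(value):
    # In-memory layout that htobe64 should produce: most significant byte first.
    return struct.pack(">Q", value)


def be64toh_from_bytes(data):
    # Inverse of the above: read 8 big-endian bytes back into an integer.
    return struct.unpack(">Q", data)[0]


def le32toh_from_bytes(data):
    # Read 4 little-endian bytes (least significant byte first) as an integer.
    return struct.unpack("<I", data)[0]
```

Comparing the C output against values like these on both a little-endian and a big-endian platform is one way to spot a swapped macro.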

charmander · 2016-06-22T01:37:12Z

src/bcrypt/__init__.py

-    retval = _bcrypt.lib.crypt_gensalt_rn(
-        b"$" + prefix + b"$", rounds, salt, len(salt), output, len(output),
+    return (
+        b"$" + prefix + b"$" + ("%2.2u" % rounds).encode("ascii") + b"$" +


Any reason not to use b"%2.2u" % rounds (or b"$%s$%2.2u$" % (prefix, rounds))?

You can't use % with bytes in Py3 :)

And that’s what I get for “temporarily” linking python to python2. Sorry!

Haha, no problem. I'm pleased that you're reviewing this!

Reviewing history, I did actually check it properly: % works with bytes as of 3.5. (Seeing as 3.3 is the compatibility target, though, that doesn’t really matter.)
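
For reference, a small sketch of the portable spelling used in the diff: format the round count as text, then encode it, so the same expression runs on Python 2 and on Python 3 versions before 3.5. The function name format_salt_prefix is hypothetical and only wraps the expression for illustration.

```python
def format_salt_prefix(prefix, rounds):
    # "%2.2u" zero-pads the cost to two digits; the str result is encoded
    # to ASCII because bytes % args is only available from Python 3.5.
    return b"$" + prefix + b"$" + ("%2.2u" % rounds).encode("ascii") + b"$"
```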

reaperhulk · 2016-06-22T02:44:33Z

Back to WIP because this doesn't work on Windows yet.

reaperhulk · 2016-06-22T04:52:07Z

This is now tested on Mac (El Capitan), Linux, and Windows under py2 and py3.

We should treat improving the test infrastructure for bcrypt as a blocker for merging this, since that validation should come from the status checks, not just from my assertion that I tested it and it works.

dstufft · 2016-06-22T16:12:11Z

src/bcrypt/__init__.py

    salt = os.urandom(16)
    output = _bcrypt.ffi.new("unsigned char[]", 30)
+    _bcrypt.lib.encode_base64(output, salt, len(salt))


Any reason to write/use our own base64 encoding instead of using what the standard library already supplies?

Right, I see we need our own base64 encoding in the C code to return the hashed password, but I think this particular call can just use the standard library version. There's an argument for using it here so that the same encoding function is used everywhere, but it feels to me like it'd be better not to expose the encode_base64 function via cffi at all, to treat it as an internal implementation detail of the library, and to use the standard library code in Python.

This one doesn't add trailing = but I don't know if there are any other differences. We may be able to switch (I originally had it using stdlib but converted it during the port for no good reason).
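
A rough sketch of what switching to the standard library might look like, assuming Python 3. Besides the missing trailing =, bcrypt's encoding uses a different alphabet ("./" first, then A-Z, a-z, 0-9), so the standard output has to be translated as well as stripped. The name bcrypt_b64encode is hypothetical, not a function of this library.

```python
import base64

_STD_ALPHABET = (
    b"ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789+/"
)
_BCRYPT_ALPHABET = (
    b"./ABCDEFGHIJKLMNOPQRSTUVWXYZabcdefghijklmnopqrstuvwxyz0123456789"
)
_TO_BCRYPT = bytes.maketrans(_STD_ALPHABET, _BCRYPT_ALPHABET)


def bcrypt_b64encode(data):
    # Standard base64, minus the "=" padding, mapped onto bcrypt's alphabet.
    encoded = base64.b64encode(data).rstrip(b"=")
    return encoded.translate(_TO_BCRYPT)
```

With a 16-byte salt this yields 22 characters, which is the salt length that appears in a bcrypt hash string.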

dstufft · 2016-06-22T16:23:07Z

Couple of general questions:

Does this support all the bcrypt hash types ($2a$, $2y$, and $2b$)? I assume it supports at least $2b$ since that came from OpenBSD, but what about the other two? If not, we should at least document that as a backwards-incompatible change.
What does updating this code to pull in new code from OpenBSD look like? Is this something that even makes sense to think about?

dstufft · 2016-06-22T16:26:00Z

Another thought: since we're using a different backing library, would it make sense to pull out the test vectors from openwall's code and add them to this library?

reaperhulk · 2016-06-22T17:32:51Z

$2a$ and $2b$ are both supported. $2y$ is not. I've added some tests pulled from openwall for $2a$ and updated the README to note the supported prefixes.

dstufft · 2016-06-22T17:40:05Z

@reaperhulk At one point I think we were defaulting to $2y$ prefixes, so I think that could be a bit of a problem. Looking at the change log for the Openwall code, I see this:

Version 1.3 adds support for the $2b$ prefix introduced in OpenBSD 5.5+, which behaves exactly the same as crypt_blowfish's $2y$. This way, full compatibility with OpenBSD's bcrypt is achieved at this new prefix.

Which suggests to me that we could regain support for the $2y$ prefix using something like:

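import re

# Raw bytes literal so the backslashes reach the regex engine unchanged
# (a plain b"^\$2y\$" relies on invalid string escapes).
_normalize_re = re.compile(br"^\$2y\$")


def _normalize_prefix(hashed):
    # Rewrite a leading $2y$ to $2b$; any other hash is returned as-is.
    return _normalize_re.sub(b"$2b$", hashed)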

dstufft · 2016-06-22T17:42:42Z

I do think that if we do that, though, we should fail when a 2y prefix is passed to gensalt. This should only support working with existing hashes, not continuing to generate 2y hashes.
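
One way the gensalt side of this could look, as a sketch only: accept the prefixes the new backend generates and reject everything else, including 2y. The helper name check_gensalt_prefix and the error message are assumptions for illustration, not the code that was merged.

```python
_SUPPORTED_GENSALT_PREFIXES = (b"2a", b"2b")


def check_gensalt_prefix(prefix):
    # $2y$ is deliberately absent: existing $2y$ hashes can still be
    # verified after normalization, but no new ones should be generated.
    if prefix not in _SUPPORTED_GENSALT_PREFIXES:
        raise ValueError("Supported prefixes are b'2a' or b'2b'")
    return prefix
```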

reaperhulk · 2016-06-22T17:44:46Z

That seems fair. I'll update this PR to add support for understanding $2y$ prefixes and add tests.

reaperhulk · 2016-06-22T23:02:43Z

I realize I didn't address the "what does updating this look like" question... Updating this is going to be challenging because we've taken a snapshot of the OpenBSD code, removed various functions we don't need, changed a few function definitions from static, added some include headers to make it work across various platforms, and changed the way it does endian swapping. Updating will require doing a diff of the new version vs ours and then looking at the diff to understand what has changed that we may want to pull in and what is affecting areas we've intentionally removed/changed.

In practice updating is unlikely to be required except in the event of a security vulnerability.

One potential way we could split this up would be to make the C code a separate project called libbcrypt and then consume that for the Python bindings.
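
The update procedure described above (diff the new upstream file against the vendored copy, then review each hunk) could be started with something like the following sketch, assuming Python 3. The function name and the ours/ and upstream/ labels are hypothetical; deciding which hunks are intentional local changes remains a manual step.

```python
import difflib


def diff_against_upstream(ours, upstream, filename):
    # Unified diff from the vendored text to the upstream text.
    # "-" lines exist only in our copy, "+" lines only upstream.
    return "".join(
        difflib.unified_diff(
            ours.splitlines(keepends=True),
            upstream.splitlines(keepends=True),
            fromfile="ours/" + filename,
            tofile="upstream/" + filename,
        )
    )
```

An empty result means the vendored file still matches upstream exactly.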

dstufft · 2016-06-23T00:17:16Z

@reaperhulk I'm not particularly opposed to the idea that updating is harder, as long as we explicitly make that choice rather than making it accidentally and ending up regretting it. Looking at the code base for the openwall code, it appears that it's had practically no changes since its original implementation besides the prefix handling and that one security fix, so I think it's pretty safe not to worry too hard about this unless we can think of a low-impact way to make it easier.

test vectors from openwall crypt-blowfish1.3

lvh · 2016-06-23T14:59:52Z

Are we using the OpenBSD C code verbatim, sans modification? Because man there are some missing braces for some if statements that I don't like.

reaperhulk · 2016-06-23T15:01:43Z

There have been changes (removal of functions, removal of static on one or two declarations, and the endian change), but I've tried hard to minimize those to make it vaguely possible to compare them. I'd rather not diverge further, but I do agree that the lack of consistent bracing is pretty unfortunate.

painterjd · 2016-06-23T19:43:04Z

src/_csrc/blf.c

+
+	for (i = 0; i < 4; i++) {
+		for (k = 0; k < 256; k += 2) {
+			d[0]^= Blowfish_stream2word(data, databytes, &j);


consistency nitpick: space after right bracket?

Fixing this would make the diff from OpenBSD even harder to read 😢 (most of this is a direct copy from their source tree)

lvh · 2016-06-27T16:52:17Z

I have reviewed this PR.

reaperhulk · 2016-06-27T16:54:39Z

🎉 💃 🏆

This was referenced Jun 22, 2016

Add bcrypt pbkdf bindings #69

Merged

Add bcrypt pbkdf support #70

Merged

reaperhulk force-pushed the bcrypt3 branch from 2f0adf1 to ee8e8d7 Compare June 22, 2016 00:23

charmander reviewed Jun 22, 2016

reaperhulk changed the title ~~[WIP] Convert bcrypt to use OpenBSD code~~ Convert bcrypt to use OpenBSD code Jun 22, 2016

reaperhulk force-pushed the bcrypt3 branch from e2cb1d2 to 273a5e2 Compare June 22, 2016 02:36

reaperhulk changed the title ~~Convert bcrypt to use OpenBSD code~~ [WIP] Convert bcrypt to use OpenBSD code Jun 22, 2016

reaperhulk changed the title ~~[WIP] Convert bcrypt to use OpenBSD code~~ Convert bcrypt to use OpenBSD code Jun 22, 2016

dstufft reviewed Jun 22, 2016

reaperhulk force-pushed the bcrypt3 branch from b7d7428 to 6f85d80 Compare June 22, 2016 17:36

reaperhulk added 10 commits June 22, 2016 20:33

swap to using openbsd bcrypt

8234609

we should probably call this 3.0

c89955d

update tests to handle slight change in behavior, test better

ee616f0

strip out code we're not using

42aa2fc

define this for linux

64f23b8

py3 fix

82e9536

add a changelog to the readme

98d85f5

maybe work with windows

041ce23

portable endian header, replace swaps, other windows fixes

f356e66

handle older windows compilers properly, handle glibc < 2.9, retab

164b721

reaperhulk added 6 commits June 22, 2016 20:33

remove a todo, that's definitely the limit

2789a86

make these definitions conditional since some BSDs may already have them

736c7b9

add $2a$ tests from crypt_blowfish-1.3

be104ce

update readme to note supported prefixes

cd6cf3e

almost pointless commit

847718e

add support for $2y$

7d92e99


reaperhulk force-pushed the bcrypt3 branch from 2572abc to 7d92e99 Compare June 23, 2016 01:33

painterjd reviewed Jun 23, 2016

lvh merged commit 394882d into pyca:master Jun 27, 2016

reaperhulk deleted the bcrypt3 branch June 27, 2016 16:55

meanphil mentioned this pull request Aug 13, 2020

bcrypt 3.1.15 failing on FreeBSD 12 bcrypt-ruby/bcrypt-ruby#233

Closed

github-actions bot locked as resolved and limited conversation to collaborators Aug 16, 2020
