consider getting rid of base58 #599

MonsieurNicolas · 2015-07-09T23:58:33Z

This is our last chance to do this or we'll regret it forever!

We should use a format like base32-zooko that has nice properties like:

human friendly (case insensitive, avoids similar characters)
uses bit wise logic and lookups (simple and fast)

http://philzimmermann.com/docs/human-oriented-base-32-encoding.txt

This would also resolve #273 (where there is no real solution other than use a different encoding in the database).

Here are the differences in size between encodings:
hex => 64 characters
base32 => 51 characters
base64 => 42 characters
base58 => 46 or 47 characters

jedmccaleb · 2015-07-10T00:54:21Z

rough consensus that we should use the RFC base32 and change the checksum to a normal CRC

graydon · 2015-07-10T00:57:44Z

I feel like I've been providing all the ammunition on this one, despite not really caring.

Our tendency so far has been to follow RFCs when they exist, so in this case https://tools.ietf.org/html/rfc4648 is the canonical encoding.

If we're going to do this -- especially if speed is any concern -- then we should also get rid of the double-sha256-truncated-to-32-bits "checksum" appended to the identifier. This is a pointlessly overpowered "check" against typos on the part of the user; a simple/cheap CRC32 (ISO 3309 / Gzip algorithm -- tons of libraries do this) or CRC16-CCITT or a check-digit scheme designed to catch real transposition/replacement errors (a la https://en.wikipedia.org/wiki/Check_digit ) is sufficient.

Since we probably want to avoid padding characters (ew =) we will want to land on an encoding-group multiple. Encoding-groups are 40 input bits => 8 output chars we'd probably be fine with 7 groups = 56 output chars = 280 input bits = 8 bits typecode/flags/whatever + 256 bits key + 16 bits CRC?

MonsieurNicolas · 2015-07-10T03:58:44Z

that sounds good and yes, I came up with the same conclusion on the encoding. with the RFC one it's very easy for somebody to understand which digits are allowed vs not and we avoid pushing logic to clients that would have to deal with denormalized forms.
As for the checksum: CRC-16 seems like a good candidate as, I think, it would basically detect any error of up to 16 bits (which includes any mutation of 3 consecutive digits) and other errors with some good probability.

MonsieurNicolas added protocol discussion labels Jul 9, 2015

MonsieurNicolas added this to the Production milestone Jul 9, 2015

jedmccaleb added the Production label Jul 13, 2015

bekkibolthouse added in progress and removed Production labels Jul 13, 2015

MonsieurNicolas self-assigned this Jul 14, 2015

MonsieurNicolas mentioned this issue Jul 14, 2015

move to "StellarKey" instead of base58 #619

Merged

latobarita closed this as completed in #619 Jul 17, 2015

jedmccaleb removed the in progress label Jul 17, 2015

jedmccaleb mentioned this issue Jan 13, 2016

Format to use for AccountIDs stellar/stellar-protocol#24

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

consider getting rid of base58 #599

consider getting rid of base58 #599

MonsieurNicolas commented Jul 9, 2015

jedmccaleb commented Jul 10, 2015

graydon commented Jul 10, 2015

MonsieurNicolas commented Jul 10, 2015

consider getting rid of base58 #599

consider getting rid of base58 #599

Comments

MonsieurNicolas commented Jul 9, 2015

jedmccaleb commented Jul 10, 2015

graydon commented Jul 10, 2015

MonsieurNicolas commented Jul 10, 2015