Add minlen feature to base62.encode() #7

coolspeed · 2017-05-19T10:26:54Z

This PR will add minlen feature. In other words, it does zero padding for encode output string.

This implementation (and naming) is copied from the famous and widely used python bitcoin library
pybitcointools :

https://github.com/vbuterin/pybitcointools/blob/a82b00686b51677b047098e8968074a783e054a1/bitcoin/py2specials.py

coveralls · 2017-05-19T10:29:22Z

Coverage increased (+0.3%) to 91.525% when pulling 89d2f1b on coolspeed:develop into 0a423f3 on suminb:develop.

And there's no performance penalty.

coveralls · 2017-05-19T10:53:48Z

Coverage decreased (-1.9%) to 89.286% when pulling 49009df on coolspeed:develop into 0a423f3 on suminb:develop.

suminb · 2017-05-19T10:59:15Z

base62.py

-        return ord(ch) - ord('a') + 36
-    else:
+    try:
+        return CHARSET.index(ch)


이거 좋네요.

처음엔 문자열 스캔을 없애 성능을 최적화하려고 했었는데, ord() 의 성능이 불투명하기도 하고, 차이가 거의 없을 것 같기도 해서, 차라리 문자열 스캔을 편하게 해버리는 쪽으로 방향을 틀었습니다 ㅎㅎ

만약 성능이 걱정된다면 다음과 같이 상수 시간으로 접근할 수 있는 자료구조를 만드는 편이 좋을 것 같습니다.

CHARSET_INDEX = {'0': 0, '1': 1, ..., 'A': 10, ..., 'z': 61}

이흥섭님의 의견:

REVERSE_CHARSET = {v: k for k, v in enumerate(CHARSET)}

ㄴ 저도 이것이 더 좋아보입니다.

suminb · 2017-05-19T11:01:16Z

base62.py

@@ -31,22 +31,24 @@ def bytes_to_int(s, byteorder='big', signed=False):
        return sum(ds)


-def encode(n):
+def encode(n, minlen=0):


패딩의 길이가 아니라 인코딩 된 문자열의 최소 길이를 지정하는 인자니까 기본값이 1이 되어야 하지 않을까요?

ㄴ 문자열의 길이가 0 일 수 있어서요.

suminb · 2017-05-19T11:05:27Z

tests/test_basic.py

@@ -15,6 +15,7 @@

 def test_basic():
    assert base62.encode(0) == '0'
+    assert base62.encode(0, minlen=5) == '00000'


이것과 비슷하게 decode() 쪽에도 테스트 케이스를 추가했으면 좋겠습니다. 예를 들면,

assert base62.decode('00000') == 0

반영하여 추가했습니다.

suminb · 2017-05-19T11:12:42Z

base62.py

-        return ord(ch) - ord('a') + 36
-    else:
+    try:
+        return CHARSET.index(ch)


만약 성능이 걱정된다면 다음과 같이 상수 시간으로 접근할 수 있는 자료구조를 만드는 편이 좋을 것 같습니다.

CHARSET_INDEX = {'0': 0, '1': 1, ..., 'A': 10, ..., 'z': 61}

suminb · 2017-05-19T11:19:47Z

base62.py

-        return ord(ch) - ord('a') + 36
-    else:
+    try:
+        return CHARSET.index(ch)


이흥섭님의 의견:

REVERSE_CHARSET = {v: k for k, v in enumerate(CHARSET)}

coveralls · 2017-05-19T11:59:40Z

Coverage decreased (-1.9%) to 89.286% when pulling 6478bfa on coolspeed:develop into 0a423f3 on suminb:develop.

Woongryong Kim added 2 commits May 19, 2017 19:18

Add minlen to encode()

4444312

Add test case for minlen in encode()

89d2f1b

Simplify __value__ implementation through one-way string scanninng

49009df

And there's no performance penalty.

suminb reviewed May 19, 2017

View reviewed changes

Add test case for zero-padded string decoding

6478bfa

suminb merged commit ced868b into suminb:develop May 21, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add minlen feature to base62.encode() #7

Add minlen feature to base62.encode() #7

coolspeed commented May 19, 2017

coveralls commented May 19, 2017 •

edited

Loading

coveralls commented May 19, 2017 •

edited

Loading

suminb May 19, 2017

coolspeed May 19, 2017

suminb May 19, 2017

suminb May 19, 2017

coolspeed May 22, 2017

suminb May 19, 2017

coolspeed May 22, 2017

suminb May 19, 2017

coolspeed May 19, 2017

suminb May 19, 2017

suminb May 19, 2017

coveralls commented May 19, 2017 •

edited

Loading

Add minlen feature to base62.encode() #7

Add minlen feature to base62.encode() #7

Conversation

coolspeed commented May 19, 2017

coveralls commented May 19, 2017 • edited Loading

coveralls commented May 19, 2017 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

coveralls commented May 19, 2017 • edited Loading

coveralls commented May 19, 2017 •

edited

Loading

coveralls commented May 19, 2017 •

edited

Loading

coveralls commented May 19, 2017 •

edited

Loading