## Code and Unicode

These basic demos show how bits, bytes, and Unicode text are represented in the system.

Data is a finite sequence of codas, and a coda is a pair of data.  Everything is therefore constructed from empty sequences, including bits, bytes, and strings of bytes.

In [1]:
from base import * 
import string
import Code 

In [2]:
string.printable  # Python's printable characters each have an equivalent coda

'0123456789abcdefghijklmnopqrstuvwxyzABCDEFGHIJKLMNOPQRSTUVWXYZ!"#$%&\'()*+,-./:;<=>?@[\\]^_`{|}~ \t\n\r\x0b\x0c'

Bits are made from pure data as follows.

In [3]:
z = data()
bit0 = z|z
bit1 = z|data(z|z)
print(bit0,repr(bit0),type(bit0))
print(bit1,repr(bit1),type(bit1))

𝟬 (:) <class 'base.coda'>
𝟭 (:(:)) <class 'base.coda'>
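
The construction above can be mirrored in a small toy model. This is only a sketch and not the `base` module: it assumes data can be modeled as a Python tuple of codas and a coda as a `(left, right)` pair of data. The names `EMPTY`, `make_coda`, `render_coda`, and `render_data` are hypothetical.

```python
# Toy model (assumption, not the `base` module):
#   data = tuple of codas, coda = (left, right) pair of data.
EMPTY = ()  # the empty sequence of codas


def make_coda(left, right):
    """Pair two pieces of data into a coda."""
    return (left, right)


def render_coda(c):
    """Render one coda in the (left:right) notation used by repr above."""
    left, right = c
    return "(" + render_data(left) + ":" + render_data(right) + ")"


def render_data(d):
    """Render data by concatenating the renderings of its codas."""
    return "".join(render_coda(c) for c in d)


bit0 = make_coda(EMPTY, EMPTY)    # plays the role of z|z
bit1 = make_coda(EMPTY, (bit0,))  # plays the role of z|data(z|z)

print(render_coda(bit0))  # (:)
print(render_coda(bit1))  # (:(:))
```

Both bits bottom out in empty tuples, which is the sense in which everything is built from empty sequences.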


The Code module contains mappings between some codas and Unicode strings.  These mappings are used for display purposes only.  To create text data in Python, one normally uses the functions `co` (to make a coda) or `da` (to make data).

In [4]:
cotext = co('Hello World \u03A0')
datext = da('Hello World \u03A9') 
print(cotext,type(cotext))
print(datext,type(datext))

(:Hello World (:𝟭𝟭𝟭𝟬𝟭𝟬𝟬𝟬𝟬𝟬)) <class 'base.coda'>
(:Hello World (:𝟭𝟭𝟭𝟬𝟭𝟬𝟭𝟬𝟬𝟭)) <class 'base.data'>


You might be wondering about the colon in (:Hello...).  It marks a coda whose left data is the empty sequence.  The definition attached to the empty sequence is the identity, which guarantees that the string "Hello World..." won't be modified by some other definition; it is "atomic" data.  Bits, bytes and strings are all created as atoms by `co` and `da`.  If you want, you can get at the components as follows.

In [5]:
for c in cotext.right(): print(c,repr(c))

H (:(:)(:(:))(:)(:)(:(:))(:)(:)(:))
e (:(:)(:(:))(:(:))(:)(:)(:(:))(:)(:(:)))
l (:(:)(:(:))(:(:))(:)(:(:))(:(:))(:)(:))
l (:(:)(:(:))(:(:))(:)(:(:))(:(:))(:)(:))
o (:(:)(:(:))(:(:))(:)(:(:))(:(:))(:(:))(:(:)))
  (:(:)(:)(:(:))(:)(:)(:)(:)(:))
W (:(:)(:(:))(:)(:(:))(:)(:(:))(:(:))(:(:)))
o (:(:)(:(:))(:(:))(:)(:(:))(:(:))(:(:))(:(:)))
r (:(:)(:(:))(:(:))(:(:))(:)(:)(:(:))(:))
l (:(:)(:(:))(:(:))(:)(:(:))(:(:))(:)(:))
d (:(:)(:(:))(:(:))(:)(:)(:(:))(:)(:))
  (:(:)(:)(:(:))(:)(:)(:)(:)(:))
(:𝟭𝟭𝟭𝟬𝟭𝟬𝟬𝟬𝟬𝟬) (:(:(:))(:(:))(:(:))(:)(:(:))(:)(:)(:)(:)(:))
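
The repr strings in this output can be read back into characters by hand. The sketch below assumes only the layout visible above: an outer `(:` ... `)` wrapping one `(:)` per 0 bit and one `(:(:))` per 1 bit. It works on plain strings and does not use the `base` or `Code` modules; `decode_atom` is a hypothetical helper.

```python
def decode_atom(text):
    """Turn a repr such as '(:(:)(:(:))...)' back into a character.

    Assumes the layout seen in the notebook output: '(:' + bits + ')',
    where each bit is '(:)' for 0 or '(:(:))' for 1.
    """
    if not (text.startswith("(:") and text.endswith(")")):
        raise ValueError("not an atom repr: " + text)
    inner = text[2:-1]
    bits = []
    i = 0
    while i < len(inner):
        if inner.startswith("(:(:))", i):
            bits.append("1")
            i += 6
        elif inner.startswith("(:)", i):
            bits.append("0")
            i += 3
        else:
            raise ValueError("unexpected text at offset %d" % i)
    if not bits:
        raise ValueError("no bits found in: " + text)
    # Interpret the bit string as a code point.
    return chr(int("".join(bits), 2))


print(decode_atom("(:(:)(:(:))(:)(:)(:(:))(:)(:)(:))"))              # H
print(decode_atom("(:(:(:))(:(:))(:(:))(:)(:(:))(:)(:)(:)(:)(:))"))  # Π
```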


In [6]:
Code.coda2str(cotext)

'Hello World (:𝟭𝟭𝟭𝟬𝟭𝟬𝟬𝟬𝟬𝟬)'

In [7]:
H = cotext.right()[0]
for b in H.right(): print(b,repr(b))

𝟬 (:)
𝟭 (:(:))
𝟬 (:)
𝟬 (:)
𝟭 (:(:))
𝟬 (:)
𝟬 (:)
𝟬 (:)


In [8]:
b = H.right()[0]
print(b,repr(b))

𝟬 (:)


In [9]:
#
#   You can also create individual bytes in the unlikely event this is necessary  
#
print('a',Code.byte('a'),'\u03A9',Code.byte('\u03A9'))

a a Ω (:𝟭𝟭𝟭𝟬𝟭𝟬𝟭𝟬𝟬𝟭)
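
The bit pattern shown for Ω is its code point (U+03A9) written in binary, while ASCII characters elsewhere in this notebook appear as 8 bits. A minimal sketch of that relationship follows. The rule "pad to at least 8 bits" is an assumption inferred from the outputs shown here, not taken from the Code module, and `code_point_bits` and `display_bits` are hypothetical names.

```python
def code_point_bits(ch):
    """Binary digits of a character's code point, padded to at least 8 bits.

    Assumption: the padding rule is inferred from the notebook outputs.
    """
    return format(ord(ch), "08b")


def display_bits(bits):
    """Swap ASCII 0/1 for the bold glyphs used in the notebook output."""
    return bits.translate(str.maketrans("01", "𝟬𝟭"))


for ch in ("a", "\u03A0", "\u03A9"):
    print(ch, hex(ord(ch)), display_bits(code_point_bits(ch)))
# a 0x61 𝟬𝟭𝟭𝟬𝟬𝟬𝟬𝟭
# Π 0x3a0 𝟭𝟭𝟭𝟬𝟭𝟬𝟬𝟬𝟬𝟬
# Ω 0x3a9 𝟭𝟭𝟭𝟬𝟭𝟬𝟭𝟬𝟬𝟭
```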


In [10]:
#
#   The CODE dictionary in the Code module contains a standard coda<->unicode correspondence.  This is only 
#   used for display purposes.
#
for key,value in Code.CODE.items(): print(type(value),value,'<-',repr(key))

<class 'str'> 𝟬 <- (:)
<class 'str'> 𝟭 <- (:(:))
<class 'str'> 0 <- (:(:)(:)(:(:))(:(:))(:)(:)(:)(:))
<class 'str'> 1 <- (:(:)(:)(:(:))(:(:))(:)(:)(:)(:(:)))
<class 'str'> 2 <- (:(:)(:)(:(:))(:(:))(:)(:)(:(:))(:))
<class 'str'> 3 <- (:(:)(:)(:(:))(:(:))(:)(:)(:(:))(:(:)))
<class 'str'> 4 <- (:(:)(:)(:(:))(:(:))(:)(:(:))(:)(:))
<class 'str'> 5 <- (:(:)(:)(:(:))(:(:))(:)(:(:))(:)(:(:)))
<class 'str'> 6 <- (:(:)(:)(:(:))(:(:))(:)(:(:))(:(:))(:))
<class 'str'> 7 <- (:(:)(:)(:(:))(:(:))(:)(:(:))(:(:))(:(:)))
<class 'str'> 8 <- (:(:)(:)(:(:))(:(:))(:(:))(:)(:)(:))
<class 'str'> 9 <- (:(:)(:)(:(:))(:(:))(:(:))(:)(:)(:(:)))
<class 'str'> a <- (:(:)(:(:))(:(:))(:)(:)(:)(:)(:(:)))
<class 'str'> b <- (:(:)(:(:))(:(:))(:)(:)(:)(:(:))(:))
<class 'str'> c <- (:(:)(:(:))(:(:))(:)(:)(:)(:(:))(:(:)))
<class 'str'> d <- (:(:)(:(:))(:(:))(:)(:)(:(:))(:)(:))
<class 'str'> e <- (:(:)(:(:))(:(:))(:)(:)(:(:))(:)(:(:)))
<class 'str'> f <- (:(:)(:(:))(:(:))(:)(:)(:(:))(:(:))(:))
<class 'str'> g <- (:(:)(:(:))(
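
A display table of this shape can be sketched with plain strings. The version below is an illustration only: it keys the table on repr strings rather than on coda objects, covers just the two bits plus `string.printable`, and assumes the 8-bit layout seen in the entries above. `byte_repr`, `DISPLAY`, and `show` are hypothetical names, not part of the Code module.

```python
import string


def byte_repr(ch):
    """Build the repr string of a one-byte atom from a character."""
    bits = format(ord(ch), "08b")
    return "(:" + "".join("(:(:))" if b == "1" else "(:)" for b in bits) + ")"


# Bits first, then the printable characters, as in the listing above.
DISPLAY = {"(:)": "𝟬", "(:(:))": "𝟭"}
for ch in string.printable:
    DISPLAY[byte_repr(ch)] = ch


def show(atom_repr):
    """Return the display string for a repr, or the repr itself if unknown."""
    return DISPLAY.get(atom_repr, atom_repr)


print(show("(:(:)(:)(:(:))(:(:))(:)(:)(:)(:))"))        # 0
print(show("(:(:)(:(:))(:(:))(:)(:)(:)(:)(:(:)))"))     # a
```

Falling back to the raw repr for unknown keys matches the idea that the table is used for display only.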

In [11]:
a = Code.byte('}')

In [12]:
a

(:(:)(:(:))(:(:))(:(:))(:(:))(:(:))(:)(:(:)))

In [13]:
type(a)

base.coda

In [14]:
type(a.left()),type(a.right())

(base.data, base.data)

In [15]:
a.left(),a.right()

(, (:)(:(:))(:(:))(:(:))(:(:))(:(:))(:)(:(:)))

In [16]:
str(a.left())

''

In [17]:
str(a.right())

'𝟬𝟭𝟭𝟭𝟭𝟭𝟬𝟭'

In [18]:
Code.byte('}')==Code.byte('{')

False

In [19]:
a = Code.byte('a'); b = Code.byte('b')

In [20]:
a==b

False

In [21]:
type(a.right()),type(b.right())

(base.data, base.data)

In [22]:
str(a.right())

'𝟬𝟭𝟭𝟬𝟬𝟬𝟬𝟭'

In [23]:
str(b.right())

'𝟬𝟭𝟭𝟬𝟬𝟬𝟭𝟬'

In [24]:
ar = a.right()
br = b.right()

In [25]:
for x in ar: print(type(x),x)

<class 'base.coda'> 𝟬
<class 'base.coda'> 𝟭
<class 'base.coda'> 𝟭
<class 'base.coda'> 𝟬
<class 'base.coda'> 𝟬
<class 'base.coda'> 𝟬
<class 'base.coda'> 𝟬
<class 'base.coda'> 𝟭


In [26]:
for x in br: print(type(x),x)

<class 'base.coda'> 𝟬
<class 'base.coda'> 𝟭
<class 'base.coda'> 𝟭
<class 'base.coda'> 𝟬
<class 'base.coda'> 𝟬
<class 'base.coda'> 𝟬
<class 'base.coda'> 𝟭
<class 'base.coda'> 𝟬


In [27]:
def equal(A,B):
    # Equal when the lengths match and every pair of corresponding elements is equal.
    return len(A)==len(B) and all(A[i]==B[i] for i in range(len(A)))

In [28]:
ar6 = ar[6]; br6 = br[6]

In [29]:
type(ar6)

base.coda

In [30]:
type(br6)

base.coda

In [31]:
print(ar6,br6)

𝟬 𝟭
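
Index 6 is where the bytes for 'a' and 'b' first differ, which can be checked on plain bit strings. This sketch uses ordinary Python strings rather than codas; `bits` and `first_difference` are hypothetical helpers.

```python
def bits(ch):
    """8-bit binary string for an ASCII character."""
    return format(ord(ch), "08b")


def first_difference(a, b):
    """Index of the first position where two equal-length sequences differ.

    Returns None when no compared position differs.
    """
    for i, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return i
    return None


a_bits, b_bits = bits("a"), bits("b")
i = first_difference(a_bits, b_bits)
print(a_bits, b_bits, i)     # 01100001 01100010 6
print(a_bits[i], b_bits[i])  # 0 1
```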


In [32]:
type(ar6),type(br6)

(base.coda, base.coda)

In [33]:
Code.b0

(:)

In [34]:
Code.b1

(:(:))

In [35]:
type(Code.b1.left()),type(Code.b1.right())

(base.data, base.data)