Skip to content

keredson/ygit

Repository files navigation

header

ygit

A tiny (yocto) git client for MicroPython microcontrollers / other memory-constrained environments (<100k) like the $6 ESP32. It speaks to git HTTP/HTTPS servers using the ubiquitous smart client protocol.

It supports branches/tags, shallow clones and sparse checkouts (ie: only checking out a sub-directory of a larger project).

I use it to remotely deploy/update code over low-bandwidth connections.

Install

$ wget https://raw.githubusercontent.com/keredson/ygit/main/ygit.py
$ mpy-cross ygit.py
$ ampy -p /dev/ttyUSB0 put ygit.mpy

Get Started

To clone a repo, run:

>>> repo = ygit.clone('https://github.com/turfptax/ugit_test.git')

If you don't want to clone into the root directory of your device, pass a target directory as a second argument. This will produce a shallow clone (at HEAD) by default. It will not delete any files in the target directory, but it will overwrite them if conflicting. The normal git files you'd expect (config, *.pack, IDX) will be in .ygit. You only need to run this once.

To update:

>>> repo.pull()

Which is the same as:

>>> repo.fetch()
>>> repo.checkout()

These are incremental operations. It will only download git objects you don't already have, and only update files when their SHA1 values don't match.

API

# make a new clone
repo = ygit.clone(repo, directory='.', shallow=True, cone=None, 
                  quiet=False, ref='HEAD', username=None, password=None)

# control an already cloned repository
repo = ygit.Repo(directory='.')

# control
repo.checkout(ref='HEAD')
repo.pull(shallow=True, quiet=False, ref='HEAD')
repo.fetch(shallow=True, quiet=False, ref='HEAD')
repo.status(ref='HEAD')
repo.tags()
repo.branches()
repo.pulls()
repo.update_authentication(username, password)
repo.log()

A ref is one of:

  • HEAD
  • a commit (ex: 7b36b4cb1616694d8562f3adea656a709b9831d9)
  • a branch / tag / pull

Read the full documentation.

Design

Shallow Cloning

By default clones are shallow to save space. If you try to checkout an unknown ref, ygit will fetch a new packfile from the original server.

Subdirectory Cloning

Usually I don't want to clone an entire project onto my ESP32. The python I want on the device is in a subdirectory of a larger project. The cone argument will take a path, and only files in that directory will be checked out (as if it were the top level).

TODO: Do a blob filter to only fetch objects we intend to check out.

Authentication

Supply a username/password to clone(). The credentials will be stored on the device, AES encrypted with the machine id as the key. For GitHub, use your personal access token as the password (just like w/ regular git).

Tests

  • Prereq: Run nginx -c "$(pwd)/misc/test_nginx.conf" -e stderr in the background for the local tests.
  • pytest test_localhost.py (runs local tests)
  • pytest test_gh.py (runs github tests)
  • pytest test_micropython.py (WARNING: will wipe all files except boot.py from your MicroPython device at /dev/ttyUSB0.)

As a convenience, running python test_micropython.py (note python instead of pytest) will run only the reset device code. I typically run python test_micropython.py && picocom /dev/ttyUSB0 -b 115200 when making a change.

Example Run

$ picocom /dev/ttyUSB0 -b 115200

MicroPython v1.19.1 on 2022-06-18; ESP32 module with ESP32
Type "help()" for more information.

>>> import ygit
>>> repo = ygit.clone('https://github.com/keredson/ygit.git','ygit')
cloning https://github.com/keredson/ygit.git into ygit @ HEAD
fetching: https://github.com/keredson/ygit.git @ HEAD
fetching commit: bc0f8c042d06f3c78be2066af11419357d1b6e0e
Enumerating objects: 26, done.
Counting objects:   3% (1/26)
[...]
Counting objects: 100% (26/26), done.
Compressing objects:   8% (2/24)
[...]
Compressing objects: 100% (24/24), done.
>>>>>>>>>>>>>>>>>>>>>>>
Total 26 (delta 0), reused 15 (delta 0), pack-reused 0
##########################
checking out bc0f8c042d06f3c78be2066af11419357d1b6e0e
writing: ygit/.gitignore (BLOB)
writing: ygit/.readthedocs.yml (BLOB)
writing: ygit/LICENSE (BLOB)
writing: ygit/README.md (BLOB)
writing: ygit/deploy.sh (BLOB)
writing: ygit/pyproject.toml (BLOB)
writing: ygit/requirements.txt (BLOB)
writing: ygit/setup.py (BLOB)
writing: ygit/test_gh.py (BLOB)
writing: ygit/test_localhost.py (BLOB)
writing: ygit/test_micropython.py (BLOB)
writing: ygit/ygit.py (BLOB)
writing: ygit/docs/Makefile (BLOB)
writing: ygit/docs/api.rst (BLOB)
writing: ygit/docs/conf.py (BLOB)
writing: ygit/docs/index.rst (BLOB)
writing: ygit/docs/make.bat (BLOB)
writing: ygit/docs/usage.rst (BLOB)
writing: ygit/docs/source/api.rst (BLOB)
writing: ygit/docs/source/conf.py (BLOB)
writing: ygit/docs/source/index.rst (BLOB)
writing: ygit/docs/source/usage.rst (BLOB)
writing: ygit/misc/header.png (BLOB)
writing: ygit/misc/header.xcf (BLOB)
writing: ygit/misc/test_nginx.conf (BLOB)
>>> 
>>> repo.pull()
fetching: https://github.com/keredson/ygit.git @ HEAD
fetching commit: bc0f8c042d06f3c78be2066af11419357d1b6e0e
up to date!

Known Issues

Every object in a git repo is stored as a zlib compressed stream. Decompressing requires a 32k buffer, a serious contraint on a device w/ only ~100k available RAM. ygit only ever creates one at a time, but sometimes the MicroPython garbage collector will get overwhelmed, or memory get's fragmented, and you'll get a MemoryError. If you experience this, I've written a fork (and PR) that lets you reuse the same buffer for all objects without bothering the GC. ygit will use this automatically if available.

I've mostly seen this on repos large enough to max out flash storage or when pulling shallow=False, so it's unlikely to hit most users, but wanted to mention.

About

A tiny (yocto) git client for MicroPython.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published