GitHub - holiman/qvm-sync: Qvm-sync is a sync protocol/implementation for QubesOS, based on qvm-copy

Qvm-sync

qvm-sync is inspired by qvm-copy and this ticket. It extends the functionality of qvm-copy by copying only missing or modified files from one qube to another.

Why go-lang

I wanted to write it in go-lang because I'm more proficient in go-lang than C, and felt more comfortable writing security-critical code using it.

It turned out that some tricks were not available from within go-lang (see the preloader readme), but I think the obstacles were overcome in the end.

Another snag was that it doesn't seem possible to

- set permissions on symlinks, nor
- set mtime/atime on symlinks (ticket).

Building it in go-lang made it easy to implement proper testcase support, and if someone wants to port it over to C, they can use my implementation as a base.

Installation

On at least one VM, you need to have go installed. On that VM, simply run install.sh, and you can later qvm-copy the binaries across VMs. The install.sh script requires the binaries and the scripts in the ./scripts folder.

How it works

How `qvm-copy` works

In order to build this, I had to dive deep into how qubes-rpc and qvm-filecopy work under the hood.

When the command qvm-copy foo is invoked, it runs a bash script, which (eventually) resolves to

qrexec-client-vm @default qubes.Filecopy /usr/lib/qubes/qfile-agent foo

The qrexec-client-vm command does this:

- asks to invoke the rpc-service qubes.Filecopy on the remote side,
- prompting Qubes OS to pop up a dialog for confirmation, and
- starts the local binary /usr/lib/qubes/qfile-agent with the argument foo.

The qfile-agent spits out the files on stdout.


$ echo "test">  test.txt
$ /usr/lib/qubes/qfile-agent test.txt | xxd
00000000: 0900 0000 b481 0000 0500 0000 0000 0000  ................
00000010: 2e28 c05d c008 9e2a 2928 c05d 8078 2514  .(.]...*)(.].x%.
00000020: 7465 7374 2e74 7874 0074 6573 740a 0000  test.txt.test...
00000030: 0000 0000 0000 0000 0000 0000 0000 0000  ................

If foo is a directory, it walks through it and spits out each file. Directories and symlinks are handled a bit differently. The transmission contains both the metadata (permissions, mtime etc) and the actual data content.
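To make the dump above easier to read, here is a minimal sketch that decodes it. The struct layout (a 32-byte little-endian header followed by the NUL-terminated name and the content) is an assumption inferred from the bytes in the dump, and the names `fileHeader`, `parseEntry` and `sampleDump` are made up for this example; they are not taken from the qfile-agent sources.

```go
package main

import (
	"bytes"
	"encoding/binary"
	"errors"
	"fmt"
	"io"
)

// fileHeader is an ASSUMED layout, inferred from the hex dump:
// 32 bytes, little-endian, followed by the name and the file content.
type fileHeader struct {
	NameLen   uint32 // length of the name, including the trailing NUL
	Mode      uint32 // file type and permission bits
	FileLen   uint64 // size of the content in bytes
	Atime     uint32
	AtimeNsec uint32
	Mtime     uint32
	MtimeNsec uint32
}

// sampleDump holds the first 46 bytes of the xxd output shown above.
var sampleDump = []byte{
	0x09, 0x00, 0x00, 0x00, 0xb4, 0x81, 0x00, 0x00,
	0x05, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00, 0x00,
	0x2e, 0x28, 0xc0, 0x5d, 0xc0, 0x08, 0x9e, 0x2a,
	0x29, 0x28, 0xc0, 0x5d, 0x80, 0x78, 0x25, 0x14,
	't', 'e', 's', 't', '.', 't', 'x', 't', 0x00,
	't', 'e', 's', 't', '\n',
}

// parseEntry decodes one header plus its name and content from buf.
func parseEntry(buf []byte) (fileHeader, string, []byte, error) {
	var h fileHeader
	r := bytes.NewReader(buf)
	if err := binary.Read(r, binary.LittleEndian, &h); err != nil {
		return h, "", nil, err
	}
	// Lengths come from the other side, so check them against what is left.
	left := uint64(r.Len())
	if h.NameLen == 0 || uint64(h.NameLen) > left || h.FileLen > left-uint64(h.NameLen) {
		return h, "", nil, errors.New("header lengths do not match the payload")
	}
	name := make([]byte, h.NameLen)
	if _, err := io.ReadFull(r, name); err != nil {
		return h, "", nil, err
	}
	data := make([]byte, h.FileLen)
	if _, err := io.ReadFull(r, data); err != nil {
		return h, "", nil, err
	}
	// Drop the trailing NUL from the name.
	return h, string(name[:len(name)-1]), data, nil
}

func main() {
	h, name, data, err := parseEntry(sampleDump)
	if err != nil {
		fmt.Println("parse error:", err)
		return
	}
	// prints: name="test.txt" mode=100664 size=5 data="test\n"
	fmt.Printf("name=%q mode=%o size=%d data=%q\n", name, h.Mode, h.FileLen, data)
}
```

Under this reading, the zero bytes that follow the content in the dump would be the start of the next header, but that part is a guess.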

On the receiving end, there's the service defined in /etc/qubes-rpc/qubes.Filecopy, which looks like this:

#!/usr/bin/sh
exec /usr/lib/qubes/qfile-unpacker

It contains a shim which, when executed as root, root-jails the actual unpacker and drops back to user.

The unpacker/parser does the actual unpacking on the receiver side. Due to the root-jail, it is incapable of writing outside of /home/user/QubesIncoming/<source-vm>/.

A rough scheme of how it works is:

A -> invokes listener on B
A -> sends [files] to B, listens for confirmation
B -> sends confirmation to A
A -> hangs up

The actual sources for this can be found here:

How `qvm-sync` works

The qvm-sync scheme is roughly similar, but:

- Instead of sending the file contents, the first send only contains the metadata.
- The metadata optionally (and by default) includes a crc32.
- As metadata is received, the receiver checks if it already has the file.
  - If it does not have the file, it adds it to the requestlist.
  - If it has the file, but the metadata differs, it adds it to the requestlist.
- The receiver sends back the requestlist, which is only a list of indexes.
- The initiator then sends each file in the requestlist over to the receiver.
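The receiver-side check described above can be sketched roughly like this. It is only an illustration: the types `meta` and `entry`, the function `buildRequestList`, and the use of an in-memory map for the local files are all assumptions made for the example, not the actual qvm-sync code.

```go
package main

import "fmt"

// meta is a hypothetical, simplified view of the per-file metadata.
type meta struct {
	Size  uint64
	Mtime uint32
	CRC32 uint32
}

// entry is one file announced by the initiator, in transmission order.
type entry struct {
	Path string
	Meta meta
}

// buildRequestList returns the indexes of the announced files that are
// either missing locally or whose metadata differs from the local copy.
func buildRequestList(remote []entry, local map[string]meta) []uint32 {
	var requests []uint32
	for i, e := range remote {
		have, ok := local[e.Path]
		if !ok || have != e.Meta {
			requests = append(requests, uint32(i))
		}
	}
	return requests
}

func main() {
	remote := []entry{
		{"a/foo", meta{Size: 5, Mtime: 100, CRC32: 0x1111}},
		{"a/b/bar", meta{Size: 7, Mtime: 200, CRC32: 0x2222}},
		{"a/new", meta{Size: 1, Mtime: 300, CRC32: 0x3333}},
	}
	local := map[string]meta{
		"a/foo":   {Size: 5, Mtime: 100, CRC32: 0x1111}, // identical
		"a/b/bar": {Size: 7, Mtime: 200, CRC32: 0xdead}, // content differs
	}
	fmt.Println(buildRequestList(remote, local)) // prints: [1 2]
}
```

Only the indexes travel back to the initiator, which is why the return value is a plain list of numbers rather than a list of paths.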

The first version was pretty dumb. It did not use crc on files, but only checked size/mtime/atime for differences.

Now, crc has been added, so the initiator places the crc32 in place of atime_nsec in the metadata. If/when the full file is sent the second time, that substitution is not made.

Thus, the same protocol message is reused.
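A minimal sketch of that substitution, under stated assumptions: the `header` struct and the `withChecksum` helper are invented for this example, and the IEEE polynomial is a guess, since the text above only says "crc32".

```go
package main

import (
	"fmt"
	"hash/crc32"
)

// header is a hypothetical, trimmed-down metadata header.
type header struct {
	FileLen   uint64
	Atime     uint32
	AtimeNsec uint32
	Mtime     uint32
	MtimeNsec uint32
}

// withChecksum returns a copy of h for the metadata-only phase, where
// the AtimeNsec slot carries the crc32 of the content instead.
// ASSUMPTION: the IEEE polynomial; the real one is not stated above.
func withChecksum(h header, content []byte) header {
	h.AtimeNsec = crc32.ChecksumIEEE(content)
	return h
}

func main() {
	content := []byte("test\n")
	full := header{FileLen: uint64(len(content)), Atime: 1, AtimeNsec: 2, Mtime: 3, MtimeNsec: 4}

	metaOnly := withChecksum(full, content)
	fmt.Printf("phase 1 (metadata): atime_nsec slot = %#08x\n", metaOnly.AtimeNsec)
	// When the full file is sent later, the header goes out unmodified.
	fmt.Printf("phase 2 (full file): atime_nsec slot = %d\n", full.AtimeNsec)
}
```

The message layout stays the same in both phases; only the meaning of one field changes, so the receiver has to know which phase it is in.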

The indices do not count directories, so for syncing the following directory:

a/
 - foo
 - b/
   - bar

The following data is sent (with indices in parentheses; the indices are not actually transmitted over the wire):

a       (none)
a/foo   (0)
a/b/    (none)
a/b/bar (1)
a/b/    (none) // end-dir marker
a/      (none) // end-dir marker
EOT     (none) // end-of-transfer marker
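The numbering in the listing above can be reproduced with a small sketch. The `kind` constants, the `item` type and `assignIndices` are hypothetical names for this example; the only rule taken from the text is that directories and markers do not consume an index.

```go
package main

import "fmt"

type kind int

const (
	kindFile kind = iota
	kindDir
	kindEndDir // end-dir marker
	kindEOT    // end-of-transfer marker
)

// item is one entry of the transmitted stream.
type item struct {
	Path string
	Kind kind
}

// noIndex marks entries that do not take part in the numbering.
const noIndex = -1

// assignIndices numbers the files in stream order, skipping
// directories, end-dir markers and the end-of-transfer marker.
func assignIndices(stream []item) []int {
	indices := make([]int, len(stream))
	next := 0
	for i, it := range stream {
		if it.Kind == kindFile {
			indices[i] = next
			next++
		} else {
			indices[i] = noIndex
		}
	}
	return indices
}

func main() {
	stream := []item{
		{"a/", kindDir},
		{"a/foo", kindFile},
		{"a/b/", kindDir},
		{"a/b/bar", kindFile},
		{"a/b/", kindEndDir},
		{"a/", kindEndDir},
		{"EOT", kindEOT},
	}
	for i, idx := range assignIndices(stream) {
		if idx == noIndex {
			fmt.Printf("%-8s (none)\n", stream[i].Path)
		} else {
			fmt.Printf("%-8s (%d)\n", stream[i].Path, idx)
		}
	}
}
```

Both sides can derive the same numbering on their own from the order of the stream, so the indices never need to be sent along with the metadata.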

Compression

qvm-sync can do compression (snappy). Example results, when syncing the go-ethereum repository (106 diffs):

 [qsync-send@work] 2019/11/14 22:30:25 Data sent, raw: 6812458, compressed: 1075905
...
 [qsync-send@work] 2019/11/14 22:30:26 Data sent, raw: 22277128, compresed: 5103620

In the first phase, 6.8M of data is sent, but only 1.1M compressed.

After the second phase, when the full files have also been sent, 22.3M of data has been sent, but only 5.1M compressed.

For the other direction, where the receiver sends data back to the originator, the effect of compression is marginal at best:

 [qsync-receive-temp-5577006791947779410@dockervm] 2019/11/14 21:30:28 Data sent, raw: 509, compresed: 441

509 versus 441 bytes.
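As a quick check of what those log lines mean in relative terms, the byte counts can be turned into ratios. The helper name `compressedPercent` is made up for this example; the numbers are copied from the logs above.

```go
package main

import "fmt"

// compressedPercent returns the compressed size as a percentage of
// the raw size. It returns 0 when nothing was sent.
func compressedPercent(raw, compressed uint64) float64 {
	if raw == 0 {
		return 0
	}
	return float64(compressed) / float64(raw) * 100
}

func main() {
	fmt.Printf("sender, after metadata phase: %.1f%%\n", compressedPercent(6812458, 1075905))
	fmt.Printf("sender, after full files:     %.1f%%\n", compressedPercent(22277128, 5103620))
	fmt.Printf("receiver, request list:       %.1f%%\n", compressedPercent(509, 441))
}
```

So the metadata stream shrinks to roughly 16% of its raw size, the combined transfer to roughly 23%, while the short list of indexes sent back only shrinks to about 87%.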

Aside from the actual compression, using Snappy encoding also brings the benefit of data checksumming. In the original qvm-copy protocol, the transferred data is also checksummed and compared post-transmission. With snappy, we get that included 'under the hood', and don't have to do the checks on the application layer.

Notes

About the protocol

The protocol is intentionally kept as close to qvm-copy as possible, and has avoided any parts that would require multithreading. The entire thing is based on four sequential steps, and thus it should be fairly simple to extend qvm-copy to implement qvm-sync, if one wanted to.

Known issues

The go-lang implementation has a few known bugs:

- It is unable to set permissions on symlinks.
- It is unable to sync mtime/atime on symlinks.

Security

The qsync-preloader is meant to be run as a suid binary, i.e. privileged. It does not contain any imports other than the golang base libraries, so no external dependencies of any kind (that goes for all three parts).

In general, go-lang is memory-safe, and typically crashes rather than continues in a bad (insecure) state if corruption occurs.

The implementation does not make use of golang unsafe pointers.

Incompatibilities with `qvm-copy`

- An initial version packet is sent from initiator to receiver. This packet contains info about (desired) verbosity, crc32 usage, compression and version.
- Snappy compression is added, if so configured.
- There is no application-layer crc to verify data transmission correctness.
- crc32 is sent in the file metadata, in place of atime_nsec.
