bop-bpf

Box Of Pain implementation using eBPF.

Overview

This is a work in progress (i.e. README only) repo to implement (part of) box of pain using eBPF. The main motivations of using eBPF over ptrace are

eBPF is widely adopted
the kernel part of code is verified

The drawbacks are

eBPF is mainly for tracing, so the fault injection is often not doable due to its read only nature.
- it is not the case for network, BPF stands for Berkley Packet Filter so it is good at dropping packet.
the logic inside eBPF is limited (due to the verifier) so complex logic still need to be implemented in user space.

TODO

BOP specific

compare with historical trace and only keep new trace (does this operation scale?)
generate the graph and everything between the trace and the graph

BOP extension

allow the tfj (tracer + fault injector) to provide an API for integrating with external system like molly so it can trace/inject dynamically and form a feedback control loop.
collect and analysis traces across boxes

Env

Vagrant environment so people w/o a linux box (hi Mac) can develop it
cloud environment so it really works on cloud and people can run it with a browser
check how ebpf work w/ container, i.e. do we only need to load a single ebpf code for the host (vm/metal), and how to distinguish different containers in the ebpf code

Lang

BPF code can be written in a limited subset of C
for user space language, currently I'd prefer Go because
- it's easy to learn, faster to write, and cloud native
- you can't reuse user space code inside bpf code
- the go binding is being used in production so its performance should not be too bad

Trace

keep record of tcp connection, accept, connect, read, write etc.
snoop content

Fault injection

drop packet
- it seems using AF_PACKET is not working, might need to switch to qdisc

Data

bop has its own format, but I prefer using protobuf (except inside kernel) so no logic is needed for serialization and works across language.
a list of proto shard by app and time should be good. A dedicated database for tracing would be better, are there dedicated database for tracing, can we do compute inside database? Many people are using Cassandra, like they did for time series data, and they are wrong for tsdb

References

eBPF

https://github.com/zoidbergwill/awesome-ebpf

Tracing

weaveworks/tcptracer-bpf Use kprobes to traces TCP events
ntop/libebpfflow Container traffic visibility library based on eBPF
- nDPI deep packet inspection
iovisor/kubectl-trace schedule bpftrace program using kubectl

Fault injection

Code

trailofbits/ebpfault A BPF-based syscall fault injector
cilium/chaos-testing-examples Use Cillium (BPF + XDP based container network) for chaos testing and fault injection

Reading

Fault injection on k8s

pingcap/chaos-mesh A Chaos Engineering Platform for Kubernetes
- chaos-mesh/bpfki A BPF-based kernel fault injection service

License

GPL v3

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
doc		doc
playground		playground
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

bop-bpf

Overview

TODO

Related

References

eBPF

Tracing

Fault injection

Fault injection on k8s

License

About

Releases

Packages

Languages

License

at15/bop-bpf

Folders and files

Latest commit

History

Repository files navigation

bop-bpf

Overview

TODO

Related

References

eBPF

Tracing

Fault injection

Fault injection on k8s

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages