
dp3: multimodal log database

dp3 is an experimental database for management of multimodal log data, such as logs produced by sensors and internal processing logic on robots.

It is under development as a class project in Joe Hellerstein's CS286 class at UC Berkeley and is not suitable for production usage.

Quickstart

Requirements

  • Go 1.22

The following instructions will start dp3 with a data directory on local disk.

  1. Build the dp3 binary.
    make build
  2. Start the server.
    mkdir data
    ./dp3 server --data-dir data
  3. Start the interpreter and type help to get started.
$ ./dp3 client

__        _      _____ 
\ \    __| |_ __|___ / 
 \ \  / _` | '_ \ |_ \ 
 / / | (_| | |_) |__) |
/_/   \__,_| .__/____/ 
           |_|         
Type "help" for help.

dp3:[default] # help
The dp3 client is an interactive interpreter for dp3.  dp3 is a
multimodal log database for low-latency playback and analytics.

The client supports interaction via either queries or dot commands. The
supported dot commands are:
  .h [topic] to print help text. If topic is blank, prints this text.
  .connect [database] to connect to a database
  .statrange to run a statrange query
  .import to import data to the database
  .delete to delete data from the database
  .tables to inspect tables available in the database

Available help topics are:
  query: Show examples of query syntax.
  statrange: Explain the .statrange command.
  import: Explain the .import command.
  delete: Explain the .delete command.
  tables: Explain the .tables command.

Any input aside from "help" that does not start with a dot is interpreted as
a query. Queries are terminated with a semicolon.
dp3:[default] #
dp3:[default] # .import my-robot example-data/fix.mcap
dp3:[default] #
dp3:[default] # from my-robot /fix limit 1;
{"topic":"/fix","sequence":13,"log_time":1479512770.309617340,"publish_time":1479512770.309617340,"data":{"header":{"seq":1,"stamp":1479512770.308916091,"frame_id":"/imu"},"status":{"status":0,"service":0},"latitude":37.39973449707031,"longitude":-122.108154296875,"altitude":-8.731653213500977,"position_covariance":[0,0,0,0,0,0,0,0,0],"position_covariance_type":0}}

Background

Multimodal log data may be characterized by:

  • High frequencies
  • Large volumes
  • Highly variable message sizes and schemas (images, pointclouds, text logs, numeric measurements, compressed video, ...)
  • Various different message encodings (protobuf, ros1msg, cdr, flatbuffers, ...)
  • Timeseries orientation

Common workloads on the data are:

  • Stream data in time order, from a starting point in time, for a particular device or simulation, on a selection of "topics", into a visualization application such as webviz or Foxglove Studio, or export it to a local file for viewing with rviz or other local analysis. The selection of topics in this kind of query can be wide - frequently in the dozens or hundreds, and occasionally in the thousands, depending on the architecture of the producer.
  • Run heavy computational workloads on a "narrow" selection of topics, for instance distributed Spark jobs over hundreds of terabytes of images. This workload may care little or not at all about ordering, but cares a lot about throughput, and that the available throughput, availability, and cost scalability effectively match those of the underlying storage and networking, whether on-premise or in the cloud.
  • Summarize individual message field values at multiple granularities, at low enough latencies to drive responsive web applications. Consider for instance the plotting facilities available in Datadog or cloud monitoring systems like CloudWatch or GCP Stackdriver, which can (sometimes) plot multigranular statistical aggregates spanning many weeks or years and trillions of points in under a second.

dp3 attempts to address all three of these in a single easy-to-administer solution.

Architecture

The architecture of dp3 is inspired by btrdb. It differs in that it supports multimodal data and multiplexed playback, and in drawing a slightly different contract with its consumers -- one based on "topics" and "producer IDs" rather than "stream IDs".

In large deployments, dp3 is envisioned as a component within a greater domain-specific data infrastructure. However, in smaller deployments the hope is that incorporating topics and producers in the core data model will let organizations use dp3 "right off the bat" without secondary indices.

Glossary

  • Producer ID: a unique identifier assigned by the user to the producer of some data. For instance, a device identifier or a simulation run ID.
  • Topic: a string identifying a logical channel in the customer's data stream. For instance, "/images" or "/imu". See http://wiki.ros.org/Topics for more information on how topics relate to robot architecture.
  • MCAP: a heterogeneously schematized binary log container format. See https://mcap.dev/.

Multigranular summarization

dp3's underlying storage is a time-partitioned tree spanning a range of time from the epoch to a future date. The typical height of the tree is 5 but it can vary based on parameter selection. Data is stored in the leaf nodes, and the inner nodes contain pointers to children as well as statistical summaries of children. Data consists of nanosecond-timestamped messages.

In the service, nodes are cached on read in an LRU cache of configurable byte capacity. In production deployments, this cache should be sized so that the most important inner nodes fit within it at all times. Multigranular summarization requires traversing the tree down to a sufficiently granular height, and then scanning the statistical summaries at that height for the requested range of time. If the cache is performing well, this operation can be served entirely from RAM.
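
As a sketch of the idea, a statrange-style query can stop descending as soon as a node's span is at or below the requested granularity and read off the precomputed child summaries. The Go types and function below are hypothetical illustrations, not dp3's actual internals:

    // Hypothetical sketch of multigranular summarization over a
    // time-partitioned tree. Names are illustrative, not dp3's real types.
    package main

    import "fmt"

    // Summary is the statistical rollup an inner node keeps per child.
    type Summary struct {
        Count    int64
        Min, Max float64
    }

    // Node covers [Start, End) in nanoseconds. Inner nodes hold children
    // and one summary per child; leaf message storage is elided here.
    type Node struct {
        Start, End int64
        Children   []*Node
        Summaries  []Summary // parallel to Children
    }

    // statRange gathers summaries for [start, end), descending only until
    // a node's span is at or below the requested granularity.
    func statRange(n *Node, start, end, gran int64, out *[]Summary) {
        for i, c := range n.Children {
            if c == nil || c.End <= start || c.Start >= end {
                continue // subtree outside the query window
            }
            if c.End-c.Start <= gran || len(c.Children) == 0 {
                *out = append(*out, n.Summaries[i]) // granular enough
                continue
            }
            statRange(c, start, end, gran, out)
        }
    }

    func main() {
        leafA := &Node{Start: 0, End: 500}
        leafB := &Node{Start: 500, End: 1000}
        root := &Node{
            Start: 0, End: 1000,
            Children:  []*Node{leafA, leafB},
            Summaries: []Summary{{Count: 10, Min: 1, Max: 3}, {Count: 7, Min: 2, Max: 9}},
        }
        var out []Summary
        statRange(root, 0, 1000, 500, &out)
        fmt.Println(out) // [{10 1 3} {7 2 9}]
    }

Coarser granularities terminate higher in the tree and touch fewer nodes, which is what keeps multigranular queries cheap when the inner nodes are cached.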

Low-latency playback

Input files are associated with a producer ID by the user. During ingestion they are split by topic, and messages are routed to a tree associated with that topic and the producer ID. Merged playback on a selection of topics requires simultaneous scans of one tree per topic, feeding into a streaming merge.
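
One way to picture the streaming merge is as a heap-based k-way merge over per-topic cursors ordered by log time. The sketch below is a hypothetical illustration; dp3's real scans stream from trees and produce MCAP records, not in-memory slices:

    // Hypothetical sketch of merged playback: a k-way streaming merge
    // over per-topic scans, each yielding messages in log-time order.
    package main

    import (
        "container/heap"
        "fmt"
    )

    // Message is a simplified record; real playback emits MCAP records.
    type Message struct {
        Topic   string
        LogTime int64 // nanoseconds
    }

    // cursor stands in for one per-topic tree scan.
    type cursor struct {
        msgs []Message
        pos  int
    }

    func (c *cursor) head() Message { return c.msgs[c.pos] }
    func (c *cursor) done() bool    { return c.pos >= len(c.msgs) }

    // mergeHeap orders cursors by the log time of their head message.
    type mergeHeap []*cursor

    func (h mergeHeap) Len() int           { return len(h) }
    func (h mergeHeap) Less(i, j int) bool { return h[i].head().LogTime < h[j].head().LogTime }
    func (h mergeHeap) Swap(i, j int)      { h[i], h[j] = h[j], h[i] }
    func (h *mergeHeap) Push(x any)        { *h = append(*h, x.(*cursor)) }
    func (h *mergeHeap) Pop() any {
        old := *h
        c := old[len(old)-1]
        *h = old[:len(old)-1]
        return c
    }

    // merge emits messages from all cursors in global log-time order.
    func merge(cursors []*cursor, emit func(Message)) {
        h := &mergeHeap{}
        for _, c := range cursors {
            if !c.done() {
                heap.Push(h, c)
            }
        }
        for h.Len() > 0 {
            c := heap.Pop(h).(*cursor)
            emit(c.head())
            c.pos++
            if !c.done() {
                heap.Push(h, c)
            }
        }
    }

    func main() {
        fix := &cursor{msgs: []Message{{"/fix", 10}, {"/fix", 30}}}
        imu := &cursor{msgs: []Message{{"/imu", 5}, {"/imu", 20}, {"/imu", 40}}}
        merge([]*cursor{fix, imu}, func(m Message) { fmt.Println(m.Topic, m.LogTime) })
    }

Each pop emits the globally next message and re-inserts the advancing cursor, so memory usage stays proportional to the number of topics rather than the volume of data.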

Read scalability for ML jobs

The query execution logic of dp3 can be implemented in fat client libraries in other languages such as Python or Scala. Large, heavy read jobs can use one of these clients to do their work. The ML cluster simply needs to query the server for the current root location, which can be done once and then passed to the job.

With the dp3 server out of the way, all the heavy reading goes straight to S3 and can scale accordingly. This mode of operation does come with compromises - clients access storage directly, which complicates ACL management - but these complexities may be preferable to running an expensive, dynamically scaling service that, for many of these workloads, might as well be doing S3 passthrough.
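
A hypothetical fat-client flow might look like the sketch below. The endpoint, port, and response format are invented for illustration; dp3 does not necessarily expose this exact API:

    // Hypothetical fat-client read path: ask the dp3 server once for the
    // current root, then fetch tree nodes directly from object storage.
    // The endpoint and port below are invented for illustration.
    package main

    import (
        "fmt"
        "io"
        "net/http"
    )

    // rootLocation asks the server where the current tree root lives.
    func rootLocation(server string) (string, error) {
        resp, err := http.Get(server + "/root") // hypothetical endpoint
        if err != nil {
            return "", err
        }
        defer resp.Body.Close()
        key, err := io.ReadAll(resp.Body)
        return string(key), err
    }

    func main() {
        key, err := rootLocation("http://localhost:8089") // port illustrative
        if err != nil {
            fmt.Println("root lookup failed:", err)
            return
        }
        // From here, the job reads node objects straight from S3 by key,
        // scaling with the object store rather than the dp3 server.
        fmt.Println("root object:", key)
    }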

MCAP-based

Data in the leaf nodes is stored in MCAP format. Initial focus is on ros1msg-serialized messages, but this should be extensible to other formats in use. The format of the playback datastream is also MCAP.

Users who are already producing MCAP files, such as ROS 2 users, will have automatic compatibility between dp3 and all of their internal data tooling. The message bytes logged by the device are exactly the ones stored in the database.

Users of ROS 1 bag files can try dp3 by converting their bags to MCAP with the mcap CLI tool.
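
For example, assuming the mcap CLI is installed and a bag named demo.bag (the filename is illustrative):

    mcap convert demo.bag demo.mcap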

Developers

We use golangci-lint for linting. To install it, follow the directions here: https://golangci-lint.run/usage/install.

Run the tests

To run the tests:

    make test

To run the linter:

    make lint

Build the binary

    make build

Profiling the server

Profiles can be generated with the pprof webserver on port 6060. For example:

  • Heap snapshot
    go tool pprof http://localhost:6060/debug/pprof/heap
  • CPU profile
    go tool pprof 'http://localhost:6060/debug/pprof/profile?seconds=30'
  • Goroutine blocking
    go tool pprof http://localhost:6060/debug/pprof/block
  • Mutex contention
    go tool pprof http://localhost:6060/debug/pprof/mutex
  • Function tracing
    curl -o trace.out 'http://localhost:6060/debug/pprof/trace?seconds=5'
    go tool trace trace.out

See https://pkg.go.dev/net/http/pprof for additional information about pprof.
