The Protobuf Data Generator creates realistic valid and invalid payloads for Protocol Buffer messages. It reads constraint annotations directly from your .proto files (for example, Protovalidate or Nanopb rules) and deterministically assembles data that either satisfies or intentionally violates those rules. The library is ideal for fuzzing, regression testing, and golden-data generation across embedded and backend protobuf workloads.
- Deterministic valid payloads generated from constraint metadata.
- Targeted invalid samples that break explicit rules (e.g., min/max, length, uniqueness).
- Constraint backends for Protovalidate and Nanopb, with a lightweight parser that understands enums and repeated fields.
- Formatter outputs for C arrays, raw bytes, JSON, and hexadecimal encodings.
- Showcase fixtures (tests/fixtures/showcase.proto) illustrating the full feature set end-to-end.
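To make the formatter outputs listed above concrete, here is a minimal sketch of what hexadecimal and C-array renderings of an encoded payload could look like. It does not use the library: `to_hex` and `to_c_array` are hypothetical helpers written for illustration, and the exact layout produced by the real formatters may differ.

```python
def to_hex(data: bytes) -> str:
    """Return the payload as a lowercase hexadecimal string."""
    return data.hex()


def to_c_array(data: bytes, name: str, per_line: int = 12) -> str:
    """Render the payload as a C uint8_t array plus a length constant."""
    rows = []
    for start in range(0, len(data), per_line):
        chunk = data[start:start + per_line]
        rows.append("    " + ", ".join(f"0x{b:02x}" for b in chunk))
    body = ",\n".join(rows)
    return (
        f"const uint8_t {name}[] = {{\n{body}\n}};\n"
        f"const size_t {name}_len = {len(data)};\n"
    )


# Hand-encoded protobuf bytes: field 1 (varint) = 150, field 2 (string) = "hi".
sample = bytes([0x08, 0x96, 0x01, 0x12, 0x02, 0x68, 0x69])

print(to_hex(sample))
print(to_c_array(sample, "showcase_payload"))
```

The C-array form is the one typically pasted into embedded test harnesses, while the hex form is convenient for golden files and diffs.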
pip install protobuf-data-generator
Supported Python versions: 3.8 through 3.13. Older Python releases (3.7 and below) are no longer tested.
For local development:
git clone https://github.com/OfekiAlm/protobuf-data-generator.git
cd protobuf-data-generator
pip install -r requirements-dev.txt
from protobuf_test_generator import DataGenerator
generator = DataGenerator(
    "tests/fixtures/showcase.proto",
    include_paths=["tests/fixtures"],
    constraints_type="protovalidate",  # or "nanopb"
)
valid_payload = generator.generate_valid("Showcase", seed=42)
invalid_payload = generator.generate_invalid(
    "Showcase",
    violate_field="email",
    violate_rule="min_len",
    seed=42,
)
binary_blob = generator.encode_to_binary("Showcase", valid_payload)
c_array = generator.format_output(binary_blob, "c_array", "showcase_payload")
- tests/fixtures/showcase.proto: comprehensive proto covering numeric, string, enum, repeated, and nested-field constraints.
- tests/test_showcase.py: integration test demonstrating parsing, generation, validation, and formatting steps.
- The helper validate.proto shipped alongside the fixtures is a minimal stub replicating the option names used in the official protovalidate descriptors. It exists solely to exercise constraint parsing in tests.
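As a rough illustration of what seeded generation and a targeted `min_len` violation mean, the sketch below builds a string that either satisfies or deliberately breaks a length rule. This is not the library's implementation: `make_string` and its parameters are hypothetical, and the real generator derives its bounds from the constraint annotations in the .proto file.

```python
import random
import string


def make_string(min_len: int, max_len: int, seed: int, violate_min: bool = False) -> str:
    """Build a reproducible string that satisfies or deliberately breaks min_len."""
    rng = random.Random(seed)  # private RNG, so global random state is untouched
    if violate_min:
        if min_len == 0:
            raise ValueError("a min_len of 0 cannot be violated")
        length = rng.randint(0, min_len - 1)  # strictly shorter than allowed
    else:
        length = rng.randint(min_len, max_len)  # inside the allowed range
    return "".join(rng.choice(string.ascii_lowercase) for _ in range(length))


valid = make_string(min_len=5, max_len=20, seed=42)
invalid = make_string(min_len=5, max_len=20, seed=42, violate_min=True)

print(len(valid), repr(valid))
print(len(invalid), repr(invalid))
```

Using a dedicated `random.Random(seed)` instance is what makes repeated calls with the same seed return identical data, which is the property that golden-data and regression tests rely on.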
protobuf-data-generator \
--proto-file tests/fixtures/showcase.proto \
--message Showcase \
--format json
You can also supply the proto file and message as positional arguments if you prefer:
protobuf-data-generator tests/fixtures/showcase.proto Showcase --format json
Optional flags:
- -I / --include path: repeatable include directories for proto imports.
- --invalid --field FIELD --rule RULE: produce a payload that violates a specific rule.
- --seed N: lock generation to deterministic output.
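When the CLI is driven from a test script, it can help to assemble the argument list in one place. The sketch below only uses the flags documented above; `build_cli_args` is a hypothetical helper, and the assumption that `--field` and `--rule` must be given together is mine, not something the tool is known to enforce.

```python
import shlex
from typing import List, Optional, Sequence


def build_cli_args(
    proto_file: str,
    message: str,
    output_format: str = "json",
    include_paths: Sequence[str] = (),
    invalid_field: Optional[str] = None,
    invalid_rule: Optional[str] = None,
    seed: Optional[int] = None,
) -> List[str]:
    """Assemble an argv list for the protobuf-data-generator command."""
    args = [
        "protobuf-data-generator",
        "--proto-file", proto_file,
        "--message", message,
        "--format", output_format,
    ]
    for path in include_paths:
        args += ["-I", path]  # -I is repeatable, one entry per directory
    if invalid_field is not None and invalid_rule is not None:
        args += ["--invalid", "--field", invalid_field, "--rule", invalid_rule]
    elif invalid_field is not None or invalid_rule is not None:
        # Assumption: a violation needs both a field and a rule.
        raise ValueError("invalid_field and invalid_rule must be given together")
    if seed is not None:
        args += ["--seed", str(seed)]
    return args


args = build_cli_args(
    "tests/fixtures/showcase.proto",
    "Showcase",
    include_paths=["tests/fixtures"],
    invalid_field="email",
    invalid_rule="min_len",
    seed=42,
)
print(shlex.join(args))  # shell-quoted form, handy for logs
```

The resulting list can be passed to `subprocess.run(args, check=True)` once the package is installed; passing a list rather than a single string avoids shell-quoting problems with unusual paths.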
black --check src tests
flake8 src tests
mypy src
pytest
See the CHANGELOG for release history.
Issues and pull requests are welcome! Please discuss substantial changes in an issue before opening a PR.
Distributed under the MIT License. See LICENSE.