Skip to content

A repo to make a simple app to replay a time series table into a pub/sub system for load testing etc

License

Notifications You must be signed in to change notification settings

Data-drone/timeseries_replay

Repository files navigation

Timeseries Replay

Streaming Data frequently crosses different systems, frameworks and applications. This makes it really hard to architect for and test.

Generating fake data to load test often fails to replicate real life scenarios. Sometimes it maybe useful to take an existing events table with timestamps and replay those at the same rate (or perhaps sped up rate)

See examples folder for some how to use

Current Available Sources

Read:

  • SQLAlchemy compatible sources
  • Parquet File

Write:

  • Folder on Disk
  • Console
  • Kafka

Notes

  • If there are too many messages in parallel can bottleneck

Running Tests

  • TODO add how to trigger test in the right docker container from host commandline

# start containers
docker-compose -f docker_compose/kafka_testing.yml up


# still need to test this
# docker exec -it docker_compose_replay_service_1 /timeseries_replay/pytest

Testing Notes

Running individual tests

# run specific test file
pytest tests/<file>.py

# specific function
pytest tests/test_integration.py::test_runner_large_file

# specific function with logging to console
pytest tests/test_integration.py::test_runner_large_file -o log_cli=True

About

A repo to make a simple app to replay a time series table into a pub/sub system for load testing etc

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages