Chronos: An Automatical Testing Framework for Finding Timeout Bugs in Distributed Systems by Self-Adaptive Delay Model.
Delays are inevitable in complex distributed environments. Timeout mechanisms are commonly used to handle unexpected failures in distributed systems. However, incorrect timeout handling or implementation errors in timeout mechanisms can lead to system hang-ups or crashes. Such timeout bugs may be crucial and pose a significant threat to the availability and security of distributed systems.
In this work, we introduce Chronos, a general testing framework for automatically detecting timeout bugs in distributed systems with deep-priority transient delays. First, we propose general runtime delayed libraries that dynamically inject fine-grained delays in a Distributed System Under Test (DSUT). To effectively trigger delays and constantly explore timeout bugs in deep paths, Chronos harnesses a deep-priority guided fuzzing that dynamically generates high-quality delay sequences in the runtime. Then, Chronos utilizes transient delays to eliminate the time overhead caused by actual delays and accelerate the test process.
Directory libs includes runtime delay libraries.
Directory includes workloader for HDFS, ZooKeeper, MySQL-Cluster, Geth
- Setup HDFS environment, can be found in https://hadoop.apache.org/docs/r1.2.1/hdfs_user_guide.html.
- Replace the runtime libraries with our delayed libraries in your environment.
- Setup a test network
- Start HDFS workloader
- Setup ZooKeeper environment, can be found in https://zookeeper.apache.org/documentation.html.
- Replace the runtime libraries with our delayed libraries in your environment.
- Setup a test network
- Start ZooKeeper workloader
- Setup Go-Etheruem environment, can be found in https://geth.ethereum.org/docs.
- Replace the runtime libraries with our delayed libraries in your environment.
- Setup a test network
- Start Geth workloader
- Setup MySQL-Cluster environment, https://dev.mysql.com/doc/index-cluster.html.
- Replace the runtime libraries with our delayed libraries in your environment.
- Setup a test network
- Start MySQL-Cluster workloader
Create an issue for questions and bug reports.