SAT solver ANalyzer

An attempt at building a toolbox for analyzing performance and runtime characteristics of SAT solvers. This project is currently in its initial development stage.

In collaboration with and initiated by Benjamin Kaiser and Robert Clausecker. Developed by Cobalt.

Current state

  • Metrics database:
    • DuckDB (direct and batched)
    • SQLite
    • (planned, most likely in clustered scenarios) ClickHouse OR PostgreSQL
    • (planned feature) Merging, i.e., taking multiple metric sets and compiling them into a single database
  • config:
    • YAML, able to express executors, sets of solvers and sets of test sets (see below)
  • executors:
    • local parallel executor: Supervises locally spawned SAT solvers with a thread pool (rayon-based, configurable concurrency, supports thread pinning; see the sketch after this list)
      • The executor only parallelizes the execution of the tests themselves, i.e., it is parallel on the data level. This means that discovering the tests and preparing the data for the solvers may be limited to a single thread. This may change in the future but is sufficient for the current test suites.
    • (planned, WIP) SLURM executor
  • tests:
    • tests are grouped into test sets and identified as files via a glob that is matched within one or more paths (with ignore support).
    • Test sets can be supersets of other sets, e.g., a test set c containing the tests from sets a and b.
    • GBD collector coming soonish
  • ingest:
    • (WIP) minisat and cadical are planned as a first step
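
For illustration, the pinning mechanism roughly looks like the following sketch. This is not the project's actual code; it only shows how a rayon thread pool with optional CPU pinning (via the core_affinity crate) can be wired up, with the threads and pinned parameters mirroring the executor config keys described below.

use rayon::prelude::*;

// Illustrative sketch (not taken from satan-runner): build a worker pool with a
// configurable thread count and optional pinning of workers to logical CPUs.
fn build_pool(threads: usize, pinned: bool) -> rayon::ThreadPool {
    // Logical CPUs that worker threads may be pinned to
    let cores = core_affinity::get_core_ids().unwrap_or_default();
    rayon::ThreadPoolBuilder::new()
        .num_threads(threads)
        .start_handler(move |idx| {
            if pinned {
                if let Some(core) = cores.get(idx % cores.len().max(1)) {
                    core_affinity::set_for_current(*core);
                }
            }
        })
        .build()
        .expect("failed to build thread pool")
}

fn main() {
    let pool = build_pool(4, true);
    // In the runner, each parallel item would spawn and supervise a solver process.
    let results: Vec<u32> = pool.install(|| (0u32..8).into_par_iter().map(|i| i * i).collect());
    println!("{results:?}");
}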

Runtime behavior (WIP)

Runner

  1. Read & Deserialize config
  2. Check the configuration for consistency etc. (so-called pre-flight checks)
  3. Construct tasks, i.e., figure out which tests exist and map them to solvers
  4. Start executing
  5. (if necessary) Tear down stateful components like database connections
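
The following is a compilable sketch of these five phases. All types and helpers are stand-ins for illustration only and do not reflect the runner's real API; in particular, the task construction is sketched as a plain solvers x tests cross product, while the real mapping is driven by the config described below.

use std::path::PathBuf;

// Stand-in config and task types; the real runner deserializes its config from YAML.
struct Config {
    solvers: Vec<String>,
    tests: Vec<PathBuf>,
}

struct Task {
    solver: String,
    test: PathBuf,
}

// 2. Pre-flight checks: catch inconsistencies before any solver is started.
fn preflight(config: &Config) -> Result<(), String> {
    if config.solvers.is_empty() {
        return Err("no solvers configured".into());
    }
    Ok(())
}

// 3. Construct tasks, here simply pairing every test with every solver.
fn build_tasks(config: &Config) -> Vec<Task> {
    config
        .solvers
        .iter()
        .flat_map(|solver| {
            config.tests.iter().map(move |test| Task {
                solver: solver.clone(),
                test: test.clone(),
            })
        })
        .collect()
}

fn main() -> Result<(), String> {
    // 1. Read & deserialize config (stubbed with literals here)
    let config = Config {
        solvers: vec!["cadical".into()],
        tests: vec![PathBuf::from("example.cnf")],
    };
    preflight(&config)?;
    let tasks = build_tasks(&config);
    // 4. Start executing; the real runner hands the tasks to the configured executor
    for task in &tasks {
        println!("would run {} on {}", task.solver, task.test.display());
    }
    // 5. Teardown of stateful components (e.g. flushing and closing the metrics database)
    Ok(())
}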

Building

Runner

Required dependencies:

  • A Rust toolchain (MSRV: 1.68.2, stable), recommended: rustup
  • A C compiler (invocable as cc) for the bundled DuckDB; it needs to be in PATH while cargo builds the runner
  • OpenSSL development files, usually found in package repositories as libssl-dev or openssl-devel
  • (optional) DuckDB header files, may be found in your package repositories as duckdb-dev

Building:

  • To create a debug build with the system DuckDB (placed in target/debug/satan-runner): cargo b --package satan-runner
  • To create a debug build with the bundled DuckDB (placed in target/debug/satan-runner): cargo b --package satan-runner --features bundled-duckdb
  • To create a release, i.e. optimized, build with the system DuckDB (placed in target/release/satan-runner): cargo b --package satan-runner --release
  • To create a release build with the bundled DuckDB (placed in target/release/satan-runner): cargo b --package satan-runner --release --features bundled-duckdb

Config

The configuration is written in YAML; other formats may be supported in the future.

# Executor, variant selected with a YAML tag (see the sketch after this config):
# Supported variants:
# - Local
#   - pinned: bool -> pin threads to logical CPUs (default: false)
#   - threads: integer -> number of threads in the thread pool (default: number of logical CPUs)
# - Distributed: A wrapper for distributed usage of the local executor
#   - synchronization: tagged enum (see below) ->
#      - Coordinated: Elect a leader runner and let it coordinate work over MPI, this node may also process tasks at the same time
#      - FilesystemLocks: Use SQLite with filesystem locks to coordinate the work queue. This method relies on the filesystem to handle file locking and employs a temporary SQLite database for distributing work
#        - path: string -> path to SQLite database (must be prepared beforehand AND be available on all compute nodes)
executor: !Local
  pinned: true

# Configuration for metric storage driver, the same tag handling applies here too
# - DuckDB: Uses DuckDB, an sqlite-like file based database, for storage. Recommended for local setups.
#           This driver is, like SQLite, limited to one write at a time and works with an internal Mutex.
#   - path: string -> path to duckdb file
# - SQLite: Uses SQLite for storage. Recommended for local setups with high iteration count.
#   - path: string -> path to sqlite file
# - Batched: Uses the DuckDB driver with a buffer, intended for local setups with medium throughput
#   - path: string -> path to duckdb file
#   - size: unsigned integer -> size of buffer (default: 100)
# - Clickhouse: Uses ClickHouse as a full DBMS for metric storage. Recommended for distributed setups.
#   - { todo!() }
database: !DuckDB
  path: satan.db

# Whether to insert metrics directly after ingesting or as a bulk insert after all tests are executed
delayed: false

# Configuration for ingest driver, the same tag handling applies here too
# - Exec: A script that takes the output of the solver as stdin and produces metrics to stdout
#   - timeout: unsigned integer -> timeout in ms for ingest script (default: 5000 ms)
#   - executable: string -> path to ingest executable
# - Null: The solver outputs the TestMetrics natively in YAML as stdout
ingest:
  cadical: !Exec
    timeout: 2000
    executable:  ./ingestors/cadical.py
  minisat: !Exec
    timeout: 2000
    executable:  ./ingestors/minisat.py

# Map of solvers <name>:<solver attributes>
solvers:
  cadical:
    # Path to binary for executing SAT solver
    # NOTE: will be executed with: <exec> <solver params> <test set params> <test file>
    exec: <path to solver bin>
    # Name of ingest module, see ingest.cadical above
    ingest: cadical
    # solver params: appended directly after the executable and before the test set params
    params: ""

# Map of test sets <name>:<test set attributes>
tests:
  cadical-tests:
    # Collectors: Components that retrieve and prepare the DIMACS test files
    # - Glob: Collect files from a directory by a glob (old default)
    # - GBD: Collect tests from a GBD compatible web host and save as local files (todo)
    # - Grouped: See below
    collector: !Glob
      # Glob for selecting files in path(s)
      glob: "*.cnf"
      # directory to search (can be used with or without paths)
      path: ./solvers/cadical/test/cnf
      # directories to search (alternative or in addition to path)
      paths:
        - ./solvers/cadical/test/cnf
        - ./solvers/cadical/test/cnf-2
  grouped-cadical-tests:
    # - Grouped: Union of the tests from the referenced collectors (note: nested groups are not possible atm)
    collector: !Grouped
      collectors:
        - cadical-tests
    # unsigned integer -> timeout for test executions in ms
    # will overwrite solver timeout for this set of tests (optional)
    timeout: 10000
    # number of times each test is executed (default: 1)
    iterations: 10
    # params that are appended after solver params and before the test file
    params: ""

Attribution

The (temporary) logo is the imp emoji from the Firefox Emoji set, licensed under CC BY 4.0 by the Mozilla Foundation (Copyright 2015).

Some of the source code for the pooled connection managers was inspired by ivanceras' r2d2-sqlite crate. No code was re-used or borrowed, but it still deserves a shoutout for its good documentation.
