FinGuard

A Lightweight Guard/Verify Wrapper for Safer Financial Assistants

What Is FinGuard?

FinGuard is a two-layer safety wrapper for financial assistants: a pre-generation guard plus a post-generation verifier. It is not a RAG system, a fine-tuned model, or a replacement agent. Instead, FinGuard wraps an existing Hermes-style assistant and improves refusal correctness, numeric traceability, and over-refusal control without modifying the underlying model. We evaluate the same wrapper on Gemma 4 31B and Qwen3.5-27B local model endpoints.

Pipeline

docs/pipeline.png is reserved for the final exported paper figure. Until that asset is added, the pipeline is:

TODO: replace this ASCII placeholder with docs/pipeline.png

User Query
   |
   v
FinGuard Layer (Pre-Generation)
   |  query classification, injection detection, temporal intent
   v
Hermes Agent Core (Unchanged)
   |  local model generation
   v
FinVerify Layer (Post-Generation)
   |  numeric claim checks, source matching, downgrade/disclaimer
   v
Final Response + Structured Metadata
   |
   v
Benchmark Observer

Key Results

Model	System	Aligned Behavior-Safe	Over-Refusal
Gemma 31B	vanilla	0.933	0.019
Gemma 31B	finguard	0.989	0.000
Qwen3.5-27B	vanilla	0.922	0.154
Qwen3.5-27B	finguard	1.000	0.000

For full results including the naive RAG baseline and category breakdown, see our paper.

Quick Start

git clone https://github.com/DuckTraDo/finguard.git
cd finguard
uv venv venv --python 3.11
source venv/bin/activate  # Windows: venv\Scripts\activate
uv pip install -e ".[dev]"
python -m pytest tests/finguard -q

Reproduce Paper Results

Prerequisites:

A local llama.cpp, vLLM, or compatible server running at http://localhost:18080/v1.
The endpoint must expose an OpenAI-compatible chat completions API.
The served model should be Gemma 4 31B or Qwen3.5-27B for direct comparison with the paper.

Run the 90-case local smoke benchmark:

# Run vanilla baseline (90 cases)
python -m finguard.benchmark_smoke \
  --dataset-path benchmarks/finguard/local_comparison_v3.jsonl \
  --baseline-mode vanilla \
  --run-profile benchmark_local_smoke_profile \
  --output-dir benchmarks/finguard/live_vanilla \
  --limit 90 \
  --max-tokens 192

# Run FinGuard baseline (90 cases)
python -m finguard.benchmark_smoke \
  --dataset-path benchmarks/finguard/local_comparison_v3.jsonl \
  --baseline-mode finguard \
  --run-profile benchmark_local_smoke_profile \
  --output-dir benchmarks/finguard/live_finguard \
  --limit 90 \
  --max-tokens 192

Results are written to benchmarks/finguard/live_*/ as rows.jsonl and summary.json.

Project Structure

finguard/
|-- finguard/                 # Core FinGuard package
|   |-- fin_guard.py          # Pre-generation guard layer
|   |-- fin_classifier.py     # Query classification (rule + LLM hybrid)
|   |-- fin_verify.py         # Post-generation verification
|   |-- fin_utils.py          # Source normalization, numeric helpers
|   |-- config.py             # Feature flags and thresholds
|   `-- benchmark_smoke.py    # Benchmark runner
|-- skills/finance/           # Financial skills (Hermes format)
|   |-- fin-source-citation/
|   `-- fin-temporal-awareness/
|-- benchmarks/finguard/      # Benchmark datasets, results, and paper assets
|-- tests/finguard/           # Test suite (77+ tests)
|-- docs/                     # Documentation
|   `-- finguard-behavior-matrix.md
|-- run_agent.py              # Hermes agent with FinGuard hooks
`-- README.md

How It Works

FinGuard Layer (pre-generation). FinGuard classifies each query as factual, compliance-sensitive, operational, or injection-like. It detects prompt-injection patterns, extracts temporal intent, and assigns a second-level expected_behavior label for compliance-sensitive requests, such as refuse_with_disclaimer or answer_with_disclaimer.

FinVerify Layer (post-generation). FinVerify extracts numeric claims from the model response and checks whether each number is supported by normalized sources. If support is insufficient, it downgrades the certainty of the answer and adds a verification note. For compliance-sensitive answers, it enforces disclaimer behavior before the final response is returned.

Benchmark Observer. The observer records both raw and aligned refusal patterns, separating metadata refusal from visible refusal. It also distinguishes behavior-safe outputs from metadata-aligned outputs, which makes it easier to diagnose whether an error is a real unsafe answer or an instrumentation/taxonomy mismatch.

Citation

@article{lu2026finguard,
  title={FinGuard: A Lightweight Guard/Verify Wrapper for Safer Financial Assistants},
  author={Lu, Yuxin and Lin, Huijia},
  journal={arXiv preprint arXiv:XXXX.XXXXX},
  year={2026}
}

License

MIT License, inherited from Hermes Agent.

Acknowledgments

FinGuard is built on top of Hermes Agent by Nous Research. We add two runtime hooks and a benchmark harness; the Hermes core agent loop is unchanged. See the original Hermes documentation in docs/hermes-original-README.md.

Name		Name	Last commit message	Last commit date
Latest commit History 4,861 Commits
.github		.github
.plans		.plans
acp_adapter		acp_adapter
acp_registry		acp_registry
agent		agent
assets		assets
benchmarks/finguard		benchmarks/finguard
cron		cron
datagen-config-examples		datagen-config-examples
docker		docker
docs		docs
environments		environments
finguard		finguard
gateway		gateway
hermes_cli		hermes_cli
nix		nix
optional-skills		optional-skills
packaging/homebrew		packaging/homebrew
plans		plans
plugins		plugins
scripts		scripts
skills		skills
tests		tests
tinker-atropos @ 65f084e		tinker-atropos @ 65f084e
tools		tools
tui_gateway		tui_gateway
ui-tui		ui-tui
web		web
website		website
.dockerignore		.dockerignore
.env.example		.env.example
.envrc		.envrc
.gitattributes		.gitattributes
.gitignore		.gitignore
.gitmodules		.gitmodules
.mailmap		.mailmap
AGENTS.md		AGENTS.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
RELEASE_v0.10.0.md		RELEASE_v0.10.0.md
RELEASE_v0.2.0.md		RELEASE_v0.2.0.md
RELEASE_v0.3.0.md		RELEASE_v0.3.0.md
RELEASE_v0.4.0.md		RELEASE_v0.4.0.md
RELEASE_v0.5.0.md		RELEASE_v0.5.0.md
RELEASE_v0.6.0.md		RELEASE_v0.6.0.md
RELEASE_v0.7.0.md		RELEASE_v0.7.0.md
RELEASE_v0.8.0.md		RELEASE_v0.8.0.md
RELEASE_v0.9.0.md		RELEASE_v0.9.0.md
SECURITY.md		SECURITY.md
batch_runner.py		batch_runner.py
cli-config.yaml.example		cli-config.yaml.example
cli.py		cli.py
constraints-termux.txt		constraints-termux.txt
flake.lock		flake.lock
flake.nix		flake.nix
hermes		hermes
hermes-already-has-routines.md		hermes-already-has-routines.md
hermes_constants.py		hermes_constants.py
hermes_logging.py		hermes_logging.py
hermes_state.py		hermes_state.py
hermes_time.py		hermes_time.py
mcp_serve.py		mcp_serve.py
mini_swe_runner.py		mini_swe_runner.py
model_tools.py		model_tools.py
package-lock.json		package-lock.json
package.json		package.json
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
rl_cli.py		rl_cli.py
run_agent.py		run_agent.py
setup-hermes.sh		setup-hermes.sh
toolset_distributions.py		toolset_distributions.py
toolsets.py		toolsets.py
trajectory_compressor.py		trajectory_compressor.py
utils.py		utils.py
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FinGuard

What Is FinGuard?

Pipeline

Key Results

Quick Start

Reproduce Paper Results

Project Structure

How It Works

Citation

License

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

FinGuard

What Is FinGuard?

Pipeline

Key Results

Quick Start

Reproduce Paper Results

Project Structure

How It Works

Citation

License

Acknowledgments

About

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages