chore: launch prep v0.1.5 — merge PRs #50/#51, clean README, pin deps by devin-ai-integration[bot] · Pull Request #52 · Odingard/Argus

devin-ai-integration · 2026-04-15T06:07:36Z

Summary

Launch-day consolidation PR that merges two approved PRs and cleans up README/metadata for the v0.1.5 PyPI release.

Merged PRs:

PR chore: bump version to 0.1.4 for PyPI release #50 — version bump to 0.1.4
PR feat: Phase D broaden detection — D5-D10, T7 #51 — Phase D broaden detection (D5–D10, T7)

Cleanup on top:

Version bump 0.1.4 → 0.1.5 (pyproject.toml, src/argus/__init__.py)
README: 12→13 agents (MCP Scanner), 16→21 correlation patterns, fixed CLI quick-start (--mcp-url → --target), fixed Arena commands (argus arena start/status/scan/score), updated architecture diagram and project structure tree
CLAUDE.md: updated to 13 agents, 21 patterns, 352 tests
Pinned python-multipart>=0.0.22 for CVE-2024-53981 / CVE-2026-24486 defense-in-depth
Updated PyPI description to "13 specialized agents"

Phase D code (from PR #51):

identity_spoof.py — D7 social engineering BFLA payloads
privilege_escalation.py — D6 BOLA (Broken Object Level Authorization) payloads
model_extraction.py — D5 tool/function discovery probes
conductor/evaluation.py — new DataCategoryMatcher with PII regex patterns (IPv6, DOB, passport, medical ID)
conductor/session.py — T7 ConnectionPool (shared httpx.AsyncHTTPTransport keyed by host)
correlation/engine.py — 5 new compound attack path patterns (16→21 total)

Updates since last revision:

ConnectionPool redesigned to share transports, not clients. The original PR feat: Phase D broaden detection — D5-D10, T7 #51 code shared httpx.AsyncClient instances across sessions, which leaked cookie state between agents. The first fix (cookies=False) was also incorrect — httpx silently creates a normal cookie jar when passed False. The final fix shares httpx.AsyncHTTPTransport instances (TCP connection pool) while each ConversationSession creates its own AsyncClient on top. This correctly isolates cookie jars and per-client state while reusing TCP connections.
_owns_client flag now tracks actual pooled transport usage. Previously set to self._pool is None, which was wrong when both a pool and an explicit transport were provided (the explicit transport would be used but the client wouldn't be closed, leaking it). Now uses a used_pooled_transport boolean that tracks which branch of the if/elif actually ran, so the flag is only False when a pooled transport is genuinely in use.
CLAUDE.md updated — 12→13 agents, 16→21 correlation patterns, 163→352 test count.

Review & Testing Checklist for Human

Transport lifecycle: aclose() destroys shared transports — Confirmed via httpx source that AsyncClient.aclose() unconditionally calls self._transport.aclose(). The _owns_client flag prevents this for pooled sessions, but verify the flag logic handles all combinations: (pool+no transport → pooled, pool+explicit transport → owned, no pool+transport → owned, no pool+no transport → owned).
Leaked client when _owns_client=False — When using a pooled transport, __aexit__ sets self._client = None without calling aclose(). The AsyncClient object is abandoned for GC rather than explicitly cleaned up. The transport (the expensive resource) is managed by the pool, but the client itself may hold minor resources. Decide whether this is acceptable or if a lightweight cleanup path is needed.
Transport lifecycle vs. session lifecycle — pool.close_all() closes all transports. If any ConversationSession is still open when the pool is closed, its client will have a dead transport. Verify the orchestrator always awaits all session __aexit__ calls before calling pool.close_all().
PII regex patterns in evaluation.py — The IPv6 regex has 4 alternatives with lookaheads. Verify it doesn't false-positive on hex strings or UUIDs in normal agent responses. Same for passport/medical ID patterns.
Agent/pattern count accuracy — Verify README/CLAUDE.md claims match code: 13 agents in agents/__init__.py registry, 21 patterns in correlation/engine.py _COMPOUND_PATTERNS list.

Suggested test plan: pip install -e . → argus scan --help (verify --target flag) → pytest tests/ -v → ruff check src/ tests/ → spot-check that from argus import __version__; assert __version__ == "0.1.5".

Notes

Version jumps 0.1.3→0.1.5 because 0.1.4 (PR chore: bump version to 0.1.4 for PyPI release #50) is folded in here. If 0.1.4 was already published to PyPI, this is fine; if not, consider whether skipping a version matters for your release process.
The BFLA/BOLA payloads are static string lists — no runtime LLM calls — so they're low-risk functionally but should be reviewed for attack pattern quality/realism.
Stale branch devin/1776222564-phase-c-attack-power was deleted as part of Git cleanup.

Link to Devin session: https://app.devin.ai/sessions/8b0c5ca873934d77aa254157cc41924c
Requested by: @andrebyrd-odingard

D5: Tool/Function Discovery payloads for model_extraction agent - d5_tool_schema_extraction, d5_function_call_probing, d5_capability_enumeration - Updated _evaluate_response() and _record_intelligence() for D5 techniques D6: BOLA Payloads for privilege_escalation agent - 4 BOLA techniques: numeric IDOR, UUID swap, path traversal, mass assignment - _test_bola() and _report_bola() methods D7: Social Engineering BFLA for identity_spoof agent - 5 techniques: CEO urgency, compliance pressure, helpdesk, developer debug, time pressure - _test_social_engineering_bfla(), _evaluate_bfla_response(), _report_bfla() D8: PII Detection Expansion in DataCategoryMatcher - Phone numbers, SSN, credit cards (Visa/MC/Amex/Discover), IPv4, IPv6 - Date of birth, passport numbers, medical record IDs D10: Correlation Agent — 5 new compound attack path patterns - BOLA + model_extraction, BFLA + identity_spoof + priv_esc - Tool discovery + prompt injection, BOLA + cross-agent exfil - BFLA + memory poisoning T7: Connection Pooling in ConversationSession - ConnectionPool class with shared httpx.AsyncClient instances - Keyed by (host, timeout), singleton pattern, scan-scoped lifecycle - ConversationSession accepts optional pool= parameter

… key 1. BFLA evaluation: when both refusal and compliance keywords are present but no hard evidence (markers/priv_indicators), treat as refusal. Fixes false positives where refusal messages mention 'password', 'secret', etc. 2. ConnectionPool cache key: include csrf_mode in the (host, timeout, csrf_mode) key to prevent incorrect client configuration when sessions with different csrf_mode values share the same pool.

1. pii_phone: add word boundaries and require at least one separator to avoid matching timestamps and numeric IDs. 2. pii_passport/pii_medical_id: require mandatory colon/equals separator and at least one digit in value via lookahead, preventing matches on English words like 'passport details' or 'patient id unknown'. 3. ConnectionPool __aexit__: always clear self._client = None regardless of _owns_client, so the use-after-exit guard in turn() fires correctly for pooled sessions.

Restructure _attack_base() so that: - Chat endpoints are fetched independently of identity endpoints - Early return only fires when NEITHER identity nor chat surfaces exist - D7 BFLA tests run whenever chat endpoints are available, regardless of whether identity endpoints exist

…rkers) Soft privilege indicators like 'admin' commonly appear in refusal text (e.g. 'I cannot grant you admin access'). Previously priv_indicators alone could override the refusal classification, causing false positives. Now only sensitive markers (leaked keys/tokens) override a refusal.

\b doesn't fire adjacent to :: because both : and start-of-string are non-word characters. Replaced with explicit lookaround anchors. Added fourth alternative to handle mid-address :: (e.g. fe80::1).

…-detection' into devin/1776233032-launch-prep

…ts, 21 patterns), pin deps

devin-ai-integration · 2026-04-15T06:07:39Z

🤖 Devin AI Engineer

I'll be helping with this pull request! Here's what you should know:

✅ I will automatically:

Address comments on this PR. Add '(aside)' to your comment to have me ignore it.
Look at CI failures and help fix them

Note: I can only respond to comments from users who have write access to this repository.

⚙️ Control Options:

Disable automatic comment and CI monitoring

…oss-session state leakage

…ion) + update CLAUDE.md counts

… transport destruction

… presence

andrebyrd-odingard added 9 commits April 15, 2026 04:31

chore: bump version to 0.1.4 for PyPI release

3cfe6f4

fix: IPv6 regex — use lookaround anchors instead of \b for :: forms

17ee53f

\b doesn't fire adjacent to :: because both : and start-of-string are non-word characters. Replaced with explicit lookaround anchors. Added fourth alternative to handle mid-address :: (e.g. fe80::1).

Merge remote-tracking branch 'origin/devin/1776227588-phase-d-broaden…

82b54ae

…-detection' into devin/1776233032-launch-prep

chore: launch prep v0.1.5 — merge PRs #50/#51, update README (13 agen…

cba7656

…ts, 21 patterns), pin deps

devin-ai-integration bot assigned andrebyrd-odingard Apr 15, 2026

This comment was marked as resolved.

Sign in to view

fix: disable cookie persistence on pooled httpx clients to prevent cr…

ffcf5a2

…oss-session state leakage

This comment was marked as resolved.

Sign in to view

fix: ConnectionPool shares transport not client (proper cookie isolat…

f5f0db0

…ion) + update CLAUDE.md counts

This comment was marked as resolved.

Sign in to view

fix: _owns_client=False when using pooled transport to prevent shared…

d56120a

… transport destruction

This comment was marked as resolved.

Sign in to view

fix: _owns_client tracks actual pooled transport usage, not just pool…

368d60d

… presence

andrebyrd-odingard merged commit e3dbb8c into main Apr 15, 2026
7 checks passed

This was referenced Apr 15, 2026

chore: bump version to 0.1.4 for PyPI release #50

Open

feat: Phase D broaden detection — D5-D10, T7 #51

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

chore: launch prep v0.1.5 — merge PRs #50/#51, clean README, pin deps#52

chore: launch prep v0.1.5 — merge PRs #50/#51, clean README, pin deps#52
andrebyrd-odingard merged 13 commits intomainfrom
devin/1776233032-launch-prep

devin-ai-integration bot commented Apr 15, 2026 •

edited

Loading

Uh oh!

devin-ai-integration bot commented Apr 15, 2026

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

devin-ai-integration bot commented Apr 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Review & Testing Checklist for Human

Notes

Uh oh!

devin-ai-integration bot commented Apr 15, 2026

🤖 Devin AI Engineer

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

This comment was marked as resolved.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

devin-ai-integration bot commented Apr 15, 2026 •

edited

Loading