THN Threat Atlas — Splunk + Defender Use Case Generator

Pulls the latest 10 articles from The Hacker News, extracts security indicators + ATT&CK techniques + kill chain phases, and auto-generates Splunk SPL (CIM-conformant) and Microsoft Defender Advanced Hunting KQL detections per article.

File map

thn-usecases/
  generate.py           ← Build the static site (index.html)
  sync.py               ← Pull authoritative sources -> registry.json
  validate.py           ← Verify generate.py SPL + ATT&CK refs are spec-valid
  data_sources/
    registry.json       ← Built by sync.py (CIM datasets, ESCU detections, ATT&CK)
    cim_spec_fields.json ← Hand-curated CIM 8.5 spec fields not seen in ESCU
    defender_spec_tables.json ← Hand-curated Defender XDR schema (Microsoft Learn)
    sync_log.json       ← Last sync metadata
    validation_report.json ← Last validation result
  CHANGELOG.md          ← Auto-appended by sync.py on each run
  index.html            ← The generated site

Workflow

+-------------+       +----------+       +---------------+
| sync.py     |  -->  | registry |  -->  | validate.py   |
| (weekly)    |       | .json    |       | (every build) |
+-------------+       +----------+       +---------------+
                            |
                            v
                      +-------------+
                      | generate.py | --> index.html
                      | (daily)     |
                      +-------------+

1) `sync.py` — pull authoritative sources

Pulls (no auth required, all public):

Splunk Security Content (ESCU) — splunk/security_content develop branch tarball. Source of truth for what CIM datasets/fields are actually used in production Splunk detection content.
MITRE ATT&CK enterprise-attack STIX bundle — mitre-attack/attack-stix-data master. Canonical list of all current technique IDs + names + tactics.

Builds data_sources/registry.json with:

cim_datasets — {full_path: [field_names]} derived from real ESCU SPL.
escu_detections — slim copy of every ESCU TTP/Anomaly/Hunting detection.
attack_techniques — {TID: {name, kill_chain_phases, deprecated}}.
attack_tactics — {shortname: long_name}.

Run weekly (or on demand):

py sync.py

2) `validate.py` — verify SPL and KQL templates

Splunk SPL checks (per UseCase):

Extracts from datamodel=X.Y and nodename="X.Y" references.
Extracts Y.field_name references.
Checks dataset path exists in ESCU usage OR cim_spec_fields.json allowlist.
Checks every field is valid against the same union.

Defender KQL checks (per UseCase):

Identifies every Defender XDR table referenced (DeviceProcessEvents, EmailEvents, AADSignInEventsBeta, etc.).
Extracts column references in | where Col, | project ... (alias-aware).
Validates against the union of "observed in microsoft/Microsoft-365-Defender-Hunting-Queries" and the hand-curated defender_spec_tables.json schema allowlist.
Flags Defender-shaped table names that aren't in the schema (e.g. typos).

ATT&CK checks:

Every technique ID exists in the current MITRE catalog and isn't deprecated.

Failure modes:

Errors (exit 1) — unknown CIM dataset/field, unknown Defender table, missing/deprecated ATT&CK ID.
Warnings — column name not found in any referenced Defender table (likely typo or alias issue).
Infos — spec-valid identifier never used in production corpus (still passes).

Run:

py validate.py

3) `generate.py` — build the site

Pulls last 10 RSS articles, runs the rule engine, renders the HTML:

py generate.py

Authoritative documentation

These are the upstream references the pipeline keeps us aligned with:

Splunk

Splunk CIM 8.5 — Overview
CIM Endpoint data model
SPL Search Reference
Search Macros
Splunk Security Content (ESCU) — gold-standard production SPL
research.splunk.com — searchable detection catalog
Splunk contentctl — Splunk's own validation CLI
CIM Add-on on Splunkbase

Microsoft Defender / Sentinel

Cross-platform

MITRE ATT&CK Enterprise
MITRE attack-stix-data (canonical JSON)
SigmaHQ rules — vendor-neutral detection format

Updating

When the CIM spec changes (8.5 → 8.6 etc.), update the cim_spec_fields.json allowlist by hand from the latest Splunk CIM docs data-model pages.

ESCU and ATT&CK refresh automatically on sync.py runs. Diff is appended to CHANGELOG.md so drift is visible.

Scheduling — running this on autopilot

Two batch wrappers are included:

Script	What it does	Suggested cadence
`run_daily.bat`	validate → generate → digest (writes `daily_digest.md`)	daily, e.g. 08:00
`run_weekly.bat`	sync → daily	weekly, e.g. Mon 06:00

Both write to logs\<flow>_YYYY-MM-DD.log.

digest.py produces:

daily_digest.md — markdown summary of new articles since last run + all High/Critical
data_sources/last_run.json — state for the next-run diff
Optional webhook POST when WEBHOOK_URL env var is set (for Slack/Teams/email-relay)

Register with Windows Task Scheduler

Run as your user (no admin needed) in an elevated cmd or PowerShell:

schtasks /create /tn "THN Daily" /tr "C:\Users\mckec\Downloads\DexCore 1.1.9\thn-usecases\run_daily.bat" /sc daily /st 08:00 /f

schtasks /create /tn "THN Weekly Sync" /tr "C:\Users\mckec\Downloads\DexCore 1.1.9\thn-usecases\run_weekly.bat" /sc weekly /d MON /st 06:00 /f

Optional — set the webhook URL system-wide before registering:

setx WEBHOOK_URL "https://hooks.example.com/your-webhook"

Verify:

schtasks /query /tn "THN Daily"
schtasks /query /tn "THN Weekly Sync"

Run on demand:

schtasks /run /tn "THN Daily"

Remove later:

schtasks /delete /tn "THN Daily" /f
schtasks /delete /tn "THN Weekly Sync" /f

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
.github/workflows		.github/workflows
briefings		briefings
catalog		catalog
data_sources		data_sources
intel		intel
rule_packs		rule_packs
rules		rules
use_cases		use_cases
.gitignore		.gitignore
.nojekyll		.nojekyll
CHANGELOG.md		CHANGELOG.md
CNAME		CNAME
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
digest.py		digest.py
generate.py		generate.py
index.html		index.html
logo.png		logo.png
requirements.txt		requirements.txt
review.py		review.py
run_daily.bat		run_daily.bat
run_weekly.bat		run_weekly.bat
sync.py		sync.py
validate.py		validate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

THN Threat Atlas — Splunk + Defender Use Case Generator

File map

Workflow

1) `sync.py` — pull authoritative sources

2) `validate.py` — verify SPL and KQL templates

3) `generate.py` — build the site

Authoritative documentation

Splunk

Microsoft Defender / Sentinel

Cross-platform

Updating

Scheduling — running this on autopilot

Register with Windows Task Scheduler

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

THN Threat Atlas — Splunk + Defender Use Case Generator

File map

Workflow

1) sync.py — pull authoritative sources

2) validate.py — verify SPL and KQL templates

3) generate.py — build the site

Authoritative documentation

Splunk

Microsoft Defender / Sentinel

Cross-platform

Updating

Scheduling — running this on autopilot

Register with Windows Task Scheduler

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

1) `sync.py` — pull authoritative sources

2) `validate.py` — verify SPL and KQL templates

3) `generate.py` — build the site

Packages