You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Status (after today's cog v0.0.1 ship — see PRs #642, #643, #644)
ADR-079 P7 (data collection), P8 (alignment + train + eval) and cog packaging end-to-end all ran today. The pipeline is validated and a signed cog-pose-estimation@0.0.1 binary is live at gs://cognitum-apps/cogs/{arm,x86_64}/, installed on cognitum-v0. The remaining work for a useful model is data-bound.
Per-joint PCK@50 ranks show the model is learning where the camera lets it:
r_hip 76.9% ← excellent (right side most consistently in frame)
r_knee 35.2%
l_hip 27.3%
l_elbow 26.4%
l_wrist 24.1%
l_knee 20.8%
r_shoulder 19.9%
...
nose 5.1% ← essentially random (face joints at desk-level zoom)
l_ankle 7.9%
r_ankle 9.3%
The asymmetry is a direct reading of the seated-at-desk camera framing — not a model defect. CSI at 56 subcarriers × 20 frames carries enough spatial info for proximal joints with consistent visibility; it doesn't carry enough for fine-grained extremities. More data won't fix that subcarrier-density bottleneck for fingertips / face, but multi-room full-body data will solve it for the 11 joints that today already show some signal.
Suggested data-collection plan
3 × 30-min sessions with the camera backed up so head→ankles fits in frame. Different rooms (or different times of day for the same room) to give the model spatial diversity. Vary movements: walk pattern, arm raises, sit/stand transitions, squats, reaches, lying down.
Train via the existing Candle pipeline on ruvultra's RTX 5080. Expected wall time: still well under a minute even for 30K samples / 1000 epochs.
Re-evaluate. PCK@20 should approach the 35% target if the framing + variety land.
Optimizations available within the pipeline (do not require new data)
LoRA cross-environment fine-tune (per ADR-079 P9). Today's encoder was random-initialized because the HF presence encoder's MLP architecture didn't match; with multi-room data we can train a real shared encoder first and then per-room LoRA adapters.
Status (after today's cog v0.0.1 ship — see PRs #642, #643, #644)
ADR-079 P7 (data collection), P8 (alignment + train + eval) and cog packaging end-to-end all ran today. The pipeline is validated and a signed
cog-pose-estimation@0.0.1binary is live atgs://cognitum-apps/cogs/{arm,x86_64}/, installed on cognitum-v0. The remaining work for a useful model is data-bound.What the v0.0.1 numbers tell us
Per-joint PCK@50 ranks show the model is learning where the camera lets it:
The asymmetry is a direct reading of the seated-at-desk camera framing — not a model defect. CSI at 56 subcarriers × 20 frames carries enough spatial info for proximal joints with consistent visibility; it doesn't carry enough for fine-grained extremities. More data won't fix that subcarrier-density bottleneck for fingertips / face, but multi-room full-body data will solve it for the 11 joints that today already show some signal.
Suggested data-collection plan
scripts/align-ground-truth.js(now streaming-loader-safe per fix(align): stream JSONL + support sensing_update format (unblocks ADR-079 P8) #641) to produce a multi-session paired set.Optimizations available within the pipeline (do not require new data)
Artifacts shipped today (for context)
gs://cognitum-apps/cogs/arm/cog-pose-estimation-arm+.../x86_64/....models/wifi-densepose-pretrained.safetensors→pose_v1.safetensors(507 KB) +pose_v1.onnx(12 KB).docs/benchmarks/pose-estimation-cog.md./var/lib/cognitum/apps/pose-estimation/on cognitum-v0.Acceptance criteria for closing this issue
cog-pose-estimation@0.1.0with the new weights (no code change required — same Candle inference path, just better weights).🤖 Generated with claude-flow