Parent PRD
#1
What to build
The core of gemma4_brain.py: a process_frame(pil_image) method that makes a single Gemma 4 call and returns a navigation + scene analysis dict compatible with the existing RoverBrain/BehaviorArbiter interface. No memory injection yet — just the bare pipe working.
Acceptance criteria
Blocked by
None — can start immediately
User stories addressed
- Gemma 4 makes navigation decisions using camera vision
- Single model replaces both LLaVA and Llama 3.1
Parent PRD
#1
What to build
The core of
gemma4_brain.py: aprocess_frame(pil_image)method that makes a single Gemma 4 call and returns a navigation + scene analysis dict compatible with the existing RoverBrain/BehaviorArbiter interface. No memory injection yet — just the bare pipe working.Acceptance criteria
Gemma4Brain.process_frame(pil_image)calls Ollama/api/chatwith image_b64 and returns a dictscene,nav_decision,danger_level,reaction,objectsnav_decisionis one of: FORWARD, TURN_LEFT, TURN_RIGHT, BACKUP, STOPdanger_levelis an integer 0-3_fallback_result()(no crash)Gemma4Brain._check_ollama()returns True when gemma4:e4b is loadedBlocked by
None — can start immediately
User stories addressed