Docs: Prevent context clipping issues with local models

Improve Strix docs for running local models (especially Ollama) to clearly explain context window sizing. Many users run with very limited context to save VRAM, and Ollama often defaults to ~4096 context unless configured otherwise, which makes Strix unreliable/unusable.