Improve the Strix docs for running local models (especially Ollama) so they clearly explain context window sizing. Many users run with a very small context window to save VRAM, and Ollama often defaults to a context of about 4096 tokens unless configured otherwise, which makes Strix unreliable or unusable.
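As a starting point for the docs, here is a minimal sketch of the two usual ways to raise Ollama's context size: per request through the `options.num_ctx` field of the `/api/generate` body, or baked into a derived model with a Modelfile `PARAMETER num_ctx` line. The helper names, the `llama3.1` model tag, and the 32768 value are illustrative assumptions. This issue does not state how much context Strix actually needs, and how Strix itself passes options to Ollama is not shown here.

```python
import json

# Illustrative value only (assumption): the context size Strix needs is not
# specified in this issue, so pick a value that fits your model and VRAM.
EXAMPLE_NUM_CTX = 32768


def build_generate_payload(model: str, prompt: str, num_ctx: int = EXAMPLE_NUM_CTX) -> dict:
    """Build a request body for Ollama's /api/generate with an explicit context size."""
    if num_ctx <= 0:
        raise ValueError("num_ctx must be a positive number of tokens")
    return {
        "model": model,
        "prompt": prompt,
        "stream": False,
        # num_ctx overrides the server-side default context window for this request.
        "options": {"num_ctx": num_ctx},
    }


def build_modelfile(base_model: str, num_ctx: int = EXAMPLE_NUM_CTX) -> str:
    """Render a Modelfile that sets the context size on a derived model."""
    return f"FROM {base_model}\nPARAMETER num_ctx {num_ctx}\n"


if __name__ == "__main__":
    # "llama3.1" is a placeholder model tag, not a recommendation.
    print(json.dumps(build_generate_payload("llama3.1", "hello"), indent=2))
    print(build_modelfile("llama3.1"))
```

The Modelfile text can be saved and registered with `ollama create <name> -f Modelfile`, after which the derived model carries the larger context without any per-request option. A larger `num_ctx` increases memory use, so the docs should probably say that explicitly for users who are short on VRAM.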