**Requirements** - Token streaming - partial rendering - cancel response - retry generation **Stack** - React - Python gRPC - LLM API