
New Terminal Option: --no_live_response #1278

Closed
wants to merge 4 commits into from

Conversation

Steve235lab
Contributor

Describe the changes you have made:

Add a new terminal option that lets users configure whether responses are rendered live while chunks are being received (the classic, default behavior) or rendered once after all chunks have been received (the new behavior).

Performing a single render after all chunks have been received prevents duplicate lines in the terminal, and, especially when using OI over SSH, reduces bandwidth usage and flickering.
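The difference between the two behaviors can be sketched roughly as follows (the function names and the `write` callback are illustrative only, not the PR's actual implementation):

```python
def render_live(chunks, write):
    # Classic/default behavior: repaint the whole response on every chunk.
    buffer = ""
    for chunk in chunks:
        buffer += chunk
        write(buffer)  # one full repaint per chunk received
    return buffer

def render_once(chunks, write):
    # --no_live_response behavior: buffer everything, then render one time.
    buffer = "".join(chunks)
    write(buffer)  # exactly one repaint, after the stream ends
    return buffer

# Compare how many times each strategy writes to the terminal.
chunks = ["Hello", ", ", "world", "!"]
live_writes, once_writes = [], []
render_live(chunks, live_writes.append)
render_once(chunks, once_writes.append)
```

With four chunks, the live strategy repaints four times while the one-time strategy repaints once, which is where the SSH bandwidth savings come from; both end up displaying the same final text.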

Reference any relevant issues (e.g. "Fixes #000"):

Temporarily fixes #1127

Pre-Submission Checklist (optional but appreciated):

  • I have included relevant documentation updates (stored in /docs)
  • I have read docs/CONTRIBUTING.md
  • I have read docs/ROADMAP.md

OS Tests (optional but appreciated):

  • Tested on Windows
  • Tested on MacOS
  • Tested on Linux

@Steve235lab
Contributor Author

This has annoyed me for a long time, since the very first time I used OI, and this is a not-perfect but working solution. Just pull it and give it a try; you'll know what I'm talking about.

@tyfiero
Collaborator

tyfiero commented May 23, 2024

This is so cool; it's been an issue for a while. Thanks @Steve235lab

@KillianLucas
Collaborator

Hi @Steve235lab, this is fantastic. I am annoyed by the original behavior as well! But I want to float two other solutions.

I think streaming is an important UX component of lots of modern AI systems, and I think we can fix the issue in two other ways:

  1. --plain — a flag that just removes Rich. It would merely print(chunk, end="") the chunks as plain text, more like Ollama. Would also work if someone wanted to pipe OI's output into something else. This should fix all problems, unless there's something deeper about the rate of streaming that's bad for SSH!
  2. Always printing the last 5 messages at the end of a message stream. This would fix the weird repeating behavior, I believe, because you'd scroll up, and it would be 5 solid messages printed at once. It wouldn't fix twinkling during streaming, and it wouldn't help with SSH bandwidth, but it would fix the repeating bug.
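Option 1 above could look something like this minimal sketch (the function name is hypothetical; the core of the idea is just plain writes with no Rich repainting):

```python
import io
import sys

def stream_plain(chunks, out=None):
    # --plain sketch: skip Rich entirely and append each chunk as plain text.
    # Each write is equivalent to print(chunk, end="", flush=True), so the
    # output is also safe to pipe into other programs.
    out = out or sys.stdout
    for chunk in chunks:
        out.write(chunk)
        out.flush()

# Capture the output in a buffer to show nothing is repainted or duplicated.
buf = io.StringIO()
stream_plain(["Hello", ", ", "world", "!"], out=buf)
```

Because each chunk is written exactly once and never re-rendered, scrollback stays clean and the bytes sent over SSH are just the response text itself.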

What do you think?

@Steve235lab
Contributor Author

--plain — a flag that just removes Rich. It would merely print(chunk, end="") the chunks as plain text, more like Ollama. Would also work if someone wanted to pipe OI's output into something else. This should fix all problems, unless there's something deeper about the rate of streaming that's bad for SSH!

This one is great; I will implement it later.

@Steve235lab
Contributor Author

Long time no see. It seems somebody has already implemented this, and it works well for me. Closed.

Development

Successfully merging this pull request may close these issues.

Repeating output
3 participants