Sample app response is slow like 10-20s sometimes #999

Closed
poweihuang0817 opened this issue May 15, 2023 · 4 comments
@poweihuang0817
Contributor

Describe the bug
The sample app is sometimes slow to respond, taking 10-20 s. This happens even though I am using a paid OpenAI account, so the cause is probably too many round trips.

  Request finished HTTP/2 POST https://localhost:40443/skills/ChatSkill/functions/Chat/invoke application/json 1132 - 200 - application/json;+charset=utf-8 103352.8697ms


@craigomatic
Contributor

It looks like you're using the Copilot Chat sample. For the moment, to speed things up you can edit appsettings.json and set Planner:Enabled to false. While this will disable plugins and some other interesting features, you will get a faster response.
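
For illustration, a minimal sketch of that change. It assumes the sample's appsettings.json follows the standard .NET configuration convention, where the colon in Planner:Enabled denotes nesting. Any other keys in the Planner section or elsewhere in the file are omitted here and should be left as they are:

```json
{
  "Planner": {
    "Enabled": false
  }
}
```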

We are continuing to explore strategies to improve the response time and will likely add chat streaming to Copilot Chat in the near future (#829 added streaming support to SK).

@poweihuang0817
Contributor Author

But with the planner disabled, could we still invoke a custom skill? Would we be able to select one skill from many? It looks like the answer is no?

@adrianwyatt
Contributor

We recently removed the Planner:Enabled setting. The planner is now disabled automatically when the user has not enabled any plugins on the frontend (#1151).

@adrianwyatt
Contributor

The lag in chat responses comes from the chat app making 1-2 round trips to the AI (3 if a plugin/planner is enabled). We also use the AI /embeddings endpoint to create long-term memories, which adds at least one more round trip. After the Build conference we will look at how to bring this response time down through parallelization and a different memories architecture.
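
To illustrate why parallelization could help, here is a rough Python sketch. It is not the Copilot Chat implementation. The call names and latencies are hypothetical stand-ins, and it assumes the round trips do not depend on each other, which will not hold for every call (a final chat completion typically needs the results of earlier steps). Sequential awaits cost roughly the sum of the latencies, while concurrent calls cost roughly the slowest one.

```python
import asyncio
import time


async def fake_round_trip(name: str, seconds: float) -> str:
    """Stand-in for one network call to the AI service (hypothetical)."""
    await asyncio.sleep(seconds)
    return name


async def run_sequential(calls):
    """Await each call in turn: total latency is roughly the sum."""
    return [await fake_round_trip(name, secs) for name, secs in calls]


async def run_parallel(calls):
    """Start all calls at once: total latency is roughly the slowest call."""
    return await asyncio.gather(*(fake_round_trip(n, s) for n, s in calls))


def timed(coro):
    """Run a coroutine to completion and return (result, elapsed seconds)."""
    start = time.perf_counter()
    result = asyncio.run(coro)
    return result, time.perf_counter() - start


# Hypothetical, scaled-down latencies; real calls take seconds, not milliseconds.
CALLS = [("intent", 0.05), ("embeddings", 0.05), ("memories", 0.05)]

if __name__ == "__main__":
    _, seq = timed(run_sequential(CALLS))
    _, par = timed(run_parallel(CALLS))
    print(f"sequential: {seq:.2f}s, parallel: {par:.2f}s")
```

Only calls that are truly independent can be grouped this way; anything that consumes another call's output still has to wait for it.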

adrianwyatt self-assigned this on May 24, 2023