Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bug: It's tough to get started with a 4k context on my Windows computer #2257

Closed
louis-jan opened this issue Mar 6, 2024 · 1 comment
Closed
Assignees
Labels
type: bug Something isn't working
Milestone

Comments

@louis-jan
Copy link
Contributor

louis-jan commented Mar 6, 2024

Describe the bug
I get a very slow response and the entire OS becomes laggy when generating a response with a 4k context size (~30s TTFT). After changing the context size to 1k, it works quickly and smoothly (<1s TTFT). That was a very bad experience during the onboarding of a new user.

A better default medium context size would be ideal.

Expected behavior
The newly created thread should have the default context size set to 1024 or 2048 (depends on user's device specs).

Environment details

  • Windows 11
  • CPU: i5 10th
  • GPU: RTX 2060 6GB VRAM
  • RAM: 32GB

Logs
If the cause of the error is not clear, kindly provide your usage logs:

  • tail -n 50 ~/jan/logs/app.log if you are using the UI
  • tail -n 50 ~/jan/logs/server.log if you are using the local api server
    Making sure to redact any private information.

Additional context
Might need to add continue button support when the response reaches the limit.

@louis-jan louis-jan added the type: bug Something isn't working label Mar 6, 2024
@louis-jan louis-jan self-assigned this Mar 6, 2024
@louis-jan louis-jan added the P1: important Important feature / fix label Mar 6, 2024
@louis-jan louis-jan added this to the v0.4.9 milestone Mar 6, 2024
@Van-QA Van-QA modified the milestones: v0.4.9, v0.4.10 Mar 11, 2024
@Van-QA Van-QA modified the milestones: v0.4.10, v0.4.11 Mar 26, 2024
@Van-QA Van-QA modified the milestones: v0.4.11, v0.4.12 Apr 4, 2024
@louis-jan louis-jan removed the P1: important Important feature / fix label Apr 16, 2024
@Van-QA
Copy link
Contributor

Van-QA commented Apr 23, 2024

image
tested for the case of default context length 2048 for the imported model ✅

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
type: bug Something isn't working
Projects
Archived in project
Development

No branches or pull requests

3 participants