-
Notifications
You must be signed in to change notification settings - Fork 14
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Returned file contents from non-default branch is hallucinated. #19
Comments
I've taken a look at the file from your repository, and I wanted to clarify that the issue you're encountering is not due to the custom branch but rather the size of the Currently, ChatGPT is powered by the GPT-4 model, which has a context window of 8k tokens. Roughly speaking, each token translates to about 3-4 characters, placing the cap near 32k characters. However, this limit must encompass the entire conversation, including system data such as:
This means that the operational tokens available for the plugin's response and the GPT model's output are significantly fewer than 8k. When the plugin responds with a large reply, as in the case of your file's raw content, ChatGPT recognizes that it won't fit within the context as is and thus compresses the file content. The larger the file, the more aggressive the compression, leading to data loss and the "hallucinations" you've observed. To mitigate this limitation when working with large files, I've recently released improvements to the search feature. You can now pre-search a file to list types/functions defined within it and then inquire about a specific type or function. I've prepared a demo for you to illustrate this: AskTheCode Search Feature Demo. This functionality is quite new, and I must admit, I haven't yet created comprehensive documentation for it. Rest assured, I will be working on this. I encourage you to try out this feature for yourself. Please reach out to me if it works for your needs or if there's a particular workflow that isn't supported through this method. P.S. I'm currently working on the AskTheCode custom GPT, which will greatly increase the file sizes that can be processed at once. |
Repo type: public
Auth type: email
The source printed isn't from the actual file. I believe that GPT shouldn't display source in this instance either but, should instead link to the requested file? A similar response is returned from just asking to discuss rather than display the file. The hallucinated source is also semi-relevant as the requested file does relate to database access.
GPT seems to do okay if the branch is the default.
Here are the plugin logs:
The text was updated successfully, but these errors were encountered: