
Two Step Edits #530

Draft
wants to merge 17 commits into main
Conversation

biobootloader
Member

  • move _stream_model_response out of Conversation and into new file

Pull Request Checklist

  • Documentation has been updated, or this change doesn't require that

)

return parsed_llm_response

async def get_model_response(self) -> ParsedLLMResponse:

The get_model_response method has been significantly simplified by moving the logic for streaming model responses into the stream_model_response.py file. This is a good refactor: it reduces the complexity of the Conversation class and adheres to the single responsibility principle. However, make sure all model-response handling is thoroughly tested in its new location to prevent regressions.

stream.send(line)


async def stream_model_response_two_step(

The stream_model_response_two_step function contains several TODO comments that suggest incomplete implementation, such as handling interrupts, tracking costs of all calls, and logging API calls. It's crucial to address these TODOs before merging this feature into the main branch to ensure the two-step edit process is fully functional and robust.
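As a rough illustration, the cost of the extra calls could be accumulated along these lines (a minimal sketch only; the `CostTracker` class and per-1K-token prices here are assumptions, not the project's existing cost-tracking API):

```python
from dataclasses import dataclass, field


@dataclass
class CostTracker:
    """Accumulates the cost of every LLM call made during a two-step edit."""

    per_call_costs: list[float] = field(default_factory=list)

    def record(
        self,
        prompt_tokens: int,
        completion_tokens: int,
        prompt_price_per_1k: float,
        completion_price_per_1k: float,
    ) -> float:
        # Record one call's cost and return it; prices are per 1K tokens.
        cost = (prompt_tokens / 1000) * prompt_price_per_1k + (
            completion_tokens / 1000
        ) * completion_price_per_1k
        self.per_call_costs.append(cost)
        return cost

    @property
    def total_cost(self) -> float:
        return sum(self.per_call_costs)
```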


list_files_response = await llm_api_handler.call_llm_api(
    list_files_messages,
    model="gpt-3.5-turbo-0125",  # TODO add config for secondary model

Using a hardcoded model (gpt-3.5-turbo-0125) for the two-step edit process might limit the flexibility of this feature. Consider adding a configuration option for the secondary model used in the two-step edit process. This would allow users to choose the model that best fits their needs and potentially improve the accuracy and efficiency of the edits.
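For example, a config field along these lines would let users pick the secondary model (a sketch only; the `two_step_edit_model` field name and the `Config` dataclass shown here are hypothetical, not the project's actual configuration):

```python
from dataclasses import dataclass


@dataclass
class Config:
    # Primary model used for normal edits.
    model: str = "gpt-4-0125-preview"
    # Secondary model used for the list-files and rewrite-file calls in the
    # two-step edit flow; falls back to the primary model if unset.
    two_step_edit_model: str | None = "gpt-3.5-turbo-0125"


def secondary_model(config: Config) -> str:
    return config.two_step_edit_model or config.model
```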


mentat-butler bot commented Mar 5, 2024

MENTAT CODE REVIEW IN ACTIVE DEVELOPMENT. Only in use on mentat and internal repos.
Please Reply with feedback.

This pull request introduces a significant refactor and a new feature for handling two-step edits with the LLM. The changes overall are well-structured, and the separation of concerns between files is improved. However, there are several areas, particularly in the new stream_model_response_two_step function, that require attention to ensure the feature is complete and robust. Addressing the hardcoded model usage, ensuring TODOs are resolved, and handling potential assertion failures gracefully are key areas to focus on before finalizing this feature.


list_files_response = await llm_api_handler.call_llm_api(
    list_files_messages,
    model="gpt-3.5-turbo-0125",  # TODO add config for secondary model

Consider implementing the interrupt functionality for the two-step edit process to enhance user experience and control. This would allow users to stop the model's response if it's taking too long or if they notice an issue early on.
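As a sketch of the kind of check this could use (assuming the per-file loop can poll a cancellation event; `interrupted` and `rewrite_file` are hypothetical names, not the existing interface):

```python
import asyncio
from typing import Awaitable, Callable


async def rewrite_files(
    files: list[str],
    rewrite_file: Callable[[str], Awaitable[str]],
    interrupted: asyncio.Event,
) -> dict[str, str]:
    """Rewrite each file in turn, stopping early if the user interrupts."""
    results: dict[str, str] = {}
    for path in files:
        # Check between per-file LLM calls so partial progress is kept
        # rather than discarded when the user cancels.
        if interrupted.is_set():
            break
        results[path] = await rewrite_file(path)
    return results
```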


rewrite_file_response = await llm_api_handler.call_llm_api(
    rewrite_file_messages,
    model="gpt-3.5-turbo-0125",  # TODO add config for secondary model

It's important to handle potential assertion failures gracefully in production code. Consider adding error handling for cases where the expected rewrite_file_response format does not match the assumptions (e.g., missing ``` markers). This could include logging a warning and skipping the file or providing a clear error message to the user.
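One defensive alternative to the assertion, sketched under the assumption that the rewrite response wraps the new file contents in a single ``` fence (the helper name is hypothetical):

```python
import logging


def extract_code_block(response_text: str) -> str | None:
    """Return the contents of the first ``` fenced block, or None if absent."""
    start = response_text.find("```")
    if start == -1:
        logging.warning("Rewrite response contained no ``` fence; skipping file.")
        return None
    # Skip the rest of the opening fence line (it may carry a language tag).
    newline = response_text.find("\n", start)
    end = response_text.find("```", newline) if newline != -1 else -1
    if end == -1:
        logging.warning("Rewrite response had an unterminated ``` fence; skipping file.")
        return None
    return response_text[newline + 1 : end]
```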


list_files_response = await llm_api_handler.call_llm_api(
    list_files_messages,
    model="gpt-3.5-turbo-0125",  # TODO add config for secondary model

To ensure the feature's robustness, consider adding tests specifically for the two-step edit process. These tests should cover various scenarios, including successful edits, user interruptions, and handling of unexpected model responses.
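For instance, the response-parsing edge cases alone could be covered with small unit tests like these (building on the hypothetical `extract_code_block` helper sketched above; full tests would drive `stream_model_response_two_step` with a stubbed LLM client):

```python
def test_extract_code_block_returns_fenced_contents():
    text = "Here is the file:\n```python\nprint('hi')\n```\n"
    assert extract_code_block(text) == "print('hi')\n"


def test_extract_code_block_returns_none_without_fence():
    # A malformed response should be skipped, not crash the whole edit.
    assert extract_code_block("no fence here") is None


def test_extract_code_block_returns_none_for_unterminated_fence():
    assert extract_code_block("```python\nprint('hi')") is None
```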


mentat-butler bot commented Mar 6, 2024

MENTAT CODE REVIEW IN ACTIVE DEVELOPMENT. Only in use on mentat and internal repos.
Please Reply with feedback.

The introduction of the two-step edit process is a promising addition that enhances the flexibility and capability of the system. However, careful handling of edge cases, user control, and testing will be crucial to its success and reliability. Addressing the concerns above and implementing the suggested improvements will go a long way toward making this feature robust and satisfying to use.
