
Update memories.py #4864

Closed
john-gitdev wants to merge 7 commits into BasedHardware:main from john-gitdev:delete-memories-fix

Conversation

@john-gitdev
Collaborator

@john-gitdev john-gitdev commented Feb 18, 2026

delete associated transcripts when a summary (conversation) is deleted

delete associated transcripts when a summary is deleted
Contributor

@gemini-code-assist (bot) left a comment


Code Review

This pull request introduces functionality to automatically delete associated conversation transcripts and their corresponding vector embeddings when a memory is deleted. This is a good step towards maintaining data consistency and preventing orphaned data in the system.

Comment thread on backend/routers/memories.py (outdated)
add try and throw error

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
@john-gitdev john-gitdev marked this pull request as draft February 18, 2026 07:49
update to delete audio blob of the deleted transcript/summary
updated error log to printf to match conversations.py
update to delete audio for single transcript and summary
update to remove audio files related to deleted conversations and summaries
@john-gitdev john-gitdev marked this pull request as ready for review February 18, 2026 16:45
@john-gitdev
Collaborator Author

john-gitdev commented Feb 18, 2026

updated storage and conversations to delete the associated audio file as well

@john-gitdev
Collaborator Author

/gemini review

Contributor

@gemini-code-assist (bot) left a comment


Code Review

This pull request enhances the deletion logic for memories and conversations. When a memory is deleted, it now correctly triggers the deletion of the associated conversation, its vector embeddings, and all related audio files. Similarly, deleting a conversation now also cleans up its associated audio files. The implementation uses try-except blocks to ensure that failures in deleting associated files do not cause the main deletion operation to fail, which is a good resilient pattern. My feedback focuses on improving logging by using the standard logging module instead of print for better error tracking and consistency.

try:
    delete_conversation_audio_files(uid, conversation_id)
    delete_conversation_recording(uid, conversation_id)
except Exception as e:
    print(f"Failed to delete audio files for conversation {conversation_id}: {e}")
Contributor


high

For better observability and consistency with other parts of the application (like routers/memories.py), please use the logging module instead of print for error messages. This will ensure errors are properly captured by your logging infrastructure. You'll need to add import logging at the top of the file.

Suggested change
print(f"Failed to delete audio files for conversation {conversation_id}: {e}")
logging.error(f"Failed to delete audio files for conversation {conversation_id}: {e}")

try:
    delete_conversation_recording(uid, conversation_id)  # memories_recordings_bucket
except Exception as e:
    # Log the error, but don't block the memory deletion as it's already done
    print(f"Failed to delete conversation {conversation_id} or its vector for memory {memory_id}: {e}")
Contributor


high

This file already imports and uses the logging module. For consistency and to ensure errors are properly tracked, please use logger.error or logger.warning instead of print.

Suggested change
print(f"Failed to delete conversation {conversation_id} or its vector for memory {memory_id}: {e}")
logger.error(f"Failed to delete conversation {conversation_id} or its vector for memory {memory_id}: {e}")

@john-gitdev
Collaborator Author

@aaravgarg @mdmohsin7 thoughts? up to you on how to handle error logging

@beastoin
Collaborator

Hey 👋 — thanks for putting this together! Before we can review, could you share a quick live demo (screenshot, screen recording, or terminal output) showing this working on your local or dev environment?

In the AI era, writing code is the easy part — what really makes a PR stand out is proof that it works end-to-end. A short video or even a screenshot goes a long way in helping reviewers feel confident about merging.

Feel free to update this PR whenever you have something to show. Thanks! 🙏

@john-gitdev
Collaborator Author

john-gitdev commented Feb 19, 2026

https://www.dropbox.com/scl/fi/2hzcfwh0u8f83gur9bsax/screen-20260219-114550-1771530281340.mp4?rlkey=ah25vqy6zmh3ifzk14qpscjbb&dl=0

Here you can see a transcript with a mention of 'mega potion'.

I ask omi if I mentioned 'mega potion' and it says yes.

I deleted the summary, and then omi says no, I didn't mention mega potion.

I can't verify the backend audio was deleted. If you want me to remove that code snippet so that just the transcript gets removed, I can do that; you'd just have orphaned audio data on your backend.

@beastoin

@mdmohsin7
Member

@john-gitdev don't you think we should let the user decide whether to also delete the conversation (while deleting the memory) or not? Some might just want to delete the memory and not the conversation?

@john-gitdev
Collaborator Author

@john-gitdev don't you think we should let the user decide whether to also delete the conversation (while deleting the memory) or not? Some might just want to delete the memory and not the conversation?

For me personally, if I delete a conversation, I don't want 'ask omi' to reference it at all anymore. However, I can see your point. I could either ask Discord users for their preference, or add a toggle in the developer settings (under experimental) that controls whether the transcript is deleted when the conversation is deleted.

@github-actions
Contributor

Hey @john-gitdev 👋

Thank you so much for taking the time to contribute to Omi! We truly appreciate you putting in the effort to submit this pull request.

After careful review, we've decided not to merge this particular PR. Please don't take this personally — we genuinely try to merge as many contributions as possible, but sometimes we have to make tough calls based on:

  • Project standards — Ensuring consistency across the codebase
  • User needs — Making sure changes align with what our users need
  • Code best practices — Maintaining code quality and maintainability
  • Project direction — Keeping aligned with our roadmap and vision

Your contribution is still valuable to us, and we'd love to see you contribute again in the future! If you'd like feedback on how to improve this PR or want to discuss alternative approaches, please don't hesitate to reach out.

Thank you for being part of the Omi community! 💜

@john-gitdev john-gitdev deleted the delete-memories-fix branch February 23, 2026 08:22
@beastoin
Collaborator

We see you provided a video demo, engaged constructively with the UX feedback, and proposed multiple solutions. The discussion stalled on our side — that's on us. If you'd like to revisit this, we're happy to continue the conversation.

@beastoin
Collaborator

Hey @john-gitdev — thanks for your patience, and sorry again that the discussion stalled on our end while you were actively engaging.

We've done a deeper code review of your diff and wanted to share feedback so you have a clear path forward if you'd like to resubmit:

What's good:

  • The conversation-level audio cleanup (delete_conversation_audio_files) is solid — cleaning up orphaned audio blobs is a real need.
  • Your demo video clearly showed the feature working end-to-end. The "mega potion" test was a smart way to prove the delete cascaded to the LLM context.

Items that need attention:

1. Over-deletion risk (high)
A single conversation can generate multiple memories (the backend creates separate memory entries from one transcript). Your current implementation deletes the entire conversation when any one of its memories is deleted. This means deleting Memory A could also wipe the transcript context for Memory B, C, etc. — which the user didn't intend.

@mdmohsin7 raised this exact concern, and after reviewing the code we think he was right: the user should have a choice, or at minimum the cascade should only happen when the last memory linked to that conversation is deleted.

2. API consistency (medium)
The cascade behavior would exist in the v3 memories route, but other memory-delete paths (like the MCP route) still delete only the memory. This inconsistency could confuse API consumers.

3. Error handling (low)
Gemini's earlier suggestion about using logging instead of print for error output is worth adopting for production consistency.

Suggested path forward:

  • Scope the cascade: only delete the conversation when the user explicitly deletes the conversation itself, or when the last linked memory is removed
  • Keep the audio cleanup logic — it's good and addresses real storage orphaning
  • Consider the toggle approach you proposed to @mdmohsin7
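
The "last linked memory" scoping above can be sketched in a few lines. Plain dicts stand in for the Firestore collections in backend/database/memories.py, and all names here are illustrative, not the real backend API: the conversation is cascade-deleted only when no surviving memory still references it.

```python
# Dict stand-ins for the memory and conversation stores (illustrative only).
memories = {
    "mem_a": {"conversation_id": "conv_1"},
    "mem_b": {"conversation_id": "conv_1"},
}
conversations = {"conv_1": {"transcript": "..."}}

def delete_memory(memory_id):
    """Delete a memory; cascade to its conversation only if it was the last reference."""
    conversation_id = memories.pop(memory_id)["conversation_id"]
    # Check whether any surviving memory still references the conversation.
    still_referenced = any(
        m["conversation_id"] == conversation_id for m in memories.values()
    )
    if not still_referenced:
        conversations.pop(conversation_id, None)

delete_memory("mem_a")
assert "conv_1" in conversations      # Memory B still needs the transcript
delete_memory("mem_b")
assert "conv_1" not in conversations  # last reference gone, safe to cascade
```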

Your demo was honest — you even noted "I can't verify backend audio was deleted" which was transparent and appreciated. The core idea is sound; it just needs tighter scoping around user intent.

Happy to discuss further if you'd like to pick this back up.

@beastoin
Collaborator

@john-gitdev Some pointers on the items mentioned above:

Over-deletion: The core issue is that one conversation can have multiple memories. Before deleting the conversation, you'd want to check if any other memories still reference the same conversation_id. Look at how memories are stored in backend/database/memories.py — a query filtering by conversation_id excluding the memory being deleted will tell you if it's safe to cascade.

API consistency: Right now the cascade would only exist in the v3 memories route. The MCP route (backend/routers/mcp.py) also has a memory delete endpoint that doesn't cascade. Either add the same logic there, or consider keeping cascade behavior only on conversation delete (not memory delete) — which is cleaner semantically.

User choice (mdmohsin7's point): An optional query parameter on the delete endpoint (defaulting to no cascade) would let the caller decide. That way the default behavior is safe and explicit.
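
The opt-in flag could look like this. In FastAPI it would be a query parameter such as `cascade: bool = False` on the delete route; the sketch below uses dict stand-ins and hypothetical names, not the real backend:

```python
# Dict stand-ins for the stores (illustrative only).
memories = {"mem_1": {"conversation_id": "conv_1"}}
conversations = {"conv_1": {"transcript": "..."}}

def delete_memory(memory_id, cascade=False):
    """Delete a memory; delete its conversation only when the caller opts in."""
    conversation_id = memories.pop(memory_id)["conversation_id"]
    if cascade:
        conversations.pop(conversation_id, None)

delete_memory("mem_1")          # default path: the conversation survives
assert "conv_1" in conversations
```

Defaulting to `cascade=False` keeps existing API consumers unaffected; only callers that explicitly pass the flag get the new behavior.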

Regression test idea: Two memories sharing one conversation — delete one memory, verify conversation still exists. Delete the second, verify conversation is cleaned up.

Happy to discuss the approach if you'd like to pick this back up.

@john-gitdev
Collaborator Author

thanks @beastoin

Actually, I was wrong and my video was not valid. This was a bug I had been tracking since last December.

However, the behavior was already fixed by this commit: 6569495

The bug existed specifically in the version using search_conversations() from utils/conversations/search.py. That function queried Typesense directly and returned results straight to the LLM without a Firestore lookup.

When a conversation was deleted, it was removed from Firestore but never removed from Typesense, so it remained in the search index and the LLM could still reference it.

The current code reverted to Pinecone-based search, which does a Firestore fetch after the vector lookup. Since deleted conversations are gone from Firestore, they get silently dropped before reaching the LLM.
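
That filtering step can be sketched as follows. Dicts stand in for the stores and all names are illustrative: after the vector lookup, results are re-fetched from the document store, so ids whose documents were deleted fall out before reaching the LLM.

```python
# Stand-ins for the vector index and document store (illustrative only).
vector_index = ["conv_1", "conv_2"]              # stale: conv_2 was deleted
firestore = {"conv_1": {"transcript": "mega potion"}}

def search(query):
    candidate_ids = vector_index                  # pretend the vector search ran
    # The document-store fetch silently drops ids with no backing document.
    return [firestore[cid] for cid in candidate_ids if cid in firestore]

results = search("mega potion")
assert len(results) == 1                          # conv_2 never reaches the LLM
```

The Typesense path lacked this second fetch, which is why deleted conversations stayed visible to the LLM there.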

So my video evidence was flawed; the behavior is already fixed as of the current version.

However, the backend bug still exists: Typesense will continue to store orphaned, stale data until it is fixed. Sorry for the inconvenience. I can redo the branch and factor in Mohsin's and your comments if you want, but I'm not running my own backend, so my evidence is not valid. I'm still learning; thanks for being patient with me.

@beastoin @mdmohsin7
