Replies: 9 comments 6 replies
-
|
First of all i want to congratulate you for the work you put into this project. My use case:
The problem i got was all three write directly to ChromaDB. With the palace on a network mount and concurrent writers, I was getting SQLITE_IOERR and compaction errors regularly. After going down the rabbit and a lot of back-and-forth in claude code(with claude doing the heavy lifting) I came up with palace-daemon: It also adds a /mine endpoint that runs mempalace mine under the lock, so bulk imports can't race with live queries. Clients connect via a zero-dependency stdio proxy (clients/mempalace-mcp.py) that forwards MCP JSON-RPC to the daemon over HTTP — works with Claude Code, Claude Desktop, or any MCP client. Now that i have this working, i'll make some time to see if someone else figured out a more elegant solution for this, but for now i'm quite happy with how it turned out. |
Beta Was this translation helpful? Give feedback.
-
|
I have switched from my original codex-mcp-memory (PostgreSQL with pgvector) to MemPalace since the global memory instructions I had for my original server had to be adapted whenever the underlying mechanism in Claude changed. With the hooks provided by MemPalace this is no longer an issue. I have added a usage tracker hook though to evaluate the token usage with MemPalace. So far it performs really well and faster than my own memory implementation, the question will be whether it's also as token efficient. My main usage is for my own coding projects, aka I scanned the whole directory containing ALL of my repositories to be able to work on all projects without having to care where I start Claude from. What I did notice is that when closing vscode while the mining process still runs leads to a zombie process that needs to be killed manually before being able to use Claude again. Definitely curious to follow this projects development and to contribute :) |
Beta Was this translation helpful? Give feedback.
-
|
i am using it to make running dungeons and dragons campaigns more fun :-) |
Beta Was this translation helpful? Give feedback.
-
|
I’m using MemPalace mainly as a continuity and compression layer for software engineering work. The most useful pattern so far has been:
That has been especially useful when moving work between Claude and Codex. The biggest win is not that the model “remembers everything”; it is that the next session can recover the important shape of the work quickly, then validate against the source files. The strongest result I’ve seen so far is on recurring architecture questions. On one private work repo, I benchmarked repo-only context retrieval versus MemPalace-backed retrieval for questions like:
Very rough token estimates, but the reductions were large:
The important part for me is not raw retrieval speed. Local disk reads are already fast, and MCP transport can add overhead. The real value is prompt-budget recovery: instead of spending most of the model context on raw repo mass, I can spend it on the actual question, constraints, decisions, acceptance criteria, and handoff state. What is working well:
What still needs care:
The most interesting shift for me is that MemPalace is starting to feel less like “search over notes” and more like an engineering continuity substrate. The next thing I’m working toward is using MemPalace underneath a CDD-style workbench, where memory retrieval is only one part of the context bundle. The idea is: MemPalace provides compact reusable memory, then the workbench spends the saved context budget on the current wave, business context, decisions, provenance, and validation criteria. So my short answer is: I’m using MemPalace to make long-running AI-assisted engineering work resumable, cheaper to re-enter, and less dependent on dumping huge chunks of repo context into every new session. |
Beta Was this translation helpful? Give feedback.
-
|
\Hey! I'm building AI agents for diagnostic centres — multiple patients, multiple workflows (report gen, follow-up, billing), long-running sessions. I came across MemPalace's benchmark numbers — 96.6% LongMemEval R@5, 88.9% LoCoMo R@10 (hybrid v5), and 92.9% ConvoMem — and want to understand what those mean in a real product setup before committing. A few honest questions, because I've read the GitHub issues too:
Basically trying to go from benchmark → working pilot at one centre as fast as possible — want to understand what the architecture genuinely delivers before building on it. |
Beta Was this translation helpful? Give feedback.
-
|
jphein/mempalace fork running on a ~160K-drawer palace behind palace-daemon at Three patterns that have paid off:
Compounding pattern: every Stop hook adds drawers; every search retrieves verbatim; the index gets sharper without me curating. Closest sibling: @rboarescu's homelab setup mentioned above — palace-daemon downstreams his work. |
Beta Was this translation helpful? Give feedback.
-
|
Not using MemPalace directly, but built a parallel system for the same problem — sharing what works and what doesn't for cross-pollination. Use case: Personal AI command center (SwarmAI) where one person + AI operates at team scale. Memory is the entire competitive moat. What we learned after 2 months of daily use (300+ sessions): 1. Recall without judgment is noise. 2. Two-tier beats flat.
This is structurally similar to MemPalace's episodic vs semantic split, but we let the AI do the promotion (distillation) rather than relying on embedding similarity. 3. Corrections are the highest-value memory type. 4. Conversation mining has an N+1 problem. What's not working: Pure vector retrieval for "what did we decide about X?" questions. The decision is often 1 sentence buried in a 200-turn conversation. Embedding similarity finds the conversation, not the sentence. We switched to keyword-indexed structured entries for decisions, and vector only for "vibe match" queries. |
Beta Was this translation helpful? Give feedback.
-
|
I use it for a continuous Claude Cowork memory. Comes in very handy with long projects, or projects with massive token usage. Ive had projects that maxed out Claude's tiny 200k token limit in a couple page conversation. I also brought in my projects with other rAIs so I can manage their memories with Claude. Ive had conversations with gemini run into the upper 10ks of mostly words. I also once lost my local files for cowork so all my chats were gone, but the MemPalace entries made restoring most of it pretty easy. |
Beta Was this translation helpful? Give feedback.
-
|
Hello, I'm using it with this project with the stated goal of: I have been fighting with Chromadb issues since day one. Mempalace is a great idea although I've recently discovered it doesn't satisfy my stated goal because it's broken more often than not. Additionally, it's not efficient for my purposes. I'm about to replace it with codebase-memory-mcp for things such as coding and memweave for things such as conversations. I will post a very deep comparison if you or anyone else is interested. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Lets talk about what is working for you and what is not working.
Beta Was this translation helpful? Give feedback.
All reactions