Release: Production-ready improvements - Performance, metadata, and developer experience enhancements by savourylie · Pull Request #23 · memfuse/memfuse-python

savourylie · 2025-10-10T13:02:41Z

🎯 Overview

This release improves the MemFuse Python SDK in three areas: performance, new API features, and developer experience. It includes 2,308 additions across 51 files, with major refactorings and new capabilities.

✨ Key Features

Performance & Reliability

Optimized agent creation (Bug/118 none agentname #15): Implemented idempotent agent creation with singleflight pattern to prevent concurrent creation of duplicate agents
Health check optimization (feat: streamline server health checks during session initialization #22): Streamlined server health checks to only run during session initialization instead of on every request
Version compatibility checking (feat: add version field to health endpoint for SDK compatibility chec… #16): Added version field to health endpoint for automatic SDK-server compatibility validation
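
As a rough illustration of the health check change, the sketch below runs the check once when a session is created and never again on individual requests. The `Session` class and the `check_health` callable are hypothetical names used only for this example; they are not taken from the SDK.

```python
from typing import Callable


class Session:
    """Hypothetical session wrapper; not the SDK's actual class."""

    def __init__(self, check_health: Callable[[], bool]) -> None:
        # Run the server health check exactly once, at initialization.
        if not check_health():
            raise ConnectionError("MemFuse server is not healthy")
        self.requests_sent = 0

    def request(self, path: str) -> str:
        # No health check here: requests rely on the check done in __init__.
        self.requests_sent += 1
        return f"GET {path}"
```

With this shape, the cost of the health probe is paid once per session rather than once per API call.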

API Enhancements

Metadata support: Added metadata fields to messages and memory APIs with deprecation warnings for legacy parameters
Debug logging (feat: add debug logging support controlled by MEMFUSE_DEBUG environme… #19): Added MEMFUSE_DEBUG environment variable for detailed API call logging
Improved query schemas (feat: update MemFuse base URL to use port 8765 across examples and tests #20): Updated query response fields and request models for better type safety
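
A minimal sketch of environment-controlled debug logging, assuming that values such as "1" or "true" switch it on. The accepted values, the logger name, and the `log_api_call` helper are assumptions made for this example; only the MEMFUSE_DEBUG variable name comes from this PR.

```python
import logging
import os

logger = logging.getLogger("memfuse.debug")  # logger name is an assumption


def debug_enabled() -> bool:
    """Return True when MEMFUSE_DEBUG holds an (assumed) truthy value."""
    value = os.environ.get("MEMFUSE_DEBUG", "").strip().lower()
    return value in {"1", "true", "yes", "on"}


def log_api_call(method: str, url: str) -> None:
    # Emit a record only when MEMFUSE_DEBUG is switched on.
    if debug_enabled():
        logger.debug("%s %s", method, url)
```

To actually see the records, the application would also need to configure logging at DEBUG level, for example with `logging.basicConfig(level=logging.DEBUG)`.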

Developer Experience

Optional UI dependencies (feat: add optional UI dependencies and update examples to handle miss… #21): Moved Gradio to extras dependency group to reduce core package size
Configuration flexibility: Model references now use environment variables for better compatibility with OpenAI and Gemini APIs
Port standardization (feat: update MemFuse base URL to use port 8765 across examples and tests #20): Updated default MemFuse base URL to port 8765 across all examples and tests
Enhanced version parsing: Support for release candidates and multi-digit patch versions
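
The version parsing item can be pictured with the sketch below, which accepts release candidates and multi-digit patch numbers. The function names, the accepted string formats, and the "same major version" compatibility rule are all assumptions for illustration; the SDK's real utility may differ.

```python
import re
from typing import Optional, Tuple

# Assumed formats: "1.2.3", "1.2.13", "1.2.3rc1", "1.2.3-rc.1".
_VERSION_RE = re.compile(r"^v?(\d+)\.(\d+)\.(\d+)(?:[-.]?rc\.?(\d+))?$")


def parse_version(text: str) -> Tuple[int, int, int, Optional[int]]:
    """Return (major, minor, patch, rc); rc is None for final releases."""
    match = _VERSION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized version string: {text!r}")
    major, minor, patch, rc = match.groups()
    return int(major), int(minor), int(patch), int(rc) if rc is not None else None


def is_compatible(sdk_version: str, server_version: str) -> bool:
    """Assumed rule: SDK and server are compatible when major versions match."""
    return parse_version(sdk_version)[0] == parse_version(server_version)[0]
```

Parsing each component as an integer is what makes multi-digit patch versions compare correctly, since a plain string comparison would rank "1.2.9" above "1.2.10".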

Infrastructure

OpenAI client migration (refactor: replace openrouter with openai compatible client in MSC acc… #17): Replaced LiteLLM with native OpenAI-compatible client in benchmarks for better reliability
Enhanced benchmarking (Feat/110 retrieval tracking #14): Added concurrent evaluation support with detailed retrieval content tracking and verbose mode

🧪 Testing

Added comprehensive test suites including:
Agent idempotency tests
Metadata type validation tests
Version compatibility tests
Query schema validation tests
Enhanced retrieval metrics testing

📦 Migration Notes

Port change: Default server port is now 8765 (was 8000)
Dependencies: Install UI examples with poetry install -E ui
Deprecated parameters: Legacy parameters now trigger deprecation warnings (still functional)
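
For readers wondering what "deprecated but still functional" looks like in practice, here is a minimal sketch. The function `add_message` and the parameter `legacy_tags` are hypothetical stand-ins; this PR does not name the legacy parameters, so none of these identifiers should be read as the SDK's API.

```python
import warnings
from typing import Any, Dict, Optional


def add_message(content: str,
                metadata: Optional[Dict[str, Any]] = None,
                legacy_tags: Optional[Dict[str, Any]] = None) -> Dict[str, Any]:
    """Hypothetical API: `legacy_tags` stands in for any deprecated parameter."""
    if legacy_tags is not None:
        warnings.warn(
            "`legacy_tags` is deprecated; pass `metadata` instead",
            DeprecationWarning,
            stacklevel=2,  # point the warning at the caller, not this function
        )
        # The legacy value is still honored, but an explicit `metadata` wins.
        if metadata is None:
            metadata = legacy_tags
    return {"content": content, "metadata": metadata or {}}
```

Callers using the old parameter keep working and see a DeprecationWarning, which gives them time to migrate.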

🔗 Related PRs

feat: streamline server health checks during session initialization #22 - Health check optimization
feat: add optional UI dependencies and update examples to handle miss… #21 - Optional Gradio dependencies
feat: update MemFuse base URL to use port 8765 across examples and tests #20 - Port and query schema updates
feat: add debug logging support controlled by MEMFUSE_DEBUG environme… #19 - Debug logging support
Feat/146 prompts handle new query resp fields #18 - Model references and metadata
refactor: replace openrouter with openai compatible client in MSC acc… #17 - OpenAI client replacement
feat: add version field to health endpoint for SDK compatibility chec… #16 - Version health endpoint
Bug/118 none agentname #15 - Agent creation bug fix
Feat/110 retrieval tracking #14 - Retrieval tracking

- Implement concurrent LLM calls with controlled delays to avoid server overload
- Add new command-line arguments for concurrency and delay settings
- Refactor run_benchmark_evaluation to use concurrent tasks
- Update related scripts to support new concurrency options

…tions
- Remove concurrent_delay argument from run_benchmark.py, scripts/load_and_run.py, and scripts/run_evaluation.py
- Update _evaluate_single_question and run_benchmark_evaluation functions to remove staggered execution logic
- This change simplifies the concurrency control mechanism, using asyncio's built-in concurrency primitives without adding artificial delays

…iciency
- Implement concurrent evaluation of questions using asyncio Semaphore
- Refactor question evaluation logic into a separate function
- Update default LLM provider to OpenAI in run_benchmark.py
- Change default LLM provider to Gemini in utils.py
- Optimize data loading and evaluation flow
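
The Semaphore-based approach in this commit could look roughly like the sketch below. `evaluate_all` and `evaluate_one` are illustrative names chosen for this example and are not the benchmark's actual functions or signatures.

```python
import asyncio
from typing import Awaitable, Callable, List


async def evaluate_all(questions: List[str],
                       evaluate_one: Callable[[str], Awaitable[bool]],
                       concurrency: int = 4) -> List[bool]:
    """Evaluate every question with at most `concurrency` running at once."""
    semaphore = asyncio.Semaphore(concurrency)

    async def bounded(question: str) -> bool:
        async with semaphore:  # waits while `concurrency` tasks are in flight
            return await evaluate_one(question)

    # gather returns results in the same order as the input questions.
    return await asyncio.gather(*(bounded(q) for q in questions))
```

Because the semaphore caps the number of in-flight calls, no artificial delay between task launches is needed to protect the server.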

…king improvements
- Added concurrent evaluation support from feat/concurrent branch
- Extended retrieval metrics to support MSC dataset alongside LME
- Enhanced string normalization with regex-based punctuation removal for better matching
- Improved argument parsing with provider-specific model defaults
- Updated gitignore to include .vscode/ directory
- Maintained backward compatibility while adding new features
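
As an illustration of regex-based punctuation removal for matching, the sketch below lowercases text, strips punctuation, and collapses whitespace. The `normalize` function and its exact rules are assumptions for this example, not the benchmark's actual implementation.

```python
import re

_PUNCT_RE = re.compile(r"[^\w\s]")  # anything that is not a word char or space
_SPACE_RE = re.compile(r"\s+")


def normalize(text: str) -> str:
    """Lowercase, drop punctuation, and collapse whitespace before comparing."""
    text = _PUNCT_RE.sub("", text.lower())
    return _SPACE_RE.sub(" ", text).strip()
```

Two strings that differ only in case, punctuation, or spacing then compare equal after normalization.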

…luation
- Fixed 'list' object has no attribute 'lower' error by using correct function call
- Restored calculate_enhanced_retrieval_metrics() instead of simple calculate_retrieval_metrics()
- Includes both enhanced and legacy metrics for comparison
- Maintains concurrent evaluation while preserving enhanced retrieval tracking features

…e mode
- Add --retrieval-verbose flag to capture and display full retrieved memory content
- Enhance incorrect questions CSV export with recall flags for better analysis
- Support CSV format parsing in question IDs file for compatibility
- Improve memory content display with score information and truncation

…attern
Add singleflight pattern to prevent concurrent agent creation attempts for the same agent name. Implement POST-first flow with server-side idempotent create and fallback GET handling. Add retry logic with exponential backoff for network and server errors. Include thread-safe locking for sync client and proper error handling for conflict scenarios.

Also includes:
- Add idempotency_key parameter to agent creation API
- Fix potential NoneType errors in benchmark retrieval
- Update default LLM provider for benchmarks
- Add comprehensive tests for agent idempotency
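
The singleflight idea for the sync client can be sketched as follows. This covers only the in-process deduplication step, not the POST-first flow, the idempotency key, or the retry logic, and the `SingleFlight` class is a hypothetical illustration rather than the SDK's code.

```python
import threading
from concurrent.futures import Future
from typing import Callable, Dict


class SingleFlight:
    """Collapse concurrent calls for the same key into one execution."""

    def __init__(self) -> None:
        self._lock = threading.Lock()
        self._in_flight: Dict[str, Future] = {}

    def do(self, key: str, fn: Callable[[], dict]) -> dict:
        with self._lock:
            future = self._in_flight.get(key)
            leader = future is None
            if leader:
                # First caller for this key becomes the leader.
                future = Future()
                self._in_flight[key] = future
        if not leader:
            # Followers wait for the leader's result (or its exception).
            return future.result()
        try:
            future.set_result(fn())
        except BaseException as exc:
            future.set_exception(exc)
        finally:
            with self._lock:
                self._in_flight.pop(key, None)
        # Returns the value, or re-raises the leader's exception.
        return future.result()
```

Keyed by agent name, concurrent callers asking for the same agent share a single creation request, while requests for different agents proceed independently.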


M1n9X and others added 30 commits August 25, 2025 17:28

Merge dev branch updates

c74f2e7

Merge pull request #14 from memfuse/feat/110-retrieval-tracking

525eb07

Feat/110 retrieval tracking

chore: update .gitignore for specstory integration

7fa090b

- Add .cursorindexingignore to .gitignore
- Exclude .specstory directory from version control

Merge pull request #15 from memfuse/bug/118-none-agentname

41bc077

Bug/118 none agentname

feat: add version field to health endpoint for SDK compatibility chec…

fab8cd2

…king Add version compatibility utility and update health endpoint to include version information for SDK compatibility checking as requested in issue #116.

Merge pull request #16 from memfuse/feat/116-add-version-health-endpoint

a7a50d6

feat: add version field to health endpoint for SDK compatibility chec…

refactor: replace openrouter with openai compatible client in MSC acc…

7baa182

…uracy test

Merge pull request #17 from memfuse/feat/120-replace-litellm-with-openai

f7ec416

refactor: replace openrouter with openai compatible client in MSC acc…

fix: update OpenAI model to use environment variable for compatibility

55c1363

feat: add metadata support to UsersApi and AsyncMemory, deprecate leg…

099b3cf

…acy parameters

feat: enhance version parsing to support release candidates and multi…

1ecdcf2

…-digit patch versions

feat: add metadata support to messages and memory APIs, update reques…

5af0bd6

…t models, and implement deprecation warnings for legacy parameters

feat: update model references to use environment variable for compati…

d486e63

…bility with OpenAI and Gemini APIs

Merge pull request #18 from memfuse/feat/146-prompts-handle-new-query…

038db17

…-resp-fields Feat/146 prompts handle new query resp fields

feat: add debug logging support controlled by MEMFUSE_DEBUG environme…

861217b

…nt variable

Merge pull request #19 from memfuse/feat/148-add-debug-logging-for-ap…

2881bc1

…i-calls feat: add debug logging support controlled by MEMFUSE_DEBUG environme…

feat: update MemFuse base URL to use port 8765 across examples and tests

43a0810

Merge pull request #20 from memfuse/fix/change-port-and-query-schema

d6e0b9b

feat: update MemFuse base URL to use port 8765 across examples and tests

feat: add optional UI dependencies and update examples to handle miss…

69c3cc9

…ing installation

Merge pull request #21 from memfuse/feat/move-gradio-dep-to-extras

933a8f3

feat: add optional UI dependencies and update examples to handle miss…

feat: streamline server health checks during session initialization

958354a

Merge pull request #22 from memfuse/fix/health-check-on-every-request

50f376e

feat: streamline server health checks during session initialization

Merge remote-tracking branch 'origin/main' into dev

c13add7

savourylie merged commit f4c0e17 into main Oct 10, 2025

savourylie merged 30 commits into main from dev
