This is the Python SDK for enabling Maxim observability. Maxim is an enterprise-grade evaluation and observability platform.

```
pip install maxim-py
```

You can find detailed documentation and available integrations here.
See cookbook/agno_agent.py for an example of tracing an Agno agent.
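A minimal setup sketch is shown below. The config fields mirror options named in the changelog (`api_key`, `base_url`, `raise_exceptions`); treat the exact import paths and the `logger()` accessor as assumptions rather than verified API.

```python
# Minimal setup sketch -- import paths and the logger() accessor are
# assumptions; MaximConfig fields (api_key, base_url, raise_exceptions)
# are named in the changelog below.
from maxim import Maxim, MaximConfig

maxim = Maxim(MaximConfig(
    api_key="your-maxim-api-key",
    raise_exceptions=False,  # unless True, the SDK will not raise exceptions
))
logger = maxim.logger()

# ... run your instrumented application code ...

logger.flush()  # push buffered logs now; also triggered automatically at exit
```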
Langchain integration support, by provider and Langchain version:

| | Anthropic | Bedrock Anthropic | Bedrock Meta | OpenAI | Azure |
|---|---|---|---|---|---|
| Chat (0.3.x) | ✅ | ✅ | ✅ | ✅ | ✅ |
| Chat (0.1.x) | ✅ | ✅ | ✅ | ✅ | ✅ |
| Tool call (0.3.x) | ✅ | ✅ | ✅ | ✅ | ✅ |
| Tool call (0.1.x) | ✅ | ✅ | ✅ | ✅ | ✅ |
| Chain (via LLM) (0.3.x) | ✅ | ✅ | ✅ | ✅ | ✅ |
| Chain (via LLM) (0.1.x) | ✅ | ✅ | ✅ | ✅ | ✅ |
| Streaming (0.3.x) | ✅ | ✅ | ✅ | ✅ | ⏳️¹ |
| Streaming (0.1.x)¹ | ⏳️ | ⏳️ | ⏳️ | ⏳️ | ⏳️ |
| Agent (0.3.x) | ✖️ | ✖️ | ✖️ | ✖️ | ✖️ |
| Agent (0.1.x) | ✖️ | ✖️ | ✖️ | ✖️ | ✖️ |

¹ Token usage is not supported by Langchain for streaming responses.
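For reference, wiring up Langchain tracing goes through the `MaximLangchainTracer` callback mentioned in the changelog below; the import path and the `ChatOpenAI` model used here are illustrative assumptions.

```python
# Sketch of Langchain tracing via MaximLangchainTracer (named in the
# changelog); the import path and model choice are assumptions.
from maxim.logger.langchain import MaximLangchainTracer
from langchain_openai import ChatOpenAI

llm = ChatOpenAI(model="gpt-4o-mini")
response = llm.invoke(
    "Hello, world!",
    # logger comes from the Maxim setup sketch above
    config={"callbacks": [MaximLangchainTracer(logger)]},
)
```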
Please reach out to us if you need support for any other package, provider, or class combination.
LiteLLM integration support:

| completion | acompletion | fallback | Prompt Management |
|---|---|---|---|
| ✅ | ✅ | ✖️ | ✖️ |
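A sketch of the LiteLLM integration follows; `MaximLiteLLMTracer` and its import path are assumptions modeled on the `MaximLiteLLMProxyTracer` mentioned in the changelog below.

```python
# Sketch of LiteLLM tracing; MaximLiteLLMTracer and its import path are
# assumptions modeled on the MaximLiteLLMProxyTracer named in the changelog.
import litellm
from maxim.logger.litellm import MaximLiteLLMTracer

# logger comes from the Maxim setup sketch above
litellm.callbacks = [MaximLiteLLMTracer(logger)]

response = litellm.completion(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Hello!"}],
)
```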
| Provider | Audio | Tool Calls | Video |
|---|---|---|---|
| OpenAI (RealtimeAPI) | ✅ | ✖️ | ✖️ |
| Gemini (RealtimeAPI) | ✅ | ✖️ | ✖️ |
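The one-line OpenAI integration mentioned in the changelog wraps the stock client; `MaximOpenAIClient` is named there, but the import path and wrapper signature below are assumptions.

```python
# One-line OpenAI integration sketch; MaximOpenAIClient is named in the
# changelog, but the import path and wrapper signature are assumptions.
from openai import OpenAI
from maxim.logger.openai import MaximOpenAIClient

# wraps the stock OpenAI client; logger comes from the setup sketch above
client = MaximOpenAIClient(OpenAI(), logger)

completion = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[{"role": "user", "content": "Hello!"}],
)
```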
- feat: Adds one line integration for tracing Agno agents.
- improvement: Adds support for uploading large payloads as part of logs.
- improvement: Increased connection pool max size to 20 for more connections at high throughput.
- improvement: Moves the network stack from `requests` to `httpx` for better stability
- feat: Adds a new query type called multi-select for querying prompts, prompt chains, and folders.
- chore: Changed scribe (Maxim Logger) default level to warning.
- improvement: Wraps all LiveKit callback calls in try/except and gracefully continues tracing.
- improvement: Improves the network connection error handling for connection pool.
- fix: Fixes `enable_prompt_management` method bug.
- fix: Some minor fixes in Maxim log repo checks, Anthropic client and Gemini client
- feat: Improved memory management and better interrupt detection for LiveKit + Gemini.
- feat: LiveKit + Tool call support is live.
- feat: LiveKit support is now in beta (up from Alpha).
- chore: Adds `session_id` and `room_id`
- fix: Fixes the Gemini + Langchain integration to capture `None` `finish_reason` and `usage_metadata`
- fix: Fixes chunk and chat auto-tracing for the Gemini client
- feat: Adds one line integration for Portkey AI
- fix: Fixes tool call parsing for OpenAI one line integration.
- feat: Adds auto attachment parsing for vision models
- fix: Exposes `YieldedOutputTokens` and `YieldedOutputCost` classes
- feat: Adds files support for `test_runs`
- feat: Adds Mistral tracing support
- breaking change: We have renamed a few entities used in test runs.
- chore: Updates default log level of scribe to debug
- feat: LiveKit support for Google and OpenAI
- chore: improvements in crewai logging integration
- chore: deprecated `span.output()` method removed
- feat: LiveKit one line integration (alpha)
- fix: Signal registration only happens if the current thread is main thread
- feat: Prompt, PromptVersionConfig, and RunnablePrompt now expose a `provider` field to indicate the LLM provider (e.g., 'openai', 'anthropic').
- improvement: Maxim SDK listens for `atexit` and termination signals, and triggers cleanup automatically.
- chore: crewai python 3.9+ support
- fix: now reports metadata errors silently
- fix: minor fixes to crew-ai instrumentation
- fix: minor version check fixes.
- feat: crew-ai intercept
- fix: trace tag fixes for langchain
- feat: Adds attachment support (beta): Read more
- fix: Fixes anthropic one line integration
- chore: Fixes anthropic messages parsing for streaming client
- chore: Added extra guards for lang-graph evaluation config.
- chore: Adds a new Maxim SDK level logger. You can set a specific level for the Maxim SDK via `logging.getLogger('maxim').setLevel(logging.DEBUG)`.
- fix: minor cleanups on Langchain tracer.
- feat: adds special handling for apps running on AWS Lambda. As runtime execution is unpredictable, `logger.flush()` now pushes logs immediately instead of submitting to a thread pool worker.
- chore: all logs emitted from the SDK now carry a [MaximSDK] prefix.
- fix: adds special handling for when the Langchain streaming response handler raises an exception in user land.
- fix: fixes empty cache issue for prompt management
- chore: adds a max limit to the commit log queue size
- chore: auto-flushes when the writer's in-memory message buffer reaches its max size
- chore: adds a global container list for the Langchain tracer to share across multiple instances
- chore: adds custom trace-id support for `MaximOpenAIClient`
- feat: adds error component
- deprecate: old Config classes; all logging constructs now support TypedDict
- feat: adds one line integration for OpenAI
- fix: fixes agents import in OpenAI SDK
- fix: fixes langchain callback tracer
- chore: adds support to wrap LiteLLMProxy tracer for docker deployments
- feat: adds new MaximLiteLLMProxyTracer file for LitellmProxy logger
- chore: adds ID validation on client side.
- feat: adds support for bedrock client
- chore: Some minor bugfixes for the Langchain handler (it's them, not us)
- fix: handles some network level exceptions during connection resets.
- feat: OpenAI agents tracing is out of beta, and we are also listed in the OpenAI docs.
- feat: adds support for running test runs using prompt chains
- fix: fixes test runs using datasets, and runs using local workflows.
- fix: fixes incorrect import for TypedDict
- feat: openai-agents: adds session support and error support for LLM calls
- feat: OpenAI agents tracing support (beta)
- feat: Generation messages now support dicts
- fix: Handles optional PromptResponse fields gracefully.
- feat: Adds support for prompt and prompt chain run
- fix: Resolved duplicate serialization of metadata entries
- Breaking change: Prompt and prompt chain object properties are now snake_case
- fix: Prompt chain nodes are properly parsed in all cases
- fix: fixes litellm `pre_api_call` message parsing
- fix: updates the create test run API to use the v2 API
- fix: marks a test run as failed if it raises an error at any point after being created on the platform.
- feat: adds support for `context_to_evaluate` in `with_prompt_version_id` and `with_workflow_id` (by passing it as the second parameter), so you can choose whichever variable or dataset column to use as the context to evaluate, as opposed to only having the dataset column as context through the `CONTEXT_TO_EVALUATE` data structure mapping.
- fix: fixes garbled message formatting when an invalid test run config is passed to the TestRunBuilder
- chore: the SDK now propagates system errors in formatted structures (specifically for test runs)
- fix: adds missing deps to the requirements
- chore: minor bug fixes
- feat: adds support for gemini outputs
- feat: adds local evaluator support for test runs
- chore: Litellm failure exceptions will be sent to the default logger.
- feat: Adds litellm support (Beta)
- fix: Fixes duplicate container ids for langchain tracer
- fix: Langgraph capture fixes
- chore: Adds missing docstrings
- fix: Adds support for dict as an output of the `yields_output` function during test runs.
- fix: Fixed dependency issues
- feat: Adds new flow to trigger test runs via Python SDK
- fix: Minor bug fixes
v3.0.1 Breaking changes
- beta release
- feat: New decorators support for tracing, langchain and langgraph
- feat: Adds a new decorator for langgraph: `@langgraph_agent`
- feat: Adds support for chains in langchain tracer
- fix: Some minor bug fixes
- chore: Keeps the logger alive while the function call context is present
- fix: Fixes automatic retrieval capture from vector dbs
- fix: Fixes langchain_llm_call to handle chat models
- fix: Minor bug fixes
- Check upgrade steps
- feat: Adds new decorators flow to simplify tracing
- chore: `apiKey` and `baseUrl` parameters in `MaximConfig` are now `api_key` and `base_url`, respectively.
- feat: Jinja 2.0 variables support
- fix: Fixes issue where model was None for some prompt versions.
- fix: Fixes edge case of race condition while fetching prompts, prompt chains and folders.
- fix: Fixes import of `dataclasses`
- feat: Adds a new config called `raise_exceptions`. Unless this is set to `True`, the SDK will not raise any exceptions.
- chore: Removes raising an alert when the repo is not found
- fix: Removes a no-op command for retrieval
- fix: Fixes retrieval output command
- feat: Supports 0.1.x langchain
- chore: Improved langchain support
- chore: Improves log writer cleanups for quick returns.
- chore: Improved fs access checks.
- chore: Fixes threading locks for periodic syncs in Python 3.9
- chore: Adds AWS Lambda env support for the SDK when there is no filesystem access.
- feat: Adds support for the new `langchain_openai.AzureChatOpenAI` class in the Langchain tracer
- fix: Adds Python 3.9 compatibility
- chore: Updates the connection pool to use a session that enforces reconnects before making API calls.
- chore: Adds backoff retries to failed REST calls.
- chore: langchain becomes an optional dependency
- fix: connection pooling for network calls.
- fix: connection close issue.
- fix: connection close issue.
- Adds validation for provider in generation
- Now `generation.result` accepts:
  - OpenAI chat completion object
  - Azure OpenAI chat completion object
  - Langchain LLMResult, AIMessage object
- fix: Fixes message_parser
- fix: Fixes utility function for langchain to parse AIMessage into Maxim logger completion result
- feat: Adds tool call parsing support for Langchain tracer
- feat: Adds support for ChatCompletion in generations
- feat: Adds type safety for retrieval results
- fix: Bug fix where input sent with trace.config was getting overridden with None
- chore: Adds `trace.set_input` and `trace.set_output` methods to control what to show in the logs dashboard
- chore: Removes one no_op command while creating spans
- fix: Minor bug fixes
- fix: Fixed MaximLangchainTracer error logging flow.
- feat: Adds langchain support
- chore: Adds local parsers to validate payloads on client side
- fix: Minor bug fixes around log writer cleanup
- Public release