feat: Adds the atlas-list-performance-advisor base tool #528

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Open

kylelai1 wants to merge 6 commits into atlas-list-performance-advisor-tool from atlas-list-performance-advisor-base-tool

+913 −0

Collaborator

kylelai1 commented Sep 8, 2025 •

edited

Loading

Proposed changes

This PR adds the atlas-list-performance-advisor tool to the MCP server, which retrieves the following performance advisor recommendations from the admin API: index suggestions, drop index suggestions, schema suggestions, slow query logs.

This PR merges the changes into the atlas-list-performance-advisor-tool branch.

Testing

Manually tested that the MCP server is able to retrieve performance advisor suggestions.

Checklist

I have signed the MongoDB CLA

kylelai1 added 6 commits

September 4, 2025 11:18


          Add base atlas performance advisor MCP server tool

bd69450


          Cleanup comments

f331bac


          Merge main

78f9888


          Fix API return type

2f023eb


          Clean up getting slow query logs from atlas admin api

a4cbc8b


          Fix types for performance advisor api response

2ef0ded

kylelai1 marked this pull request as ready for review

September 8, 2025 20:24

kylelai1 requested a review from a team as a code owner

September 8, 2025 20:24

kylelai1 requested a review from blva

September 8, 2025 20:24

kylelai1 changed the title ~~Adds the atlas-list-performance-advisor base tool~~ feat: Adds the atlas-list-performance-advisor base tool

nirinchev reviewed

View reviewed changes

Collaborator

nirinchev left a comment

Did a quick pass - overall, looks reasonable, but I'm worried that it might not be too LLM-friendly. I suggest testing it thoroughly with different agents/models and confirming it's outputting meaningful insights.

src/tools/atlas/read/listPerformanceAdvisor.ts

+                          .array(z.nativeEnum(PerformanceAdvisorOperation))
+                          .describe("Operations to list performance advisor recommendations"),
+                      since: z.number().describe("Date to list slow query logs since").optional(),
+                      processId: z.string().describe("Process ID to list slow query logs").optional(),

Collaborator

nirinchev Sep 10, 2025

Is this something we expect the LLM to know how to get? As far as I can tell, you get it by calling atlas processes list but we don't have any tools that mirror that behavior in the MCP server.

src/tools/atlas/read/listPerformanceAdvisor.ts

+                      operations: z
+                          .array(z.nativeEnum(PerformanceAdvisorOperation))
+                          .describe("Operations to list performance advisor recommendations"),
+                      since: z.number().describe("Date to list slow query logs since").optional(),

Collaborator

nirinchev Sep 10, 2025

Should this be z.date instead? Does the LLM do a good job at converting dates to unix epoch?

src/tools/atlas/read/listPerformanceAdvisor.ts

+                      // If operations is empty, get all performance advisor recommendations
+                      // Otherwise, get only the specified operations
+                      const operationsToExecute = operations.length === 0 ? Object.values(PerformanceAdvisorOperation) : operations;

Collaborator

nirinchev Sep 10, 2025

Should we mark operations as optional and provide a default instead? Right now there's nothing to hint to the LLM it could provide an empty array here.

src/tools/atlas/read/listPerformanceAdvisor.ts

+                      try {
+                          if (operationsToExecute.includes(PerformanceAdvisorOperation.SUGGESTED_INDEXES)) {
+                              const { suggestedIndexes } = await getSuggestedIndexes(this.session.apiClient, projectId, clusterName);

Collaborator

nirinchev Sep 10, 2025

This is probably not super critical, but right now, all of these async operations are evaluated sequentially, which means that we need to wait for one to finish before starting the next one. Instead, it would be a good idea to run them in parallel.

src/tools/atlas/read/listPerformanceAdvisor.ts

Comment on lines +90 to +92

+                      return {
+                          content: [{ type: "text", text: JSON.stringify(data, null, 2) }],
+                      };

Collaborator

nirinchev Sep 10, 2025

We should wrap the response in formatUntrustedData to avoid injection attacks where someone creates a slow query that contains llm instructions. Also, it might be helpful to give hints to the llm what the different fields in the json data represent and how those can be used.

src/common/atlas/performanceAdvisorUtils.ts

+              interface DropIndexSuggestion {
+                  accessCount?: number;
+                  index?: Array<{ [key: string]: 1 | -1 }>;

Collaborator

nirinchev Sep 10, 2025

Is this type definition true? Would PA not suggest dropping geo or text indexes?

Collaborator Author

kylelai1 Sep 10, 2025

Let me look into this. I used the Open API definitions for the Atlas Admin API, but it would be weird to not include text indexes to drop suggestions for example (same for index creation suggestions!)

src/common/atlas/performanceAdvisorUtils.ts

Comment on lines +55 to +71

+              type SchemaTriggerType =
+                  | "PERCENT_QUERIES_USE_LOOKUP"
+                  | "NUMBER_OF_QUERIES_USE_LOOKUP"
+                  | "DOCS_CONTAIN_UNBOUNDED_ARRAY"
+                  | "NUMBER_OF_NAMESPACES"
+                  | "DOC_SIZE_TOO_LARGE"
+                  | "NUM_INDEXES"
+                  | "QUERIES_CONTAIN_CASE_INSENSITIVE_REGEX";
+              type SchemaRecommedationType =
+                  | "REDUCE_LOOKUP_OPS"
+                  | "AVOID_UNBOUNDED_ARRAY"
+                  | "REDUCE_DOCUMENT_SIZE"
+                  | "REMOVE_UNNECESSARY_INDEXES"
+                  | "REDUCE_NUMBER_OF_NAMESPACES"
+                  | "OPTIMIZE_CASE_INSENSITIVE_REGEX_QUERIES"
+                  | "OPTIMIZE_TEXT_QUERIES";

Collaborator

nirinchev Sep 10, 2025

Do we need to translate these to something the LLM would have an easier time interpreting?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet