Optimize TI analytic rule deduplication and filters by v-sabiraj · Pull Request #13904 · Azure/Azure-Sentinel

v-sabiraj · 2026-03-25T08:18:22Z

Reorder and simplify the Threat Intelligence analytic query to deduplicate records earlier and apply active/expiration filters afterwards. Added summarize arg_max(TimeGenerated, *) by Id, Url and by Id, ObservableValue to get the latest record before filtering, moved IsActive/ValidUntil/ExpirationDateTime checks to follow deduplication, and removed redundant time-order checks and extra arg_max usages related to EmailUrlInfo. These changes ensure the latest valid indicators are retained and simplify the join logic. Bumped version 1.0.5 -> 1.0.6.

contentautomationbot · 2026-03-25T08:19:10Z

Hello, how are you? I am a GitHub bot.
😀😀
I see that you changed templates under the detections/analytic rules folder. Did you remember to update the version of the templates you changed?
If not, and if you want customers to be aware that a new version of this template is available, please update the version property of the template you changed.

Copilot

Pull request overview

This PR refactors the Threat Intelligence analytic rule query to deduplicate TI indicator records earlier (via arg_max) and streamline filtering/join logic, with a corresponding template version bump.

Changes:

Deduplicate URL and domain TI indicators earlier using summarize arg_max(TimeGenerated, *) ....
Reorder/adjust active/expiration filtering and simplify join-related logic with EmailUrlInfo_.
Bump analytic rule template version from 1.0.5 to 1.0.6.

Copilot · 2026-03-25T08:20:19Z

+    | summarize arg_max(TimeGenerated, *) by Id, Url // Get latest record per Id FIRST
+    | where IsActive == true and ValidUntil > now() // THEN filter for active indicators that haven't expired


Reordering to deduplicate before applying IsActive/ValidUntil can drop valid indicators: if the most recent record for a given Id, Url is inactive/expired but an earlier record is still active/valid, the arg_max keeps the inactive one and the subsequent where removes it entirely. To preserve the latest valid indicator, apply the IsActive/ValidUntil filter before arg_max, or compute arg_max over only the valid subset (e.g., filter first, then summarize).
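The ordering concern can be reproduced outside Sentinel with a small model. The sketch below is plain Python rather than KQL, and the rows, the `latest_per_key` helper, and the fixed `NOW` value are all hypothetical; it only imitates what `summarize arg_max(TimeGenerated, *) by Id, Url` followed by the `where` clause would do on two records for the same indicator.

```python
from datetime import datetime, timedelta, timezone

# Fixed stand-in for now() so the example is reproducible (assumption).
NOW = datetime(2026, 3, 25, tzinfo=timezone.utc)

# Hypothetical indicator rows: same Id and Url, the newer record is inactive.
rows = [
    {"Id": "ind-1", "Url": "http://bad.example/a",
     "TimeGenerated": NOW - timedelta(days=2),
     "IsActive": True, "ValidUntil": NOW + timedelta(days=30)},
    {"Id": "ind-1", "Url": "http://bad.example/a",
     "TimeGenerated": NOW - timedelta(days=1),
     "IsActive": False, "ValidUntil": NOW + timedelta(days=30)},
]

def latest_per_key(records, keys):
    """Model of `summarize arg_max(TimeGenerated, *) by <keys>`."""
    best = {}
    for r in records:
        k = tuple(r[name] for name in keys)
        if k not in best or r["TimeGenerated"] > best[k]["TimeGenerated"]:
            best[k] = r
    return list(best.values())

def is_valid(r):
    """Model of `where IsActive == true and ValidUntil > now()`."""
    return r["IsActive"] and r["ValidUntil"] > NOW

# Order used in the PR: deduplicate first, then filter.
dedup_then_filter = [r for r in latest_per_key(rows, ["Id", "Url"]) if is_valid(r)]

# Order suggested in the review: filter first, then deduplicate.
filter_then_dedup = latest_per_key([r for r in rows if is_valid(r)], ["Id", "Url"])

print(len(dedup_then_filter))  # 0: the newest record is inactive, so nothing survives
print(len(filter_then_dedup))  # 1: the older, still-active record is kept
```

Which result is wanted depends on whether a newer inactive record is meant to revoke the indicator. If it is, deduplicating first is the stricter and arguably safer behavior; if it is not, filtering first preserves the older valid record.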

Copilot · 2026-03-25T08:20:19Z

+    | summarize arg_max(TimeGenerated, *) by Id, Url // Get latest record per Id FIRST
+    | where IsActive == true and ValidUntil > now() // THEN filter for active indicators that haven't expired
    | join kind=innerunique (EmailUrlInfo_) on Url // Join with email URL info on URL


After switching to join kind=innerunique without re-selecting the latest EmailUrlInfo_TimeGenerated per indicator/URL, the chosen EmailUrlInfo_ row becomes effectively arbitrary when multiple email-URL records exist for the same Url. This can yield nondeterministic results and may associate older/newer email evidence incorrectly. Consider deduplicating EmailUrlInfo_ (e.g., arg_max(EmailUrlInfo_TimeGenerated, *) by Url) before the join, or restoring a post-join summarize arg_max(EmailUrlInfo_TimeGenerated, *) ... to make the selection deterministic.
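The pre-join deduplication proposed here can be sketched in a few lines. This is a Python model, not the rule's KQL; the rows, the integer timestamps, the `NetworkMessageId` values, and both helper functions are illustrative assumptions. Note that KQL documents `innerunique` as keeping one left-side row per key and pairing it with every matching right-side row, so the sketch models only the "latest email record per Url" step, not the join flavor itself.

```python
# Hypothetical indicator and email rows; timestamps are plain integers for brevity.
indicators = [{"Id": "ind-1", "Url": "http://bad.example/a"}]
emails = [
    {"Url": "http://bad.example/a", "EmailUrlInfo_TimeGenerated": 100,
     "NetworkMessageId": "msg-old"},
    {"Url": "http://bad.example/a", "EmailUrlInfo_TimeGenerated": 200,
     "NetworkMessageId": "msg-new"},
]

def latest_email_per_url(records):
    """Model of `summarize arg_max(EmailUrlInfo_TimeGenerated, *) by Url`."""
    best = {}
    for r in records:
        cur = best.get(r["Url"])
        if cur is None or r["EmailUrlInfo_TimeGenerated"] > cur["EmailUrlInfo_TimeGenerated"]:
            best[r["Url"]] = r
    return best

def join_on_url(left, right_by_url):
    """Inner join where the right side holds at most one row per Url."""
    return [{**l, **right_by_url[l["Url"]]} for l in left if l["Url"] in right_by_url]

# The same result comes back regardless of the input order of the email rows.
matches = join_on_url(indicators, latest_email_per_url(emails))
shuffled = join_on_url(indicators, latest_email_per_url(list(reversed(emails))))

print(matches[0]["NetworkMessageId"])  # msg-new
print(matches == shuffled)             # True
```

Because the email side is reduced to one row per Url before the join, the row that reaches the output no longer depends on input order.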

Copilot · 2026-03-25T08:20:19Z

+    | summarize arg_max(TimeGenerated, *) by Id, ObservableValue // Get latest record per Id FIRST
+    | where IsActive == true // THEN filter for active indicators
    | extend TrafficLightProtocolLevel = AdditionalFields.TLPLevel
    | extend DomainName = tolower(ObservableValue) // Convert domain names to lowercase


In the domain branch, tolower(ObservableValue) is applied after deduplication/grouping by ObservableValue. If the same domain appears with different casing, it will be treated as distinct groups during summarize, potentially producing duplicates downstream even though DomainName is later normalized. Normalize first (create a lowercased domain column) and then summarize ... by Id, <lowercasedDomain> (or summarize by the normalized key you actually use later).

Suggested change

-    | summarize arg_max(TimeGenerated, *) by Id, ObservableValue // Get latest record per Id FIRST
-    | where IsActive == true // THEN filter for active indicators
-    | extend TrafficLightProtocolLevel = AdditionalFields.TLPLevel
-    | extend DomainName = tolower(ObservableValue) // Convert domain names to lowercase
+    | extend DomainName = tolower(ObservableValue) // Normalize domain names to lowercase before summarization
+    | summarize arg_max(TimeGenerated, *) by Id, DomainName // Get latest record per Id and normalized domain
+    | where IsActive == true // THEN filter for active indicators
+    | extend TrafficLightProtocolLevel = AdditionalFields.TLPLevel
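The effect of normalizing before grouping, as raised in the domain-branch comment, can be checked with a small model. The following is a Python sketch under assumed data: the two rows, the integer timestamps, and the `latest_per_key` helper are hypothetical and only imitate `summarize arg_max(TimeGenerated, *) by ...` with a raw versus a lowercased grouping key.

```python
# Hypothetical rows: the same domain reported with different casing.
rows = [
    {"Id": "ind-7", "ObservableValue": "Evil.Example", "TimeGenerated": 1},
    {"Id": "ind-7", "ObservableValue": "evil.example", "TimeGenerated": 2},
]

def latest_per_key(records, key_fn):
    """Model of `summarize arg_max(TimeGenerated, *) by <key>`."""
    best = {}
    for r in records:
        k = key_fn(r)
        if k not in best or r["TimeGenerated"] > best[k]["TimeGenerated"]:
            best[k] = r
    return list(best.values())

# Grouping on the raw value treats the two casings as different domains.
by_raw = latest_per_key(rows, lambda r: (r["Id"], r["ObservableValue"]))

# Grouping on the lowercased value (the tolower() step moved first) merges them.
by_normalized = latest_per_key(rows, lambda r: (r["Id"], r["ObservableValue"].lower()))

print(len(by_raw), len(by_normalized))  # 2 1
```

Grouping on the same normalized key that the later join uses keeps one row per indicator and domain, which is the intent of the suggested change.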

rahul0216

Please address the points we discussed.

Reorder and clean up ThreatIntelIndicators handling for URL and domain matches. For URLs, summarize (arg_max) by Id/ObservableValue first, then filter for active/unexpired indicators, and normalize ObservableValue to lowercase before joining with EmailUrlInfo_. For domains, normalize DomainName to lowercase earlier and correct the projected active field name to IsActive. These changes ensure joins use the latest, active indicators and consistent casing for reliable matching.

rahul0216

Looks good.

Release update for Threat Intelligence (NEW): add packaged artifact 3.0.16.zip and bump solution/template versions to 3.0.16. Update analytic rule 6 version to 1.0.6 and refine its KQL query logic to use arg_max(TimeGenerated,*) and improved filtering for URL/domain indicators. Standardize/roll back several resource apiVersion values (various Microsoft.OperationalInsights and Microsoft.SecurityInsights providers) and update resource descriptions and field mappings formatting across templates.

v-sabiraj requested review from a team as code owners March 25, 2026 08:18

v-sabiraj requested a review from Copilot March 25, 2026 08:19

Copilot AI reviewed Mar 25, 2026

rahul0216 reviewed Mar 25, 2026

v-atulyadav assigned rahul0216 and v-atulyadav Mar 25, 2026

v-atulyadav added Solution Solution specialty review needed Analytic Rules labels Mar 25, 2026

rahul0216 previously approved these changes Mar 25, 2026

v-sabiraj dismissed rahul0216’s stale review via a1be0df March 25, 2026 10:52

Update ReleaseNotes.md

31f516f

v-atulyadav approved these changes Mar 25, 2026

v-atulyadav merged commit 3a5fea0 into master Mar 25, 2026
36 checks passed

v-atulyadav merged 4 commits into master from v-sabiraj-addinganalyticrulesinstructions

4 participants
