[Rule-based Auto-tagging] Add autotagging label resolving logic for multiple attributes #19424

ruai0511 · 2025-09-25T19:50:29Z

Description

In the past, the auto tagging label resolving logic is only suitable for single attribute evaluation. Since we're adding more attributes into the feature now (username, role, index_pattern), we need a more comprehensive logic to find the best suited rule and label.

Feature documentation: https://docs.opensearch.org/latest/tuning-your-cluster/availability-and-recovery/rule-based-autotagging/autotagging/

Main classes & functions introduced:

Entry Point – evaluateLabel

public Optional<String> evaluateLabel(List<AttributeExtractor<String>> attributeExtractors)

Sorts extractors by priority.
Delegates resolution to FeatureValueResolver.resolve(...).
Returns the final label from FeatureValueResolutionResult.resolveLabel().

Central class to evaluate candidate values – FeatureValueResolver

Iterates over each AttributeExtractor.
For each extractor, delegates to FeatureValueCollector.
Maintains a running intersection across extractors (AND logic between attributes).

Extracting values for a single attribute– FeatureValueCollector

Each extractor may have subfields (e.g., "principal.username", "principal.role").
If multiple values are extracted, merges them according to the extractor’s logical operator (OR/AND)

Related Issues

Resolves #[Issue number to be closed when this PR is merged]

Check List

Functionality includes testing.
API changes companion pull request created, if applicable.
Public documentation issue/PR created, if applicable.

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.
For more information on following Developer Certificate of Origin and signing off your commits, please check here.

Signed-off-by: Ruirui Zhang <[email protected]>

github-actions · 2025-09-25T20:46:53Z

❌ Gradle check result for a8b2860: FAILURE

Please examine the workflow log, locate, and copy-paste the failure(s) below, then iterate to green. Is the failure a flaky test unrelated to your change?

jainankitk

Still reviewing the PR, few initial comments

modules/autotagging-commons/common/build.gradle

jainankitk · 2025-09-26T01:31:40Z

...commons/common/src/main/java/org/opensearch/rule/attribute_extractor/AttributeExtractor.java

+    enum LogicalOperator {
+        /**
+         * Logical AND
+         */
+        AND,
+        /**
+         * Logical OR
+         */
+        OR
+    }


Are we expecting anything other than AND/OR? If not, might be better to have method return boolean value, say isConjunction()?

I think that will not be ideal since the return value might be little ambiguous in instructions after the method call e,g;

boolean isAnd = isConjuntion(); .... if (!isAnd) // this is ambiguos as this doesn't directly imply OR here vs LogicalOperator.OR

jainankitk · 2025-09-26T01:41:44Z

...commons/src/main/java/org/opensearch/rule/feature_value_resolver/CandidateFeatureValues.java

+     * This helps in tie-breaking: values appearing earlier in the list (i.e., more specific matches)
+     * are considered better matches when resolving the final label.
+     */
+    private final Map<String, Integer> firstOccurrenceIndex = new HashMap<>();


I am assuming this is for optimizing the lookup? Have we considered the latency impact without having this index?

We haven’t measured the latency impact/ run latency tests yet, but expect this should make lookups faster. Without it, we would need to iterate through every element in the list to determine the earliest occurrence, which would be way less efficient.

jainankitk

As discussed offline, for evaluating principal attribute, we should use exact match for username and role instead of prefix. The admins can create role mapping for specific user patterns (which supports regex, not just prefix) instead of working with prefix/patterns as part principal attribute in WLM. Hopefully, that should make FeatureValueResolver logic simpler and easier to follow.

We can always support that prefix based principal values, if there is strong ask for it in future, but unable to see that for now.

ruai0511 requested a review from a team as a code owner September 25, 2025 19:50

add autotagging label resolving logic

a8b2860

Signed-off-by: Ruirui Zhang <[email protected]>

ruai0511 force-pushed the security-label branch from ad13630 to a8b2860 Compare September 25, 2025 20:38

jainankitk reviewed Sep 26, 2025

View reviewed changes

jainankitk requested changes Sep 27, 2025

View reviewed changes

ruai0511 mentioned this pull request Sep 29, 2025

[DOC] Update Rule-based Auto-tagging documentation to include principal attributes opensearch-project/documentation-website#11134

Closed

4 tasks

jainankitk added the v3.3.0 label Sep 29, 2025

ruai0511 closed this Sep 30, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Rule-based Auto-tagging] Add autotagging label resolving logic for multiple attributes #19424

[Rule-based Auto-tagging] Add autotagging label resolving logic for multiple attributes #19424

Uh oh!

ruai0511 commented Sep 25, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Sep 25, 2025

Uh oh!

jainankitk left a comment

Uh oh!

Uh oh!

jainankitk Sep 26, 2025

Uh oh!

kaushalmahi12 Sep 30, 2025

Uh oh!

jainankitk Sep 26, 2025

Uh oh!

ruai0511 Sep 26, 2025

Uh oh!

jainankitk left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[Rule-based Auto-tagging] Add autotagging label resolving logic for multiple attributes #19424

[Rule-based Auto-tagging] Add autotagging label resolving logic for multiple attributes #19424

Uh oh!

Conversation

ruai0511 commented Sep 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related Issues

Check List

Uh oh!

github-actions bot commented Sep 25, 2025

Uh oh!

jainankitk left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jainankitk Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

kaushalmahi12 Sep 30, 2025

Choose a reason for hiding this comment

Uh oh!

jainankitk Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

ruai0511 Sep 26, 2025

Choose a reason for hiding this comment

Uh oh!

jainankitk left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ruai0511 commented Sep 25, 2025 •

edited

Loading