[BUG]Span query in PPL is slower than date histogram aggregation in query DSL #3528

gaobinlong · 2025-04-09T06:05:17Z

What is the bug?

In Discover page of OSD, when executing a PPL query against an index pattern containing very huge data(about 750B documents), the query latency of PPL is much more than the latency of the similar query in DQL, which is one hundred seconds vs. seconds.

The PPL is: source = index* | where @timestamp>= '2025-03-25 03:31:32' and@timestamp<= '2025-04-09 03:31:32' | stats count() by span(@timestamp, 12h),

and the similar query DSL is:

{
  "query": {
    "bool": {
      "must": [],
      "filter": [
       
        {
          "range": {
            "@timestamp": {
              "gte": "2025-03-25T03:31:32.935Z",
              "lte": "2025-04-09T03:31:32.935Z",
              "format": "strict_date_optional_time"
            }
          }
        }
      ]
    }
  },
  "size":500,
  "aggs": {
    "2": {
      "date_histogram": {
        "field": "@timestamp",
        "fixed_interval": "12h",
        "time_zone": "+00:00",
        "min_doc_count": 1
      }
    }
  }
}

.

I see PPL will convert the span query to a composite aggregation, like this:

"aggregations": {
    "composite_buckets": {
      "composite": {
        "size": 1000,
        "sources": [
          {
            "span(@timestamp,12h)": {
              "date_histogram": {
                "field": "@timestamp",
                "missing_bucket": true,
                "missing_order": "first",
                "order": "asc",
                "fixed_interval": "12h"
              }
            }
          }
        ]
      },
      "aggregations": {
        "count()": {
          "value_count": {
            "field": "_index"
          }
        }
      }
    }
  }

, seems composite aggregation is slower than the date histogram aggregation.

How can one reproduce the bug?
Steps to reproduce the behavior:

Find a big dataset which contains billions of documents
Execute both the PPL and query DSL above to compare the latency

What is the expected behavior?
PPL should improve the performance.

What is your host/environment?
OpenSearch 3.0.0

Do you have any screenshots?
If applicable, add screenshots to help explain your problem.

Do you have any additional context?
Add any other context about the problem.

The text was updated successfully, but these errors were encountered:

gaobinlong added bug Something isn't working untriaged labels Apr 9, 2025

Swiddis added performance Make it fast! enhancement New feature or request and removed bug Something isn't working labels Apr 9, 2025

LantaoJin removed the untriaged label Apr 11, 2025

This was referenced Apr 15, 2025

Speed up aggregation pushdown for single group-by expression #3550

Draft

[BUG] TermsAggregation should accept null bucket opensearch-project/OpenSearch#17959

Open

penghuo mentioned this issue May 27, 2025

Make query.size_limit only affect the final results #3623

Merged

7 tasks

penghuo added the PPL Piped processing language label May 29, 2025

penghuo added this to PPL 2025 May 30, 2025

github-project-automation bot moved this to Not Started in PPL 2025 May 30, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[BUG]Span query in PPL is slower than date histogram aggregation in query DSL #3528

[BUG]Span query in PPL is slower than date histogram aggregation in query DSL #3528

gaobinlong commented Apr 9, 2025 •

edited

Loading

[BUG]Span query in PPL is slower than date histogram aggregation in query DSL #3528

[BUG]Span query in PPL is slower than date histogram aggregation in query DSL #3528

Comments

gaobinlong commented Apr 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

gaobinlong commented Apr 9, 2025 •

edited

Loading