Bug #3846: Release search version sorting is lexicographic instead of natural numeric order by YianZhao · Pull Request #3847 · eclipse-sw360/sw360

YianZhao · 2026-03-11T06:33:14Z

Fixes #3846

What happened

Release search sorted by version used lexicographic order, so versions like 1.10 could be ordered before 1.2.

Root cause

version_sort was indexed from raw doc.version string.

What changed

Added version normalization for sorting in ReleaseSearchHandler Lucene index function.
Added matching Java normalization utility used by tests.
Ensured numeric-length prefix padding is 6 digits (same behavior as JS logic).
Added ReleaseSearchHandlerTest regression tests:
- natural numeric segment sort (1.2 < 1.10)
- leading zero equivalence (1.02 == 1.2)
- numeric suffix sort (alpha2 < alpha10)
- explicit 6-digit length-prefix assertion

Verification

mvn -pl backend/common "-Dbase.deploy.dir=." -Dtest=ReleaseSearchHandlerTest test

Signed-off-by: Alex <[email protected]>

GMishx

Few questions

GMishx · 2026-03-12T12:52:56Z

backend/common/src/main/java/org/eclipse/sw360/datahandler/db/ReleaseSearchHandler.java

+                "    return lower.replace(/\\d+/g, function(match) {" +
+                "      var normalized = match.replace(/^0+(?!$)/, '');" +
+                "      var length = normalized.length.toString();" +
+                "      while (length.length < 6) { length = '0' + length; }" +
+                "      return '{' + length + normalized + '}';" +
+                "    });" +


Would you mind explaining the magic here?

This builds a natural sort key for version-like strings. It lowercases the input, then rewrites every numeric chunk into a sortable token: first its length (zero-padded), then the normalized number itself. That way lexicographic sorting compares numbers by numeric magnitude instead of plain string order, so for example 1.2.10 sorts after 1.2.3.

lower: first convert everything to lowercase, so case differences do not affect sorting.

replace(/\d+/g, ...): finds each contiguous numeric segment in the string.

match.replace(/^0+(?!$)/, ''): removes leading zeros, while still preserving a single 0.

for example, 0012 -> 12

and 000 -> 0

normalized.length: gets the length of that normalized numeric string.

Then the length is left-padded to a fixed width of 6 digits.

2 -> 000002

10 -> 000010

Finally it returns '{'+ length + normalized + '}'.

The purpose of this is to make plain string sorting behave like numeric sorting for embedded numbers.

For example:

1.2.3 → the numeric segment 3 becomes something like {0000013}

1.2.10 → the numeric segment 10 becomes something like {00000210}

Because the comparison looks at the length first, and then the value, this ensures that:

3 is smaller than 10

and you do not get the usual lexicographic problem where "10" < "3"

GMishx · 2026-03-12T12:53:12Z

backend/common/src/main/java/org/eclipse/sw360/datahandler/db/ReleaseSearchHandler.java

+                "    return lower.replace(/\\d+/g, function(match) {" +
+                "      var normalized = match.replace(/^0+(?!$)/, '');" +
+                "      var length = normalized.length.toString();" +
+                "      while (length.length < 6) { length = '0' + length; }" +


Why the magic number 6?

Thank for asking, 6 is just a fixed padding width for the length prefix, so all rewritten numeric tokens have a comparable shape.

For example:

3 -> length 1 -> 000001

10 -> length 2 -> 000002

123 -> length 3 -> 000003

I chose 6 simply as a sufficiently large constant for expected version segments, not because it has special meaning. The goal is only to keep the length field fixed-width so lexicographic comparison works reliably. If we want, I can replace it with a named constant or add a short comment to make that clearer.

GMishx · 2026-03-12T12:53:27Z

backend/common/src/main/java/org/eclipse/sw360/datahandler/db/ReleaseSearchHandler.java

        };
    }
+
+    static String normalizeVersionForSort(String version) {


Unused function declared??

normalizeVersionForSort(String) is currently used by unit tests as a Java mirror of the JS index normalization logic, to ensure both implementations stay consistent.

fix(release-search): use natural numeric version sort key

956de43

Signed-off-by: Alex <[email protected]>

YianZhao requested review from GMishx, KoukiHama, ag4ums and arunazhakesan as code owners March 11, 2026 06:33

Merge branch 'main' into bug/release-version-natural-sort

76d6dc7

GMishx added needs code review needs general test This is general testing, meaning that there is no org specific issue to check for labels Mar 12, 2026

GMishx requested changes Mar 12, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug #3846: Release search version sorting is lexicographic instead of natural numeric order#3847

Bug #3846: Release search version sorting is lexicographic instead of natural numeric order#3847
YianZhao wants to merge 2 commits intoeclipse-sw360:mainfrom
YianZhao:bug/release-version-natural-sort

YianZhao commented Mar 11, 2026

Uh oh!

GMishx left a comment

Uh oh!

GMishx Mar 12, 2026

Uh oh!

YianZhao Mar 12, 2026

Uh oh!

GMishx Mar 12, 2026

Uh oh!

YianZhao Mar 12, 2026

Uh oh!

GMishx Mar 12, 2026

Uh oh!

YianZhao Mar 12, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

YianZhao commented Mar 11, 2026

What happened

Root cause

What changed

Verification

Uh oh!

GMishx left a comment

Choose a reason for hiding this comment

Uh oh!

GMishx Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

YianZhao Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

GMishx Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

YianZhao Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

GMishx Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

YianZhao Mar 12, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants