GH-5291 Add asynchronous fsync to LuceneSail #5446

Ostrzyciel · 2025-09-22T16:53:52Z

GitHub issue resolved: #5291

Briefly describe the changes proposed in this PR:

As described in the issue, the current setup with an fsync after each transaction is very safe, but also a huge bottleneck when dealing with many small transactions. This PR introduces an option that allows for asynchronous fsyncs in the background, on a fixed interval. If there is nothing to sync, it does nothing.

I tested this with the original workload with which I found the issue. When I set fsyncInterval to 5000 ms, it went from ~10–12 TX/s to ~100–150 TX/s, over an HTTP connection. That's basically the same as when I tried removing the fsync entirely. Great :)
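For readers unfamiliar with the approach, here is a minimal sketch of interval-based background syncing. It is not the code from this PR: `FileSyncer` and `PeriodicSyncer` are hypothetical names, and the real change wraps a Lucene directory rather than a plain callback.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Collection;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

/** Hypothetical stand-in for whatever performs the real fsync. */
interface FileSyncer {
	void sync(Collection<String> names) throws IOException;
}

/** Illustrative only: queues file names and syncs them on a fixed interval. */
class PeriodicSyncer implements AutoCloseable {
	private final Set<String> pending = new HashSet<>();
	private final FileSyncer delegate;
	private final ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor(r -> {
		Thread t = new Thread(r, "periodic-sync");
		t.setDaemon(true); // do not keep the JVM alive just for syncing
		return t;
	});

	PeriodicSyncer(FileSyncer delegate, long intervalMillis) {
		this.delegate = delegate;
		scheduler.scheduleWithFixedDelay(this::flushQuietly, intervalMillis, intervalMillis,
				TimeUnit.MILLISECONDS);
	}

	/** Called on commit: returns immediately instead of blocking on fsync. */
	void requestSync(Collection<String> names) {
		synchronized (pending) {
			pending.addAll(names);
		}
	}

	/** Syncs everything queued so far; does nothing if the queue is empty. */
	void flush() throws IOException {
		List<String> batch;
		synchronized (pending) {
			if (pending.isEmpty()) {
				return;
			}
			batch = new ArrayList<>(pending);
			pending.clear();
		}
		delegate.sync(batch); // the slow call happens outside the lock
	}

	private void flushQuietly() {
		try {
			flush();
		} catch (IOException e) {
			// The sketch only reports the error; a real implementation has to surface it.
			System.err.println("Periodic sync failed: " + e);
		}
	}

	@Override
	public void close() throws IOException {
		scheduler.shutdown(); // no further periodic runs
		flush(); // final sync so nothing queued is lost on shutdown
	}
}
```

With an interval of 5000 ms, as in the test above, many small transactions share a single fsync, which is where the throughput gain comes from. The trade-off is that a crash can lose up to one interval's worth of synced state.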

PR Author Checklist (see the contributor guidelines for more details):

my pull request is self-contained
I've added tests for the changes I made
I've applied code formatting (you can use mvn process-resources to format from the command line)
I've squashed my commits where necessary
every commit message starts with the issue number (GH-xxxx) followed by a meaningful description of the change

This requires this PR to be merged: eclipse-rdf4j/rdf4j#5446

hmottestad · 2025-09-24T19:39:55Z

I don't think that the close() method on the directory in the LuceneIndex class is ever called.

Ostrzyciel · 2025-09-24T19:54:58Z

> I don't think that the close() method on the directory in the LuceneIndex class is ever called.

@hmottestad oops, you are right! Fixed that and added a test to make sure it happens.

hmottestad · 2025-09-25T03:29:25Z

...ail/lucene/src/main/java/org/eclipse/rdf4j/sail/lucene/impl/DelayedSyncDirectoryWrapper.java

+			try {
+				super.syncMetaData();
+			} catch (IOException e) {
+				logger.error("IO error during a periodic sync of Lucene index metadata", e);


I'm a bit worried that if for some reason there is a persistent issue, then we may end up logging continuously but never actually throwing an exception.

What would usually happen if an IO exception was thrown (with the original code)? Would it bring down the entire application or just a particular transaction?

Ostrzyciel · 2025-09-25

This would result in a transaction rollback:

rdf4j/core/sail/lucene-api/src/main/java/org/eclipse/rdf4j/sail/lucene/LuceneSailConnection.java

Lines 266 to 269 in 18f9a56

	luceneIndex.commit();
} catch (IOException | SailException e) {
	logger.error("Rolling back", e);
	luceneIndex.rollback();

We cannot do the same thing 1:1 with asynchronous fsyncs, because we don't wait for the result of the fsync. The next best thing we can do is to throw an exception on the next transaction.

I've added a bit of code for that, along with a test.
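The "throw on the next transaction" idea can be sketched as follows. This is an illustration under assumptions, not the code added in this PR: the class name `DeferredSyncFailure` and its methods are hypothetical, and whether the real implementation clears the stored error after rethrowing it is not stated in this thread.

```java
import java.io.IOException;
import java.util.concurrent.atomic.AtomicReference;

/** Illustrative only: remembers a background sync failure and rethrows it later. */
class DeferredSyncFailure {
	private final AtomicReference<IOException> lastFailure = new AtomicReference<>();

	/** Called from the background sync thread when an fsync fails. */
	void record(IOException e) {
		// Keep the first failure; later ones are often consequences of it.
		lastFailure.compareAndSet(null, e);
	}

	/** Called at the start of the next commit; throws the stored failure once. */
	void rethrowIfFailed() throws IOException {
		IOException e = lastFailure.getAndSet(null); // assumption: the error is cleared once reported
		if (e != null) {
			throw new IOException("A previous background sync failed", e);
		}
	}
}
```

Because the exception surfaces inside a commit, the existing catch block shown above would handle it and roll that transaction back, so a persistent IO problem cannot stay hidden behind log messages.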

...ail/lucene/src/main/java/org/eclipse/rdf4j/sail/lucene/impl/DelayedSyncDirectoryWrapper.java

core/sail/lucene/src/main/java/org/eclipse/rdf4j/sail/lucene/impl/LuceneIndex.java

hmottestad · 2025-09-25T03:34:30Z

Thanks for the good fix. I think this will be a good solution overall; there are just a few small things I want to make sure are robust.

hmottestad · 2025-09-25T03:36:23Z

...ail/lucene/src/main/java/org/eclipse/rdf4j/sail/lucene/impl/DelayedSyncDirectoryWrapper.java

+	@Override
+	public void sync(Collection<String> names) throws IOException {
+		synchronized (pendingSyncs) {
+			pendingSyncs.addAll(names);


How big is this likely to grow? Should we have a hard limit (possibly configurable) so that we don't run out of memory before we sync?

Ostrzyciel · 2025-09-25

From what I can tell, there is no limit on this; it depends on the size of the Lucene index. I added a configurable limit, set to 5000 files by default, which should be good enough. There is also a test for this.
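One way such a limit could work is sketched below. The thread does not say what the PR does when the limit is reached; this sketch assumes the pending files are synced immediately on the calling thread. `SyncTarget` and `BoundedPendingSyncs` are hypothetical names.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.Collection;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical stand-in for the wrapped directory's sync call. */
interface SyncTarget {
	void sync(Collection<String> names) throws IOException;
}

/** Illustrative only: caps the number of file names waiting for a deferred sync. */
class BoundedPendingSyncs {
	private final Set<String> pending = new HashSet<>();
	private final SyncTarget target;
	private final int maxPending;

	BoundedPendingSyncs(SyncTarget target, int maxPending) {
		this.target = target;
		this.maxPending = maxPending;
	}

	/** Queues names; once the limit is reached, syncs the whole queue right away. */
	void add(Collection<String> names) throws IOException {
		List<String> overflow = null;
		synchronized (pending) {
			pending.addAll(names);
			if (pending.size() >= maxPending) { // assumption: limit is inclusive
				overflow = new ArrayList<>(pending);
				pending.clear();
			}
		}
		if (overflow != null) {
			target.sync(overflow); // synchronous fallback keeps memory use bounded
		}
	}

	int pendingCount() {
		synchronized (pending) {
			return pending.size();
		}
	}
}
```

Using a set means a file that is synced repeatedly between two background runs only occupies one slot, so the limit counts distinct files rather than sync requests.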

Ostrzyciel added a commit to Ostrzyciel/nanopub-query that referenced this pull request Sep 22, 2025

Draft: speed up Lucene indexing by deferring fsyncs

44645d9

This requires this PR to be merged: eclipse-rdf4j/rdf4j#5446

Ostrzyciel force-pushed the GH-5291-lucene-fsync branch from b6a649b to 8f2d4b7 Compare September 24, 2025 19:54

hmottestad reviewed Sep 25, 2025


GH-5291 Add asynchronous fsync to LuceneSail

ed3e249

Ostrzyciel force-pushed the GH-5291-lucene-fsync branch from 8f2d4b7 to ed3e249 Compare September 25, 2025 10:06

Ostrzyciel requested a review from hmottestad September 25, 2025 10:07

hmottestad changed the base branch from main to develop October 3, 2025 18:25

hmottestad mentioned this pull request Oct 3, 2025

GH-5291 lucene fsync with changes #5473

Open
