
Conversation

@juliangruber (Member) commented Oct 1, 2025:

As in #315, TS errors that I don't know how to fix are making CI fail.

Closes #306

@juliangruber juliangruber marked this pull request as ready for review October 2, 2025 13:16
@pyropy (Contributor) left a comment:

Changes look good. I'd only like to discuss the cache design before giving it an approval.

findInBadBits(env, pieceCid),
])
const indexCacheKey = `${payerWalletAddress}/${pieceCid}`
let [dataSetId, serviceUrl] =
@pyropy (Contributor) commented on the code above:

Is it worth exploring the possibility of caching multiple data sets that share the same indexCacheKey? Users may have the same piece stored across multiple CDN-enabled data sets. If we only cache information for one data set, users could face retrieval failures when the cached data set's egress limit is reached, even though the same piece exists in other data sets.

Apart from that, it would also be nice to cache other info like egress usage and remaining egress quota (maybe not in this pull request).

@juliangruber (Member, Author) replied:

This is a great point. I missed it for so long! I will think about it. Immediate thoughts:

  • store an array of possible pieces as the value
  • store multiple kv pairs, and perform a list() (slower)
  • rotate the cache value after retrieval (pick a different possible piece); see the sketch below
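
A minimal sketch of the first and third ideas combined, assuming the KV value is a JSON array of candidates. The `{ dataSetId, serviceUrl }` value shape and the `pickDataSet` helper are illustrative, not part of this PR:

```js
// Sketch: cache several candidate data sets under one key, rotate on read.
// `env.INDEX_CACHE_KV` is from this PR; the value shape is an assumption.
async function pickDataSet(env, payerWalletAddress, pieceCid) {
  const key = `${payerWalletAddress}/${pieceCid}`
  // Cached value: an array of { dataSetId, serviceUrl } candidates.
  const candidates = await env.INDEX_CACHE_KV.get(key, 'json')
  if (!candidates || candidates.length === 0) return null
  const [current, ...rest] = candidates
  // Rotate so the next retrieval tries a different data set, spreading
  // egress across all data sets that hold this piece.
  await env.INDEX_CACHE_KV.put(key, JSON.stringify([...rest, current]))
  return current
}
```

Note the get-then-put rotation is not atomic: concurrent retrievals can overwrite each other's rotation. That is harmless for load spreading, but it matters for counters (see the egress-quota thread below).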

@bajtos (Contributor) left a comment:

Great start!

@juliangruber juliangruber requested a review from bajtos October 14, 2025 12:08
@bajtos (Contributor) left a comment:

Please re-request a review after you implement the change we agreed on yesterday, where the bad-bits worker will use the KV store only, no D1 database.

@juliangruber juliangruber requested a review from bajtos October 15, 2025 14:40
@juliangruber juliangruber requested review from bajtos and pyropy October 22, 2025 07:52
@juliangruber juliangruber marked this pull request as ready for review October 22, 2025 07:52
Base automatically changed from update/move-bad-bits-to-kv to main October 22, 2025 14:01
@bajtos (Contributor) left a comment:

I love how much simpler this pull request became after we removed the changes related to bad-bits 👏🏻

Comment on lines +100 to +102

results.map(async ({ payerAddress, pieceCID }) => {
  await env.INDEX_CACHE_KV.delete(`${payerAddress}/${pieceCID}`)
}),
@bajtos (Contributor) commented Oct 22, 2025:

Can this run into the limit of KV calls we can make per worker invocation? (I vaguely remember the number 1000.)

I think it's not likely for a long time, so we don't need to worry about that too much right now.

But it would be nice to have some visibility, so that we know early when we have a user approaching 1000 pieces stored. For example, we can have a Grafana chart with an alert where we show the value returned by a SQL query like the following one:

SELECT MAX(piece_count)
FROM (
  SELECT COUNT(*) AS piece_count
  FROM pieces
  INNER JOIN data_sets ON pieces.data_set_id = data_sets.id
  GROUP BY payer_address
)

I propose to open a follow-up tech-debt issue.

The question is whether we need this for the GA launch, and I don't think so.

Thoughts?

@juliangruber (Member, Author) replied:

Oh right, it can happen when there are at least 1000 pieces in a data set, for example. I don't see this case as unlikely.

I see two options going forward:

  • use queues
  • use the REST API, which has higher batch limits (sketched below)

I will evaluate both tomorrow.
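
For reference, the REST-API option would look roughly like the sketch below. The `CF_ACCOUNT_ID`, `CF_API_TOKEN`, and `KV_NAMESPACE_ID` names are hypothetical secrets/vars, and the helper is not part of this PR; the Cloudflare KV bulk endpoint itself accepts up to 10,000 keys per request:

```js
// Sketch of the REST-API option: one bulk call instead of N .delete() calls.
// CF_ACCOUNT_ID, CF_API_TOKEN, and KV_NAMESPACE_ID are assumed secrets/vars.
async function bulkDeleteIndexCacheKeys(env, keys) {
  const url = `https://api.cloudflare.com/client/v4/accounts/${env.CF_ACCOUNT_ID}/storage/kv/namespaces/${env.KV_NAMESPACE_ID}/bulk`
  // The bulk endpoint accepts up to 10,000 keys per request.
  for (let i = 0; i < keys.length; i += 10000) {
    const res = await fetch(url, {
      method: 'DELETE',
      headers: {
        Authorization: `Bearer ${env.CF_API_TOKEN}`,
        'Content-Type': 'application/json',
      },
      body: JSON.stringify(keys.slice(i, i + 10000)),
    })
    if (!res.ok) throw new Error(`KV bulk delete failed: ${res.status}`)
  }
}
```

Because this goes through fetch() rather than the KV binding, it counts against the worker's subrequest limit instead of the per-invocation KV operation limit.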

@bajtos (Contributor) replied:

Yes, on second thought, I also concluded that the limit of 1000 pieces per data set is too low, and we need to explore other options.

Considering the complexities, maybe we should put this performance optimisation on hold until the GA launch. WDYT?

@juliangruber (Member, Author) replied:

Sounds good, let's reevaluate.

@bajtos (Contributor) commented Oct 22, 2025:

@juliangruber please get @pyropy's approval before landing this change.

His comment about a potential design issue seems relevant to me.

#323 (comment)

])

const indexCacheKey = `${payerWalletAddress}/${pieceCid}`
const [indexCacheValue, isBadBit] = await Promise.all([
@pyropy (Contributor) commented on the code above:

We're also going to need to store the egress quota inside the KV store, as we're not going to query the database unless indexCacheValue is null or undefined.

How are we supposed to update these values, given that a KV store update is not an atomic operation?
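
To make the concern concrete, the read path under discussion is a read-through cache along these lines (a sketch only; `env.DB` and the SQL schema are illustrative assumptions, not the actual code):

```js
// Read-through sketch: serve from KV, fall back to D1, then backfill KV.
// `env.DB` (a D1 binding) and the table/column names are assumptions.
async function lookupIndex(env, payerWalletAddress, pieceCid) {
  const indexCacheKey = `${payerWalletAddress}/${pieceCid}`
  const cached = await env.INDEX_CACHE_KV.get(indexCacheKey, 'json')
  if (cached) return cached // fast path: the database is never queried
  const row = await env.DB.prepare(
    'SELECT data_set_id, service_url FROM pieces WHERE payer_address = ? AND piece_cid = ?'
  ).bind(payerWalletAddress, pieceCid).first()
  if (!row) return null
  await env.INDEX_CACHE_KV.put(indexCacheKey, JSON.stringify(row))
  return row
}
```

The atomicity question bites on the write side: KV is last-writer-wins and eventually consistent, so a read-modify-write of a quota counter can silently drop updates. One way around it is to keep mutable counters in D1 (or a Durable Object) and cache only the immutable lookup data.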

@juliangruber juliangruber marked this pull request as draft October 27, 2025 15:20
@juliangruber (Member, Author) commented:
Converting back to draft, as we're deprioritizing this in favor of ipfs/egress/x402 work.



Merging this pull request may close the issue: Use KV for index lookup