Introduce blog post for disk-based k-NN #3616

jmazanec15 · 2025-02-03T19:51:06Z

Description

Adds a blog for disk-based vector search

Issues Resolved

Check List

Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the BSD-3-Clause License.

Adds a blog post for disk-based k-NN. Included is a set of results and images. Signed-off-by: John Mazanec <[email protected]>

navneet1v · 2025-02-04T19:01:30Z

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md

+
+| Metric/Configuration              | in-memory | on_disk_8x | in_memory_8x | on_disk_16x | in_memory_16x | on_disk_32x | in_memory_32x |
+|-----------------------------------|-----------|------------|--------------|-------------|---------------|-------------|---------------|
+| recall@100 (ratio)                | 0.95      | 0.98       | 0.98         | 0.97        | 0.96          | 0.94        | 0.95          |


This shows that 32x compression just works and we should not add on_disk here

Added both just for sake of transparency. For some data sets, the re-scoring does not significantly help.

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md

Co-authored-by: Navneet Verma <[email protected]> Signed-off-by: John Mazanec <[email protected]>

Signed-off-by: Fanit Kolchina <[email protected]>

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md

Co-authored-by: John Mazanec <[email protected]> Signed-off-by: kolchfa-aws <[email protected]>

.github/vale/styles/Vocab/OpenSearch/Words/accept.txt

Signed-off-by: kolchfa-aws <[email protected]>

natebower

@kolchfa-aws @jmazanec15 Editorial review complete. Please see my comments and changes and let me know if you have any questions. Thanks!

Cc: @pajuric

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md

natebower · 2025-02-06T11:22:27Z

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md

+Interestingly, for this dataset, the on-disk approach with rescoring produces similar recall to the in-memory approach without rescoring, but the in-memory approach is substantially faster. This is most likely because the Cohere v3 model has been optimized to work very well with binary quantized data (see [this blog post](https://cohere.com/blog/int8-binary-embeddings)).
+
+## Learnings
+


Line 260: We haven't actually referenced ANN prior to this. Instead of "ANN approach", do we mean "nearest neighbor approach"?

Approximate nearest neighbor search

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md

Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: kolchfa-aws <[email protected]>

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md

Signed-off-by: kolchfa-aws <[email protected]>

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md

Signed-off-by: kolchfa-aws <[email protected]>

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md

Signed-off-by: kolchfa-aws <[email protected]>

kolchfa-aws · 2025-02-06T18:47:05Z

@pajuric Could you please edit the meta for this blog, and it will be ready to publish. Thanks!

Introduce blog post for disk-based k-NN

5f21a53

Adds a blog post for disk-based k-NN. Included is a set of results and images. Signed-off-by: John Mazanec <[email protected]>

jmazanec15 requested review from elfisher, AMoo-Miki, nknize, krisfreedain, peterzhuamazon, CEHENKLE, dtaivpp, kolchfa-aws, nateynateynate and natebower as code owners February 3, 2025 19:51

navneet1v reviewed Feb 4, 2025

View reviewed changes

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md Outdated Show resolved Hide resolved

Update _posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md

eb80b37

Co-authored-by: Navneet Verma <[email protected]> Signed-off-by: John Mazanec <[email protected]>

kolchfa-aws self-assigned this Feb 4, 2025

kolchfa-aws added 8 commits February 5, 2025 10:15

Formatting changes

a268d02

Signed-off-by: Fanit Kolchina <[email protected]>

Doc review

f5ede58

Signed-off-by: Fanit Kolchina <[email protected]>

Typo

7ee683c

Signed-off-by: Fanit Kolchina <[email protected]>

Resolve merge conflicts

db90ece

Signed-off-by: Fanit Kolchina <[email protected]>

Number the steps differently

5ac1ff5

Signed-off-by: Fanit Kolchina <[email protected]>

Highlight code snippets

f3eb7ad

Signed-off-by: Fanit Kolchina <[email protected]>

Change acronym

9fb4181

Signed-off-by: Fanit Kolchina <[email protected]>

Minor rewrite

0f21491

Signed-off-by: Fanit Kolchina <[email protected]>

jmazanec15 commented Feb 5, 2025

View reviewed changes

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md Outdated Show resolved Hide resolved

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md Outdated Show resolved Hide resolved

Apply suggestions from code review

da1dd70

Co-authored-by: John Mazanec <[email protected]> Signed-off-by: kolchfa-aws <[email protected]>

kolchfa-aws reviewed Feb 5, 2025

View reviewed changes

.github/vale/styles/Vocab/OpenSearch/Words/accept.txt Outdated Show resolved Hide resolved

Update .github/vale/styles/Vocab/OpenSearch/Words/accept.txt

dc84437

Signed-off-by: kolchfa-aws <[email protected]>

natebower reviewed Feb 6, 2025

View reviewed changes

kolchfa-aws reviewed Feb 6, 2025

View reviewed changes

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md Outdated Show resolved Hide resolved

Apply suggestions from code review

0c4cccc

Co-authored-by: Nathan Bower <[email protected]> Signed-off-by: kolchfa-aws <[email protected]>

kolchfa-aws reviewed Feb 6, 2025

View reviewed changes

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md Outdated Show resolved Hide resolved

Update _posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md

b483309

Signed-off-by: kolchfa-aws <[email protected]>

kolchfa-aws reviewed Feb 6, 2025

View reviewed changes

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md Outdated Show resolved Hide resolved

Update _posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md

1436072

Signed-off-by: kolchfa-aws <[email protected]>

kolchfa-aws reviewed Feb 6, 2025

View reviewed changes

_posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md Outdated Show resolved Hide resolved

Update _posts/2025-02-03-Reduce-Cost-with-Disk-based-Vector-Search.md

24b8ff1

Signed-off-by: kolchfa-aws <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce blog post for disk-based k-NN #3616

Introduce blog post for disk-based k-NN #3616

jmazanec15 commented Feb 3, 2025

navneet1v Feb 4, 2025

jmazanec15 Feb 4, 2025

natebower left a comment

natebower Feb 6, 2025

jmazanec15 Feb 6, 2025

kolchfa-aws commented Feb 6, 2025

		Interestingly, for this dataset, the on-disk approach with rescoring produces similar recall to the in-memory approach without rescoring, but the in-memory approach is substantially faster. This is most likely because the Cohere v3 model has been optimized to work very well with binary quantized data (see [this blog post](https://cohere.com/blog/int8-binary-embeddings)).

		## Learnings

Introduce blog post for disk-based k-NN #3616

Are you sure you want to change the base?

Introduce blog post for disk-based k-NN #3616

Conversation

jmazanec15 commented Feb 3, 2025

Description

Issues Resolved

Check List

navneet1v Feb 4, 2025

Choose a reason for hiding this comment

jmazanec15 Feb 4, 2025

Choose a reason for hiding this comment

natebower left a comment

Choose a reason for hiding this comment

natebower Feb 6, 2025

Choose a reason for hiding this comment

jmazanec15 Feb 6, 2025

Choose a reason for hiding this comment

kolchfa-aws commented Feb 6, 2025