nlist computer example and nprobe suggestion #2771

seetimee · 2024-08-15T07:24:56Z

nlist computer example and nprobe suggestion

If the data volume of the entities is within the millions, you might consider using brute-force search. In other words, set nprobe to nlist.

one more nlist example

Seetimee patch 1

sre-ci-robot · 2024-08-15T07:25:01Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: seetimee

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

seetimee · 2024-08-21T11:13:41Z

/assign

liliu-z · 2024-10-09T09:49:19Z

site/en/faq/performance_faq.md

 Setting `nprobe` is specific to the dataset and scenario, and involves a trade-off between accuracy and query performance. We recommend finding the ideal value through repeated experimentation.
+If the data volume of the entities is within the millions, you might consider using brute-force search. In other words, set `nprobe` to `nlist`.


To my understanding, brute-force search performs better only at the scale of thousands of rows.

It seems that at the millions level it doesn't affect performance significantly.

There is some misunderstanding.
He meant that brute-force search doesn't hurt much for small cases.
In fact, for a 1M-row dataset, the performance gap between searching with and without an index can be 10~100x. FLAT can outperform an index only when the row count is below the thousands level. But Milvus will handle that for you.
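To make the point under discussion concrete, here is a minimal sketch of an IVF-style search in plain Python. It is a toy illustration, not Milvus code: `build_ivf`, `ivf_search`, and `flat_search` are hypothetical helpers, the centroids are picked at random instead of being trained, and the dataset size is arbitrary. It shows that setting `nprobe` equal to `nlist` scans every vector and returns the same result as a flat scan, while a smaller `nprobe` scans fewer vectors.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def build_ivf(vectors, nlist, seed=0):
    """Toy IVF build: sample nlist vectors as centroids (assumption: no
    k-means training) and assign every vector to its nearest centroid."""
    rng = random.Random(seed)
    centroids = rng.sample(vectors, nlist)
    buckets = [[] for _ in range(nlist)]
    for idx, vec in enumerate(vectors):
        nearest = min(range(nlist), key=lambda c: sq_dist(vec, centroids[c]))
        buckets[nearest].append(idx)
    return centroids, buckets


def ivf_search(query, vectors, centroids, buckets, nprobe, k):
    """Scan only the nprobe buckets whose centroids are closest to the query.
    Returns (top-k vector ids, number of vectors scanned)."""
    order = sorted(range(len(centroids)),
                   key=lambda c: sq_dist(query, centroids[c]))
    candidates = [i for c in order[:nprobe] for i in buckets[c]]
    top = sorted(candidates, key=lambda i: (sq_dist(query, vectors[i]), i))[:k]
    return top, len(candidates)


def flat_search(query, vectors, k):
    """Exhaustive (brute-force) scan over all vectors."""
    return sorted(range(len(vectors)),
                  key=lambda i: (sq_dist(query, vectors[i]), i))[:k]


rng = random.Random(42)
vectors = [[rng.random() for _ in range(8)] for _ in range(2000)]
query = [rng.random() for _ in range(8)]
nlist = 32
centroids, buckets = build_ivf(vectors, nlist)

exact = flat_search(query, vectors, k=5)
full_ids, full_scanned = ivf_search(query, vectors, centroids, buckets,
                                    nprobe=nlist, k=5)
few_ids, few_scanned = ivf_search(query, vectors, centroids, buckets,
                                  nprobe=4, k=5)

print("nprobe == nlist matches flat scan:", full_ids == exact)
print("vectors scanned with nprobe == nlist:", full_scanned)
print("vectors scanned with nprobe == 4:", few_scanned)
```

In this sketch the cost of a query grows with the number of vectors scanned, which is why `nprobe = nlist` gives exact results but no speedup over a flat scan. Whether that cost is acceptable at a given data volume is the question the review thread is debating, and it should be measured on the real dataset.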

seetimee added 3 commits August 15, 2024 14:31

Update performance_faq.md suggestion

e642cbf

If the data volume of the entities is within the millions, you might consider using brute-force search. In other words, set nprobe to nlist.

Update performance_faq.md

dfd0a9b

one more nlist example

Merge pull request #1 from seetimee/seetimee-patch-1

ff5f084

Seetimee patch 1

sre-ci-robot added the size/XS label Aug 15, 2024

sre-ci-robot assigned seetimee Aug 21, 2024

seetimee removed their assignment Aug 21, 2024

liliu-z suggested changes Oct 9, 2024


liliu-z Oct 9, 2024

seetimee Oct 9, 2024

seetimee Oct 9, 2024

liliu-z Oct 10, 2024
