-
Notifications
You must be signed in to change notification settings - Fork 25.2k
Fix bbq quantization algorithm but for differently distributed components #126778
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Pinging @elastic/es-search-relevance (Team:Search Relevance) |
Hi @benwtrent, I've created a changelog YAML for you. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I believe this was the same fix you made here but applied to ES correct: apache/lucene#14374
lgtm no concerns
…ents (elastic#126778) We had a silly bug in quantizing vectors in bbq where we were scaling the initial quantile optimization parameters incorrectly given the vector component distribution. In distributions where this has a major impact, the recall results were abysmal and rendered the quantization technique useless. In modern, well distributed components, this change is almost a no-op.
…ents (#126778) (#126794) We had a silly bug in quantizing vectors in bbq where we were scaling the initial quantile optimization parameters incorrectly given the vector component distribution. In distributions where this has a major impact, the recall results were abysmal and rendered the quantization technique useless. In modern, well distributed components, this change is almost a no-op.
…ents (#126778) (#126793) We had a silly bug in quantizing vectors in bbq where we were scaling the initial quantile optimization parameters incorrectly given the vector component distribution. In distributions where this has a major impact, the recall results were abysmal and rendered the quantization technique useless. In modern, well distributed components, this change is almost a no-op.
Hi, @benwtrent, will this change in quantification algorithm affect existing bbq quantification data? Do old vector segments need to be repaired? |
For older segments that were getting good results, this change won't effect them. For older segments that were getting very poor results, they will need to be reindexed to take advantage of the fix. |
We had a silly bug in quantizing vectors in bbq where we were scaling the initial quantile optimization parameters incorrectly given the vector component distribution.
In distributions where this has a major impact, the recall results were abysmal and rendered the quantization technique useless.
In modern, well distributed components, this change is almost a no-op.