[Inference API] Add "rerank" task type to "elastic" provider #126022

timgrein · 2025-04-01T08:55:52Z

Implements the rerank task type for the elastic provider.

elasticsearchmachine · 2025-04-01T10:55:32Z

Pinging @elastic/search-inference-team (Team:Search - Inference)

elasticsearchmachine · 2025-04-01T10:55:32Z

Pinging @elastic/search-eng (Team:SearchOrg)

elasticsearchmachine · 2025-05-06T11:03:23Z

Pinging @elastic/ml-core (Team:ML)

jonathan-buttner

Looking good, left some suggestions

jonathan-buttner · 2025-05-06T12:41:25Z

...ch/xpack/inference/external/request/elastic/rerank/ElasticInferenceServiceRerankRequest.java

+        this.query = query;
+        this.documents = documents;
+        this.model = Objects.requireNonNull(model);
+        this.uri = model.uri();


nit: We probably don't need a reference to the uri since we have a reference to the model.

jonathan-buttner · 2025-05-06T14:32:20Z

...sticsearch/xpack/inference/services/elastic/ElasticInferenceServiceRerankRequestManager.java

@@ -0,0 +1,79 @@
+/*


We're trying to transition away from the request manager pattern to avoid the extra class since all the classes are pretty similar.

Here's an example of how we implemented it for voyageai: #124512

And the rerank usage: https://github.com/elastic/elasticsearch/pull/124512/files#diff-3493deea8c9fd5276917f1f8a9d7f008268c34b61f230885df0643de10f19fffR71

jonathan-buttner · 2025-05-06T14:33:07Z

...sticsearch/xpack/inference/services/elastic/action/ElasticInferenceServiceActionCreator.java

+    public ExecutableAction create(ElasticInferenceServiceRerankModel model) {
+        var requestManager = new ElasticInferenceServiceRerankRequestManager(model, serviceComponents, traceContext);
+        var errorMessage = constructFailedToSendRequestMessage(
+            String.format(Locale.ROOT, "%s rerank", ELASTIC_INFERENCE_SERVICE_IDENTIFIER)


nit: I think Strings.format() avoids the need for Locale.ROOT.

jonathan-buttner · 2025-05-06T14:37:07Z

server/src/main/java/org/elasticsearch/TransportVersions.java

@@ -214,6 +214,7 @@ static TransportVersion def(int id) {
    public static final TransportVersion ESQL_REMOVE_AGGREGATE_TYPE = def(9_045_0_00);
    public static final TransportVersion ADD_PROJECT_ID_TO_DSL_ERROR_INFO = def(9_046_0_00);
    public static final TransportVersion SEMANTIC_TEXT_CHUNKING_CONFIG = def(9_047_00_0);
+    public static final TransportVersion ML_INFERENCE_ELASTIC_RERANK = def(9_048_0_00);


Looks like the PR is only targeting 9.1 did we also want to support 8.19? If so we'll need to add another transport version and do the backport dance.

jonathan-buttner · 2025-05-06T14:45:52Z

...earch/xpack/inference/services/elastic/rerank/ElasticInferenceServiceRerankTaskSettings.java

+            return EMPTY_SETTINGS;
+        }
+
+        Integer topNDocumentsOnly = extractOptionalPositiveInteger(


I think we can omit this class. We've moved the common rerank parameters up to the root level of the request and they're passed in to the infer() call from the InferenceAction class. So I think we'll want the ElasticInferenceService to Override the validateRerankParameters from SenderService to ensure only top n is set.

Here's where that's being called by the SenderService: https://github.com/elastic/elasticsearch/blob/main/x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/services/SenderService.java#L158

Add "rerank" task type to "elastic" provider

33b8110

elasticsearchmachine added v9.1.0 needs:triage Requires assignment of a team area label labels Apr 1, 2025

kingherc added the :SearchOrg/Inference Label for the Search Inference team label Apr 1, 2025

elasticsearchmachine added Team:SearchOrg Meta label for the Search Org (Enterprise Search) Team:Search - Inference and removed needs:triage Requires assignment of a team area label labels Apr 1, 2025

Resolve merge conflicts

efee615

timgrein added the >non-issue label Apr 8, 2025

Fix checkstyle violation

b4b1f05

timgrein added :ml Machine learning Team:ML Meta label for the ML team labels May 6, 2025

jonathan-buttner reviewed May 6, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Inference API] Add "rerank" task type to "elastic" provider #126022

[Inference API] Add "rerank" task type to "elastic" provider #126022

timgrein commented Apr 1, 2025

elasticsearchmachine commented Apr 1, 2025

elasticsearchmachine commented Apr 1, 2025

elasticsearchmachine commented May 6, 2025

jonathan-buttner left a comment

jonathan-buttner May 6, 2025

jonathan-buttner May 6, 2025

jonathan-buttner May 6, 2025

jonathan-buttner May 6, 2025

jonathan-buttner May 6, 2025

[Inference API] Add "rerank" task type to "elastic" provider #126022

Are you sure you want to change the base?

[Inference API] Add "rerank" task type to "elastic" provider #126022

Conversation

timgrein commented Apr 1, 2025

elasticsearchmachine commented Apr 1, 2025

elasticsearchmachine commented Apr 1, 2025

elasticsearchmachine commented May 6, 2025

jonathan-buttner left a comment

Choose a reason for hiding this comment

jonathan-buttner May 6, 2025

Choose a reason for hiding this comment

jonathan-buttner May 6, 2025

Choose a reason for hiding this comment

jonathan-buttner May 6, 2025

Choose a reason for hiding this comment

jonathan-buttner May 6, 2025

Choose a reason for hiding this comment

jonathan-buttner May 6, 2025

Choose a reason for hiding this comment