[ML] Delay copying chunked input strings #125837

davidkyle · 2025-03-28T12:30:35Z

The chunker stores the position of each chunk's text in the original text rather than making copies. However, when an inference request is made the actual chunk text is required, at this point a copy must be made. The copying is done when String#subString() is called.

The PR reduces the lifetime of the string copies but returning a string Supplier in from the chunker and performing the copy closer to where the request will be made. See RequestExecutorService

elasticsearch/x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/external/http/sender/RequestExecutorService.java

Line 466 in a40370a

    
           .execute(task.getInferenceInputs(), requestSender, task.getRequestCompletedFunction(), task.getListener());

As a follow up once #125567 is merged EmbeddingInputs will be moved to Request#createHttpRequest() so that the string copy will be made at the point the http request is constructed further reducing the lifespan of the copy.

I had to change the logic around InferenceInput#inputSize() to avoid calling the supplier function early of find out if there was more than 1 input.

elasticsearchmachine · 2025-03-28T12:30:59Z

Pinging @elastic/ml-core (Team:ML)

jan-elastic

LGTM

elasticsearchmachine · 2025-04-01T13:55:14Z

💔 Backport failed

Status	Branch	Result
❌	8.x	Commit could not be cherrypicked due to conflicts

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 125837

The chunked text is only required when the actual inference request is made, using a string supplier means the string creation can be done much much closer to where the request is made reducing the lifespan of the copied string. (cherry picked from commit c521264) # Conflicts: # x-pack/plugin/inference/src/test/java/org/elasticsearch/xpack/inference/chunking/EmbeddingRequestChunkerTests.java

davidkyle · 2025-04-07T12:42:21Z

💚 All backports created successfully

Status	Branch	Result
✅	8.x

Questions ?

Please refer to the Backport tool documentation

The chunked text is only required when the actual inference request is made, using a string supplier means the string creation can be done much much closer to where the request is made reducing the lifespan of the copied string. (cherry picked from commit c521264) # Conflicts: # x-pack/plugin/inference/src/test/java/org/elasticsearch/xpack/inference/chunking/EmbeddingRequestChunkerTests.java

davidkyle added 2 commits March 28, 2025 10:16

Copy inputs later

8473285

Avoid calling suppiler for size

afdba53

davidkyle added >refactoring :ml Machine learning v8.19.0 v9.1.0 labels Mar 28, 2025

elasticsearchmachine added the Team:ML Meta label for the ML team label Mar 28, 2025

prwhelan approved these changes Mar 28, 2025

View reviewed changes

Merge branch 'main' into late-requests

9fae2e2

jan-elastic approved these changes Mar 31, 2025

View reviewed changes

davidkyle added the auto-backport Automatically create backport pull requests when merged label Mar 31, 2025

davidkyle enabled auto-merge (squash) March 31, 2025 10:02

Merge branch 'main' into late-requests

d668f0f

davidkyle merged commit c521264 into elastic:main Apr 1, 2025
17 checks passed

elasticsearchmachine added the backport pending label Apr 1, 2025

davidkyle mentioned this pull request Apr 7, 2025

[8.x] [ML] Delay copying chunked input strings (#125837) #126402

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ML] Delay copying chunked input strings #125837

[ML] Delay copying chunked input strings #125837

davidkyle commented Mar 28, 2025 •

edited

Loading

elasticsearchmachine commented Mar 28, 2025

jan-elastic left a comment

elasticsearchmachine commented Apr 1, 2025

davidkyle commented Apr 7, 2025

[ML] Delay copying chunked input strings #125837

[ML] Delay copying chunked input strings #125837

Conversation

davidkyle commented Mar 28, 2025 • edited Loading

elasticsearchmachine commented Mar 28, 2025

jan-elastic left a comment

Choose a reason for hiding this comment

elasticsearchmachine commented Apr 1, 2025

💔 Backport failed

davidkyle commented Apr 7, 2025

💚 All backports created successfully

Questions ?

davidkyle commented Mar 28, 2025 •

edited

Loading