
Semantic Text Chunking Indexing Pressure #125517

Merged: 54 commits merged into elastic:main on Apr 14, 2025

Conversation

@Mikep86 (Contributor) commented Mar 24, 2025

We have observed many OOMs due to the memory required to inject chunked inference results for semantic_text fields. This PR uses coordinating indexing pressure to account for this memory usage. When indexing pressure memory usage exceeds the threshold set by indexing_pressure.memory.limit, chunked inference result injection will be suspended to prevent OOMs.
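
For context on the mechanism, below is a minimal illustrative sketch of byte-based coordinating memory accounting, not the Elasticsearch implementation itself (which builds on the internal IndexingPressure component): each injection of chunked inference results first reserves an estimate of the bytes it will hold in coordinating-node memory, and the reservation is refused once the limit configured by indexing_pressure.memory.limit would be exceeded, signaling the caller to suspend the injection rather than allocate. The class and method names here are hypothetical.

```java
import java.util.concurrent.atomic.AtomicLong;

/**
 * Illustrative sketch only: a simplified coordinating memory accountant in the
 * spirit of indexing pressure. Names are hypothetical and do not reflect the
 * actual Elasticsearch internals touched by this PR.
 */
final class CoordinatingMemorySketch {

    private final long limitInBytes;                 // e.g. the value of indexing_pressure.memory.limit
    private final AtomicLong inFlightBytes = new AtomicLong();

    CoordinatingMemorySketch(long limitInBytes) {
        this.limitInBytes = limitInBytes;
    }

    /**
     * Reserve {@code bytes} of coordinating memory before injecting chunked
     * inference results. Returns false when the reservation would exceed the
     * limit, in which case the caller suspends the injection instead of
     * allocating and risking an OOM.
     */
    boolean tryAcquire(long bytes) {
        long newTotal = inFlightBytes.addAndGet(bytes);
        if (newTotal > limitInBytes) {
            inFlightBytes.addAndGet(-bytes);         // roll back the failed reservation
            return false;
        }
        return true;
    }

    /** Release a previously acquired reservation once the results are flushed. */
    void release(long bytes) {
        inFlightBytes.addAndGet(-bytes);
    }
}
```

In this pattern a caller would wrap each chunked-result injection in a reservation, release it once the results have been written into the shard-level request, and pause the injection whenever the reservation is refused.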

@Mikep86 Mikep86 added the >non-issue, :ml (Machine learning), :SearchOrg/Relevance (Label for the Search (solution/org) Relevance team), and v8.19.0 labels on Mar 24, 2025
@Mikep86 Mikep86 requested a review from kderusso March 24, 2025 16:59
@Mikep86 (Contributor, Author) commented Apr 8, 2025

@elasticmachine update branch

@Mikep86 (Contributor, Author) commented Apr 8, 2025

@elasticmachine update branch

@kderusso (Member) left a comment:

LGTM, thanks for iterating

@davidkyle (Member) left a comment:

LGTM

Thanks @Mikep86

@elasticsearchmachine (Collaborator) commented:
Hi @Mikep86, I've created a changelog YAML for you.

@jimczi jimczi added the :Distributed Indexing/Engine (Anything around managing Lucene and the Translog in an open shard) label on Apr 11, 2025
@elasticsearchmachine (Collaborator) commented:
Pinging @elastic/es-distributed-indexing (Team:Distributed Indexing)

@jimczi (Contributor) left a comment:

Thanks, @Mikep86.
Let’s update the PR title and summary to reflect the use of indexing pressure.
It would be great if someone from @elastic/es-distributed-indexing could review this section.

@Mikep86 Mikep86 changed the title from "Add Semantic Text Chunking OOM Circuit Breaker" to "Semantic Text Chunking Indexing Pressure" on Apr 11, 2025
@Mikep86 Mikep86 merged commit 85713f7 into elastic:main Apr 14, 2025
17 checks passed
@elasticsearchmachine (Collaborator) commented:
💔 Backport failed

Branch 8.x: commit could not be cherry-picked due to conflicts

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 125517

@Mikep86 (Contributor, Author) commented Apr 28, 2025

💚 All backports created successfully

Branch 8.19: backport created successfully

Questions? Please refer to the Backport tool documentation.

Mikep86 added a commit to Mikep86/elasticsearch that referenced this pull request Apr 28, 2025
We have observed many OOMs due to the memory required to inject chunked inference results for semantic_text fields. This PR uses coordinating indexing pressure to account for this memory usage. When indexing pressure memory usage exceeds the threshold set by indexing_pressure.memory.limit, chunked inference result injection will be suspended to prevent OOMs.

(cherry picked from commit 85713f7)

# Conflicts:
#	server/src/main/java/org/elasticsearch/node/NodeConstruction.java
#	server/src/main/java/org/elasticsearch/node/PluginServiceInstances.java
#	x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/InferencePlugin.java
#	x-pack/plugin/inference/src/test/java/org/elasticsearch/xpack/inference/action/filter/ShardBulkInferenceActionFilterTests.java
elasticsearchmachine added a commit that referenced this pull request Apr 28, 2025
* Semantic Text Chunking Indexing Pressure (#125517)

We have observed many OOMs due to the memory required to inject chunked inference results for semantic_text fields. This PR uses coordinating indexing pressure to account for this memory usage. When indexing pressure memory usage exceeds the threshold set by indexing_pressure.memory.limit, chunked inference result injection will be suspended to prevent OOMs.

(cherry picked from commit 85713f7)

# Conflicts:
#	server/src/main/java/org/elasticsearch/node/NodeConstruction.java
#	server/src/main/java/org/elasticsearch/node/PluginServiceInstances.java
#	x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/InferencePlugin.java
#	x-pack/plugin/inference/src/test/java/org/elasticsearch/xpack/inference/action/filter/ShardBulkInferenceActionFilterTests.java

* [CI] Auto commit changes from spotless

---------

Co-authored-by: elasticsearchmachine <[email protected]>
Labels: auto-backport (Automatically create backport pull requests when merged), backport pending, :Distributed Indexing/Engine (Anything around managing Lucene and the Translog in an open shard), >enhancement, :ml (Machine learning), :SearchOrg/Relevance (Label for the Search (solution/org) Relevance team), v8.19.0, v9.1.0
9 participants