
[ML] Adding timeout to request for creating inference endpoint #126805


Conversation

@jonathan-buttner (Contributor) commented Apr 14, 2025

This PR also adds a timeout query parameter to the PUT request so users can specify a timeout longer than the default of 30 seconds. The 30-second default was used previously because it matches the ack timeout for the master node.

The timeout applies only to the start-deployment request and to the inference request issued during validation. The model download request ignores the timeout because it waits for the model to finish downloading before responding on the listener.
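For example, a request like the following allows up to five minutes for the deployment to start (the 5m value is illustrative; any duration accepted by Elasticsearch's time-value parsing works), using the same request body as the Testing example below:

PUT http://localhost:9200/_inference/sparse_embedding/elser2?timeout=5m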

Testing

PUT http://localhost:9200/_inference/sparse_embedding/elser2?timeout=2nanos
{
    "service": "elasticsearch",
    "service_settings": {
        "model_id": ".elser_model_2",
        "num_threads": 1,
        "adaptive_allocations": {
            "enabled": true,
            "min_number_of_allocations": 1,
            "max_number_of_allocations": 4
        }
    }
}

Because 2 nanoseconds is far too short for the model deployment to start, this request should fail with a response like the following:

{
    "error": {
        "root_cause": [
            {
                "type": "status_exception",
                "reason": "Timed out after [2nanos] waiting for model deployment to start. Use the trained model stats API to track the state of the deployment."
            }
        ],
        "type": "status_exception",
        "reason": "Timed out after [2nanos] waiting for model deployment to start. Use the trained model stats API to track the state of the deployment."
    },
    "status": 408
}
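As the error message suggests, the state of the deployment can then be tracked with the trained model stats API, for example:

GET http://localhost:9200/_ml/trained_models/.elser_model_2/_stats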

@jonathan-buttner added the v8.18.1, v8.19.0, v9.0.1, >bug, :ml (Machine learning), auto-backport (Automatically create backport pull requests when merged), and Team:ML (Meta label for the ML team) labels on Apr 14, 2025
@elasticsearchmachine (Collaborator)

Hi @jonathan-buttner, I've created a changelog YAML for you.

@davidkyle (Member) left a comment

LGTM

@jonathan-buttner jonathan-buttner marked this pull request as ready for review May 6, 2025 18:10
@elasticsearchmachine (Collaborator)

Pinging @elastic/ml-core (Team:ML)

@jonathan-buttner jonathan-buttner merged commit 4c507e2 into elastic:main May 6, 2025
17 checks passed
@elasticsearchmachine (Collaborator)

💔 Backport failed

Branch 8.19: commit could not be cherry-picked due to conflicts.

You can backport manually with sqren/backport by running: backport --upstream elastic/elasticsearch --pr 126805

@jonathan-buttner (Contributor, Author)

💚 All backports created successfully

Branch 8.19: backport created successfully.

Questions? Please refer to the Backport tool documentation.

jonathan-buttner added a commit to jonathan-buttner/elasticsearch that referenced this pull request May 6, 2025
[ML] Fixing bug with TransportPutModelAction listener and adding timeout to request (elastic#126805)

* Fixing bug with listener and adding timeout

* Update docs/changelog/126805.yaml

* Fixing tests

* Fixing writeTo

(cherry picked from commit 4c507e2)

# Conflicts:
#	server/src/main/java/org/elasticsearch/TransportVersions.java
@jonathan-buttner jonathan-buttner changed the title [ML] Fixing bug with TransportPutModelAction listener and adding timeout to request [ML] Adding timeout to request for creating inference endpoint May 6, 2025
Labels
auto-backport (Automatically create backport pull requests when merged), backport pending, >bug, :ml (Machine learning), Team:ML (Meta label for the ML team), v8.19.0, v9.1.0