Adding support for binary embedding type to Cohere service embedding type #120751

ymao1 · 2025-01-23T18:39:52Z

Summary

Adds support for binary embedding type in the Cohere text_embedding task.

Usage

# Create a Cohere binary inference endpoint
PUT /_inference/text_embedding/cohere_embeddings_bit
{
    "service": "cohere",
    "service_settings": {
        "api_key": <apiKey>,
        "model_id": "embed-english-v3.0",
        "embedding_type": "bit"
    }
}

# Perform an inference task
POST /_inference/text_embedding/cohere_embeddings_bit
{
    "input": "hello",
    "task_settings": {
        "input_type": "ingest"
    }
}

# Response
{
    "text_embedding_bits": [
        {
            "embedding": [
                -55,
                74,
                101,
                67,
                83,
                1,
                53,
                -101,
                -71,
                -98,
                -116,
                -99,
                80,
                -49,
                65,
                .
                .
                .
            ]
        }
    ]
}

Notes

Requesting a binary embedding from Cohere returns an array of binary embeddings encoded as bytes with int8 precision. Since this aligns with what is expected as input when you specify a dense_vector mapping with element_type: bit, we do not perform any bit unpacking on the Cohere response and handle the bytes as-is.

…type

davidkyle

Nice!

This looks good to merge, please make it a non draft PR when you are ready.

If it is simple to add in this PR then Jina AI also have binary embeddings. They have a good blog at https://jina.ai/news/binary-embeddings-all-the-ai-3125-of-the-fat/

...ore/src/main/java/org/elasticsearch/xpack/core/inference/results/InferenceByteEmbedding.java

…inference/results/InferenceByteEmbedding.java Co-authored-by: David Kyle <[email protected]>

elasticsearchmachine · 2025-01-30T12:45:49Z

Hi @ymao1, I've created a changelog YAML for you.

ymao1 · 2025-01-30T12:46:32Z

If it is simple to add in this PR then Jina AI also have binary embeddings.

I will do this in a follow-up PR!

…1747

github-actions · 2025-01-30T14:01:11Z

It looks like this PR modifies one or more .asciidoc files. These files are being migrated to Markdown, and any changes merged now will be lost. See the migration guide for details.

ymao1 · 2025-01-30T14:04:33Z

Reverted the docs change due to the docs freeze, looks like I'll have to create a PR to 8.x for the asciidocs change and then an issue in https://github.com/elastic/docs-content/issues to get the docs update to 9.0

elasticsearchmachine · 2025-01-30T15:08:47Z

Pinging @elastic/ml-core (Team:ML)

davidkyle

LGTM

elasticsearchmachine · 2025-02-03T18:58:25Z

💔 Backport failed

Status	Branch	Result
❌	8.x	Commit could not be cherrypicked due to conflicts

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 120751

ymao1 · 2025-02-03T21:46:15Z

💚 All backports created successfully

Status	Branch	Result
✅	8.x

Questions ?

Please refer to the Backport tool documentation

ymao1 · 2025-02-04T14:49:53Z

Documentation request made in elastic/docs-content#310

…dding type (#120751) (#121584) * Adding support for binary embedding type to Cohere service embedding type (#120751) * Adding support for binary embedding type to Cohere service embedding type * Returning response in separate text_embedding_bits field * Update x-pack/plugin/core/src/main/java/org/elasticsearch/xpack/core/inference/results/InferenceByteEmbedding.java Co-authored-by: David Kyle <[email protected]> * Update docs/changelog/120751.yaml * Reverting docs change --------- Co-authored-by: David Kyle <[email protected]> (cherry picked from commit 89d71e1) # Conflicts: # server/src/main/java/org/elasticsearch/TransportVersions.java # x-pack/plugin/core/src/main/java/org/elasticsearch/xpack/core/inference/results/InferenceTextEmbeddingByteResults.java * Adding docs

Adding support for binary embedding type to Cohere service embedding …

e4b3d56

…type

elasticsearchmachine added the v9.0.0 label Jan 23, 2025

ymao1 changed the title ~~Adding support for binary embedding type to Cohere service embedding …~~ Adding support for binary embedding type to Cohere service embedding type Jan 23, 2025

ymao1 added 2 commits January 23, 2025 16:33

Returning response in separate text_embedding_bits field

9b687c8

Merging in main

12a1154

ymao1 mentioned this pull request Jan 27, 2025

Support for bit precision in the Inference API text_embedding task #111747

Open

davidkyle added the :ml Machine learning label Jan 30, 2025

davidkyle reviewed Jan 30, 2025

View reviewed changes

...ore/src/main/java/org/elasticsearch/xpack/core/inference/results/InferenceByteEmbedding.java Outdated Show resolved Hide resolved

ymao1 and others added 2 commits January 30, 2025 07:42

Update x-pack/plugin/core/src/main/java/org/elasticsearch/xpack/core/…

e6034d8

…inference/results/InferenceByteEmbedding.java Co-authored-by: David Kyle <[email protected]>

Merging in main

bad9754

ymao1 added the >enhancement label Jan 30, 2025

Update docs/changelog/120751.yaml

b170eaf

ymao1 added 2 commits January 30, 2025 09:00

Merge branch 'main' of github.com:elastic/elasticsearch into es-111747

a4e1627

Merge branch 'es-111747' of github.com:ymao1/elasticsearch into es-11…

9e0c9a1

…1747

Reverting docs change

a13caa0

ymao1 marked this pull request as ready for review January 30, 2025 15:08

elasticsearchmachine added Team:ML Meta label for the ML team v9.1.0 and removed v9.0.0 labels Jan 30, 2025

davidkyle approved these changes Jan 31, 2025

View reviewed changes

ymao1 added v8.19.0 auto-backport Automatically create backport pull requests when merged labels Jan 31, 2025

ymao1 added 3 commits January 31, 2025 08:06

Merging in main

df723c2

Merging in main

a3a807a

Merge branch 'main' of github.com:elastic/elasticsearch into es-111747

a0d1a84

ymao1 merged commit 89d71e1 into elastic:main Feb 3, 2025
17 checks passed

ymao1 deleted the es-111747 branch February 3, 2025 18:55

elasticsearchmachine added the backport pending label Feb 3, 2025

ymao1 mentioned this pull request Feb 3, 2025

Adding patch transport version for COHERE_BIT_EMBEDDING_TYPE_SUPPORT_ADDED #121560

Merged

This was referenced Feb 3, 2025

[8.x] Adding support for binary embedding type to Cohere service embedding type (#120751) #121584

Merged

[REQUEST]: Add binary and bit embedding types to Cohere documentation elastic/docs-content#310

Open

This was referenced Feb 4, 2025

[ML] Adding text embedding bit to inference result output elastic/elasticsearch-specification#3697

Closed

[ML] Adding text embedding bit to inference result output elastic/elasticsearch-specification#3698

Merged

ymao1 removed the backport pending label Feb 4, 2025

davidkyle mentioned this pull request Mar 4, 2025

[ML] Support binary embeddings for Voyage AI #123983

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding support for binary embedding type to Cohere service embedding type #120751

Adding support for binary embedding type to Cohere service embedding type #120751

ymao1 commented Jan 23, 2025 •

edited

Loading

davidkyle left a comment •

edited

Loading

elasticsearchmachine commented Jan 30, 2025

ymao1 commented Jan 30, 2025

github-actions bot commented Jan 30, 2025

ymao1 commented Jan 30, 2025

elasticsearchmachine commented Jan 30, 2025

davidkyle left a comment

elasticsearchmachine commented Feb 3, 2025

ymao1 commented Feb 3, 2025

ymao1 commented Feb 4, 2025

Adding support for binary embedding type to Cohere service embedding type #120751

Adding support for binary embedding type to Cohere service embedding type #120751

Conversation

ymao1 commented Jan 23, 2025 • edited Loading

Summary

Usage

Notes

davidkyle left a comment • edited Loading

Choose a reason for hiding this comment

elasticsearchmachine commented Jan 30, 2025

ymao1 commented Jan 30, 2025

github-actions bot commented Jan 30, 2025

ymao1 commented Jan 30, 2025

elasticsearchmachine commented Jan 30, 2025

davidkyle left a comment

Choose a reason for hiding this comment

elasticsearchmachine commented Feb 3, 2025

💔 Backport failed

ymao1 commented Feb 3, 2025

💚 All backports created successfully

Questions ?

ymao1 commented Feb 4, 2025

ymao1 commented Jan 23, 2025 •

edited

Loading

davidkyle left a comment •

edited

Loading