[[semantic-search]]
== Semantic search

Semantic search is a search method that helps you find data based on the intent
and contextual meaning of a search query, instead of a match on query terms
(lexical search).

{es} provides semantic search capabilities using {ml-docs}/ml-nlp.html[natural
language processing (NLP)] and vector search. Deploying an NLP model to {es}
enables it to extract text embeddings out of text. Embeddings are vectors that
provide a numeric representation of a piece of text. Pieces of content with
similar meaning have similar representations.

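To build intuition for "similar meaning, similar representation", the following Python sketch compares invented three-dimensional vectors with cosine similarity. The vectors and values are made up purely for illustration; real embedding models emit far higher-dimensional output.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors: close to 1.0 means similar."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Invented toy "embeddings"; real models produce hundreds of dimensions.
king = [0.9, 0.8, 0.1]
monarch = [0.85, 0.75, 0.2]
bicycle = [0.1, 0.2, 0.9]

print(cosine_similarity(king, monarch))  # high: related concepts
print(cosine_similarity(king, bicycle))  # low: unrelated concepts
```
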
[[semantic-search-diagram]]
.A simplified representation of encoding textual concepts as vectors
image::images/search/vector-search-oversimplification.png[A simplified representation of encoding textual concepts as vectors,align="center"]

At query time, {es} can use the same NLP model to convert a query into
embeddings, enabling you to find documents with similar text embeddings.

This guide shows you how to implement semantic search with {es}, from selecting
an NLP model to writing queries.

[discrete]
[[semantic-search-select-nlp-model]]
=== Select an NLP model

{es} supports the use of a
{ml-docs}/ml-nlp-model-ref.html#ml-nlp-model-ref-text-embedding[wide range of NLP models],
including both dense and sparse vector models. Your choice of language model is
critical for implementing semantic search successfully.

While it is possible to bring your own text embedding model, achieving good
search results through model tuning is challenging. Selecting an appropriate
model from our third-party model list is only the first step. Training the
model on your own data is essential to ensure better search results than using
only BM25, but the training process requires a team of data scientists and ML
experts, making it expensive and time-consuming.

To address this issue, Elastic provides a pre-trained representational model
called {ml-docs}/ml-nlp-elser.html[Elastic Learned Sparse EncodeR (ELSER)].
ELSER, currently available only for English, is an out-of-domain sparse vector
model that does not require fine-tuning, which makes it suitable for various
NLP use cases out of the box. Unless you have a team of ML specialists, using
the ELSER model is highly recommended.

In a sparse vector representation, the vectors mostly consist of zero values,
with only a small subset containing non-zero values. This representation is
commonly used for textual data. In the case of ELSER, each document in an index
and the query text itself are represented by high-dimensional sparse vectors.
Each non-zero element of a vector corresponds to a term in the model
vocabulary. The ELSER vocabulary contains around 30,000 terms, so the sparse
vectors created by ELSER contain about 30,000 values, the majority of which
are zero. Effectively, the ELSER model replaces the terms of the original
query with other terms that have been learned to exist in the documents that
best match the original search terms in a training dataset, together with
weights that control how important each one is.

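As a rough illustration, an ELSER-style expansion can be pictured as a small dictionary of learned terms and weights; stored densely, almost all of the roughly 30,000 positions would be zero. The terms and weights below are invented for illustration, not actual ELSER output.

```python
VOCAB_SIZE = 30_000  # the ELSER vocabulary holds around 30,000 terms

# Invented expansion for the query "beach vacation": learned related terms
# and their importance weights (not real ELSER output).
query_expansion = {
    "beach": 2.1,
    "vacation": 1.9,
    "ocean": 1.2,
    "resort": 0.8,
    "sand": 0.6,
}

sparsity = 1 - len(query_expansion) / VOCAB_SIZE
print(f"{len(query_expansion)} non-zero weights; {sparsity:.2%} of dimensions are zero")
```
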
[discrete]
[[semantic-search-deploy-nlp-model]]
=== Deploy the model

After you decide which model you want to use for implementing semantic search,
you need to deploy the model in {es}.

include::{es-repo-dir}/tab-widgets/semantic-search/deploy-nlp-model-widget.asciidoc[]

[discrete]
[[semantic-search-field-mappings]]
=== Map a field for the text embeddings

Before you start using the deployed model to generate embeddings from your
input text, you need to prepare your index mapping. The mapping of the index
depends on the type of model.

include::{es-repo-dir}/tab-widgets/semantic-search/field-mappings-widget.asciidoc[]

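As a sketch of what such mappings look like, the request bodies below show the two mapping shapes as Python dictionaries. The field names, dimension count, and similarity are illustrative placeholders; use the values that match your deployed model.

```python
# Dense vector model: a dense_vector field sized to the model's output.
# Field name and dims here are placeholders for illustration.
dense_mapping = {
    "mappings": {
        "properties": {
            "text_embedding.predicted_value": {
                "type": "dense_vector",
                "dims": 384,  # must equal the embedding size of your model
                "index": True,
                "similarity": "cosine",
            },
            "text": {"type": "text"},
        }
    }
}

# ELSER (sparse vectors): a rank_features field holding term/weight pairs.
sparse_mapping = {
    "mappings": {
        "properties": {
            "ml.tokens": {"type": "rank_features"},
            "text": {"type": "text"},
        }
    }
}
```
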
[discrete]
[[semantic-search-generate-embeddings]]
=== Generate text embeddings

Once you have created the mappings for the index, you can generate text
embeddings from your input text. This can be done by using an
<<ingest,ingest pipeline>> with an <<inference-processor,inference processor>>.
The ingest pipeline processes the input data and indexes it into the
destination index. At index time, the inference ingest processor uses the
trained model to run inference on the data ingested through the pipeline.
After you create the ingest pipeline with the inference processor, you can
ingest your data through it to generate the model output.

include::{es-repo-dir}/tab-widgets/semantic-search/generate-embeddings-widget.asciidoc[]

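For orientation, a pipeline with an inference processor might look like the sketch below, expressed as a Python dictionary. The model ID and field names are placeholders to replace with those of your own deployment; you would PUT this body to the `_ingest/pipeline` endpoint and then index (or reindex) your documents through the pipeline.

```python
# Sketch of an ingest pipeline with an inference processor. The model ID and
# field names are placeholders; substitute those of your deployment.
pipeline = {
    "description": "Generate text embeddings at ingest time",
    "processors": [
        {
            "inference": {
                "model_id": "my-text-embedding-model",  # placeholder
                "target_field": "text_embedding",
                # Map your document field to the input field the model expects.
                "field_map": {"text": "text_field"},
            }
        }
    ],
}
```
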
Now it is time to perform semantic search!

[discrete]
[[semantic-search-search]]
=== Search the data

Depending on the type of model you have deployed, you can query rank features
with a text expansion query, or dense vectors with a kNN search.

include::{es-repo-dir}/tab-widgets/semantic-search/search-widget.asciidoc[]

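The two query shapes can be sketched as follows, again as Python dictionaries. The model IDs and field names are illustrative placeholders, not values your deployment is guaranteed to use.

```python
query_text = "How do I set up semantic search?"

# Sparse model (ELSER): a text_expansion query against the rank features field.
sparse_query = {
    "query": {
        "text_expansion": {
            "ml.tokens": {  # placeholder field name
                "model_id": ".elser_model_1",  # placeholder deployment ID
                "model_text": query_text,
            }
        }
    }
}

# Dense model: a kNN search that embeds the query text with the same deployed
# model via a query vector builder.
dense_query = {
    "knn": {
        "field": "text_embedding.predicted_value",  # placeholder field name
        "k": 10,
        "num_candidates": 100,
        "query_vector_builder": {
            "text_embedding": {
                "model_id": "my-text-embedding-model",  # placeholder
                "model_text": query_text,
            }
        },
    }
}
```
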
[discrete]
[[semantic-search-hybrid-search]]
=== Beyond semantic search with hybrid search

In some situations, lexical search may perform better than semantic search, for
example when searching for single words or IDs such as product numbers.

Combining semantic and lexical search into one hybrid search request using
<<rrf,reciprocal rank fusion>> provides the best of both worlds. Not only that,
but hybrid search using reciprocal rank fusion
{blog-ref}improving-information-retrieval-elastic-stack-hybrid[has been shown
to perform better in general].

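Conceptually, reciprocal rank fusion scores each document by summing `1 / (k + rank)` over the result lists it appears in. The sketch below is a minimal illustration of that idea with invented document IDs, not {es}'s internal implementation.

```python
def rrf(result_lists, k=60):
    """Fuse ranked lists of doc IDs (best first) with reciprocal rank fusion."""
    scores = {}
    for results in result_lists:
        for rank, doc_id in enumerate(results, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

lexical = ["doc3", "doc1", "doc5"]   # e.g. a BM25 ranking
semantic = ["doc1", "doc4", "doc3"]  # e.g. a kNN or ELSER ranking

# Documents found by both retrievers rise to the top of the fused ranking.
print(rrf([lexical, semantic]))
```
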
include::{es-repo-dir}/tab-widgets/semantic-search/hybrid-search-widget.asciidoc[]

[discrete]
[[semantic-search-read-more]]
=== Read more

* Tutorials:
** <<semantic-search-elser,Semantic search with ELSER>>
** {ml-docs}/ml-nlp-text-emb-vector-search-example.html[Semantic search with the msmarco-MiniLM-L-12-v3 sentence-transformer model]
* Blogs:
** {blog-ref}may-2023-launch-sparse-encoder-ai-model[Introducing Elastic Learned Sparse Encoder: Elastic's AI model for semantic search]
** {blog-ref}lexical-ai-powered-search-elastic-vector-database[How to get the best of lexical and AI-powered search with Elastic's vector database]
** Information retrieval blog series:
*** {blog-ref}improving-information-retrieval-elastic-stack-search-relevance[Part 1: Steps to improve search relevance]
*** {blog-ref}improving-information-retrieval-elastic-stack-benchmarking-passage-retrieval[Part 2: Benchmarking passage retrieval]
*** {blog-ref}may-2023-launch-information-retrieval-elasticsearch-ai-model[Part 3: Introducing Elastic Learned Sparse Encoder, our new retrieval model]
*** {blog-ref}improving-information-retrieval-elastic-stack-hybrid[Part 4: Hybrid retrieval]

include::semantic-search-elser.asciidoc[]