# How to set up and use 3rd-party text embeddings for dense vector search in Elasticsearch
This guide demonstrates how to deploy and use a text embedding model in Elasticsearch. The model will generate vector representations for text, enabling vector similarity (k-nearest neighbours) search.

## Set up Elasticsearch

### Elasticsearch from IBM Cloud
If you are using Elasticsearch from IBM Cloud, please refer to [this guide](./ICD_Elasticsearch_install_and_setup.md) first to create an Elasticsearch instance and set up Kibana if you haven't already.

### Elasticsearch on CloudPak
Alternatively, if you want to install Elasticsearch on Kubernetes (ECK) in CloudPak, you need to follow [this guide](./watsonx_discovery_install_and_setup.md) first to set up Elasticsearch and Kibana. You can skip [Enable ELSER model (v2)](./watsonx_discovery_install_and_setup.md#enable-elser-model-v2) and any section beyond it in that guide.

## Install the eland library
Run the command below to install the [eland](https://github.com/elastic/eland) library.
```bash
python -m pip install "eland[pytorch]"
```
This library allows us to pull and deploy a 3rd-party text embedding model to our Elasticsearch instance.

> CAUTION: Open-source and 3rd-party models are not in scope of IBM or Elastic indemnity clauses. Customers must accept the relevant terms and conditions when choosing or bringing their own models. Additionally, IBM has not assessed Elastic's supported multilingual models, so any use of Elastic-supported models should be thoroughly understood, both with respect to the terms of use for those models and the terms of use of all of the data used to train them.

NOTE: As of the time this documentation was written, `eland` only supports Python 3.8, 3.9, and 3.10. Please refer to the eland library's [compatibility section](https://github.com/elastic/eland?tab=readme-ov-file#compatibility) to make sure you're using compatible Python and Elasticsearch versions.

NOTE: If you run into issues installing the library, you can also use eland without installing it by using the Docker image provided [here](https://github.com/elastic/eland?tab=readme-ov-file#docker).

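For reference, running the importer from a container looks roughly like the sketch below. This is a sketch based on Elastic's published eland image; the exact image name and tag may differ for your setup, so follow the linked instructions:
```bash
# Run the eland importer from a prebuilt image instead of a local install.
# --network host lets the container reach an Elasticsearch instance on this machine.
docker run -it --rm --network host \
    docker.elastic.co/eland/eland \
    eland_import_hub_model --help
```
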
## Create environment variables for ES credentials
Feel free to customize the names of the `ES_SOURCE_INDEX_NAME`, `ES_EMBEDDING_INDEX_NAME`, and `ES_PIPELINE_NAME` variables below. These names will serve as references for your source index, embedding index, and ingest pipeline throughout this guide.
```bash
export ES_URL=https://<hostname:port>
export ES_USER=<username>
export ES_PASSWORD=<password>
export ES_CACERT=<path-to-your-cert>
export ES_SOURCE_INDEX_NAME=<name-of-source-index>
export ES_EMBEDDING_INDEX_NAME=<name-of-embedding-index>
export ES_PIPELINE_NAME=<name-of-ingest-pipeline>
```
You can find these credentials in the service credentials of your Elasticsearch instance.
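
Before moving on, you can sanity-check the credentials and certificate by hitting the cluster root endpoint:
```bash
# Should return a small JSON document with the cluster name and Elasticsearch version.
curl -u "${ES_USER}:${ES_PASSWORD}" --cacert "${ES_CACERT}" "${ES_URL}"
```
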
## Pull and deploy an embedding model
Run the command below to pull your desired model from the [Hugging Face Model Hub](https://huggingface.co/models) and deploy it on your Elasticsearch instance:
```bash
eland_import_hub_model \
  --url $ES_URL \
  -u $ES_USER -p $ES_PASSWORD --insecure \
  --hub-model-id intfloat/multilingual-e5-small \
  --task-type text_embedding \
  --start
```
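
Note that `--insecure` disables TLS certificate verification for the import. If you would rather keep verification on, recent eland versions can be pointed at a CA bundle instead; the sketch below assumes your version supports the `--ca-certs` option (check `eland_import_hub_model --help`):
```bash
eland_import_hub_model \
  --url $ES_URL \
  -u $ES_USER -p $ES_PASSWORD \
  --ca-certs $ES_CACERT \
  --hub-model-id intfloat/multilingual-e5-small \
  --task-type text_embedding \
  --start
```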

In this example, we are using the `multilingual-e5-small` model, which supports text embeddings in 100 languages. You can read more about this model [here](https://huggingface.co/intfloat/multilingual-e5-small).

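You can also confirm the deployment from the command line with the trained models stats API:
```bash
# The response should list the model with a started deployment.
curl -u "${ES_USER}:${ES_PASSWORD}" --cacert "${ES_CACERT}" \
  "${ES_URL}/_ml/trained_models/intfloat__multilingual-e5-small/_stats"
```
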
## Synchronize your deployed model
Go to the **Machine Learning > Trained Models** page (http://localhost:5601/app/ml/trained_models) and synchronize your trained models. A warning message displayed at the top of the page says "ML job and trained model synchronization required". Follow the link to "Synchronize your jobs and trained models", then click **Synchronize**.

<img src="assets/synchronize_trained_model.png"/>

Once you have synchronized your model, you should see your deployed model on the **Machine Learning > Model Management** page in Kibana.

## Test your deployed model
Run the command below to test the model using the `_infer` API:
```bash
curl -X POST "${ES_URL}/_ml/trained_models/intfloat__multilingual-e5-small/_infer" \
  -u "${ES_USER}:${ES_PASSWORD}" --cacert "${ES_CACERT}" \
  -H "Content-Type: application/json" -d '{
  "docs": [
    { "text_field": "how to set up custom extension?" }
  ]
}'
```
You should see a response containing the predicted embedding vector.

```json
{
  "inference_results": [
    {
      "predicted_value": [
        0.016921168193221092,
        -0.035475824028253555,
        -0.0497407428920269,
        ...
```

## Load sample data
Refer to the [Load data into Elasticsearch](./ICD_Elasticsearch_install_and_setup.md#load-data-into-elasticsearch) section in the Elasticsearch setup guide to upload sample data to Elasticsearch using Kibana.

## Add your embedding model to an inference ingest pipeline
Create an ingest pipeline using the command below:
```bash
curl -X PUT "${ES_URL}/_ingest/pipeline/${ES_PIPELINE_NAME}" \
  -u "${ES_USER}:${ES_PASSWORD}" --cacert "${ES_CACERT}" \
  -H 'Content-Type: application/json' -d '{
    "description": "Text embedding pipeline",
    "processors": [
      {
        "inference": {
          "model_id": "intfloat__multilingual-e5-small",
          "target_field": "text_embedding",
          "field_map": {
            "text": "text_field"
          }
        }
      }
    ],
    "on_failure": [
      {
        "set": {
          "description": "Index document to '\''failed-<index>'\''",
          "field": "_index",
          "value": "failed-{{{_index}}}"
        }
      },
      {
        "set": {
          "description": "Set error message",
          "field": "ingest.failure",
          "value": "{{_ingest.on_failure_message}}"
        }
      }
    ]
}'
```

The `field_map` tells the inference processor to read each document's `text` field and pass it to the model as `text_field`, the input field the model expects. The `on_failure` handlers route documents that fail inference to a `failed-<index>` index along with the error message.

You can verify that the ingest pipeline was created by locating it in the list of your ingest pipelines in Kibana (http://localhost:5601/app/management/ingest/ingest_pipelines).

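You can also dry-run the pipeline with the simulate API before reindexing any data. This assumes your documents carry a `text` field, as in this guide:
```bash
# Returns the transformed document, including the generated text_embedding field.
curl -X POST "${ES_URL}/_ingest/pipeline/${ES_PIPELINE_NAME}/_simulate" \
  -u "${ES_USER}:${ES_PASSWORD}" --cacert "${ES_CACERT}" \
  -H 'Content-Type: application/json' -d '{
  "docs": [
    { "_source": { "text": "how to set up custom extension?" } }
  ]
}'
```
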
## Create a mapping for the destination index containing the embeddings
Run the command below to create the destination index, `ES_EMBEDDING_INDEX_NAME`, with the appropriate mappings:
```bash
curl -X PUT "${ES_URL}/${ES_EMBEDDING_INDEX_NAME}" \
  -u "${ES_USER}:${ES_PASSWORD}" --cacert "${ES_CACERT}" \
  -H 'Content-Type: application/json' -d '{
  "mappings": {
    "properties": {
      "text_embedding.predicted_value": {
        "type": "dense_vector",
        "dims": 384,
        "index": true,
        "similarity": "cosine"
      },
      "text": {
        "type": "text"
      }
    }
  }
}'
```

* `text_embedding.predicted_value` is the field where the ingest processor stores the embeddings.
* `dims` is the embedding size of the deployed model, which is 384 for the `intfloat/multilingual-e5-small` model we are using here.

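To double-check that the index was created as intended, you can retrieve its mapping:
```bash
# The text_embedding.predicted_value field should appear as a dense_vector
# with 384 dims and cosine similarity.
curl -u "${ES_USER}:${ES_PASSWORD}" --cacert "${ES_CACERT}" \
  "${ES_URL}/${ES_EMBEDDING_INDEX_NAME}/_mapping"
```
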
## Create the text embeddings
Run the ingest pipeline to reindex the data into the `ES_EMBEDDING_INDEX_NAME` index:
```bash
curl -X POST "${ES_URL}/_reindex?wait_for_completion=false" \
  -u "${ES_USER}:${ES_PASSWORD}" --cacert "${ES_CACERT}" \
  -H 'Content-Type: application/json' -d "{
  \"source\": {
    \"index\": \"${ES_SOURCE_INDEX_NAME}\",
    \"size\": 50
  },
  \"dest\": {
    \"index\": \"${ES_EMBEDDING_INDEX_NAME}\",
    \"pipeline\": \"${ES_PIPELINE_NAME}\"
  }
}"
```

Here, `source.size` sets the reindex batch size. Because `wait_for_completion=false` is set, the command runs asynchronously and returns a task ID that looks like this:
```json
{"task":<task-id>}
```

The reindexing process can take around 10 minutes, depending on the size of your data. You can use the task ID returned above to check the status of the process:

```bash
curl -X GET "${ES_URL}/_tasks/<task-id>" \
  -u "${ES_USER}:${ES_PASSWORD}" --cacert "${ES_CACERT}"
```

You can check the completion status by monitoring the `"completed"` field in the response:

```json
{
  "completed": true,
  ...
}
```

Once the process is completed, you should see `ES_EMBEDDING_INDEX_NAME` in the list of your indices (http://localhost:5601/app/enterprise_search/content/search_indices).

You can confirm the successful completion of this step by checking the `ES_EMBEDDING_INDEX_NAME` index. If you find the `text_embedding` field of the documents filled with embedding vectors as shown below, the process was successful:
```json
{
  "predicted_value": [
    -0.016909973695874214,
    -0.05246243625879288,
    -0.02864678204059601,
    ...
  ],
  "model_id": "intfloat__multilingual-e5-small"
}
```
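
As an additional check, you can compare document counts between the source and embedding indices; barring failed documents (which are routed to a `failed-<index>` index by the pipeline's `on_failure` handlers), the two counts should match:
```bash
curl -u "${ES_USER}:${ES_PASSWORD}" --cacert "${ES_CACERT}" "${ES_URL}/${ES_SOURCE_INDEX_NAME}/_count"
curl -u "${ES_USER}:${ES_PASSWORD}" --cacert "${ES_CACERT}" "${ES_URL}/${ES_EMBEDDING_INDEX_NAME}/_count"
```
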
## Run semantic search
After the dataset has been enriched with vector embeddings, you can query the data using semantic search:
```bash
curl -X GET "${ES_URL}/${ES_EMBEDDING_INDEX_NAME}/_search" \
  -u "${ES_USER}:${ES_PASSWORD}" --cacert "${ES_CACERT}" \
  -H 'Content-Type: application/json' -d '{
  "knn": {
    "field": "text_embedding.predicted_value",
    "query_vector_builder": {
      "text_embedding": {
        "model_id": "intfloat__multilingual-e5-small",
        "model_text": "how to set up custom extension?"
      }
    },
    "k": 10,
    "num_candidates": 100
  },
  "_source": [
    "id",
    "text"
  ]
}'
```
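
Here, `k` is the number of nearest neighbours to return and `num_candidates` is the number of candidates examined per shard; raising `num_candidates` improves recall at the cost of latency. The kNN clause also accepts a `filter` to restrict the vector search to matching documents. A sketch, assuming a hypothetical `title` keyword field in your data:
```bash
curl -X GET "${ES_URL}/${ES_EMBEDDING_INDEX_NAME}/_search" \
  -u "${ES_USER}:${ES_PASSWORD}" --cacert "${ES_CACERT}" \
  -H 'Content-Type: application/json' -d '{
  "knn": {
    "field": "text_embedding.predicted_value",
    "query_vector_builder": {
      "text_embedding": {
        "model_id": "intfloat__multilingual-e5-small",
        "model_text": "how to set up custom extension?"
      }
    },
    "k": 10,
    "num_candidates": 100,
    "filter": {
      "term": { "title": "custom extension" }
    }
  },
  "_source": ["id", "text"]
}'
```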