通用文本向量同步API参考与调用示例 - Alibaba Cloud Model Studio

General-purpose Text Embedding is a unified multilingual text embedding model developed by the Qwen team. It supports multiple major languages and provides an efficient service for vectorizing text data. This service is suitable for natural language processing (NLP) tasks such as retrieval-augmented generation (RAG), text classification, and sentiment analysis.

Model overview

Singapore

Model

Embedding dimensions

Batch size

Maximum tokens per row (Note)

Price (Mllion input tokens)

Supported languages

Free quota(Note)

text-embedding-v4

Part of the Qwen3-Embedding series

2,048, 1,536, 1,024 (default), 768, 512, 256, 128, 64

8,192

$0.07

Chinese, English, Spanish, French, Portuguese, Indonesian, Japanese, Korean, German, Russian, and 100+ other major languages

1 million tokens

Valid for 90 days after you activate Alibaba Cloud Model Studio.

text-embedding-v3

1,024 (default), 768, 512

Chinese, English, Spanish, French, Portuguese, Indonesian, Japanese, Korean, German, Russian, and 50+ other major languages

500,000 tokens

Validity: 90 days after you activate Model Studio

Beijing

Model

Embedding dimensions

Batch size

Maximum tokens per row

Price (Mllion input tokens)

Supported languages

text-embedding-v4

Part of the Qwen3-Embedding series

2,048, 1,536, 1,024 (default), 768, 512, 256, 128, 64

8,192

$0.072

Chinese, English, Spanish, French, Portuguese, Indonesian, Japanese, Korean, German, Russian, and 100+ other major languages and multiple programming languages

For more information about model rate limits, see Rate limits.

Prerequisites

Before making a call, you must get your API key and set it as an environment variable. If you plan to use an SDK, you must also install the appropriate DashScope SDK.

OpenAI compatible

The base_url to configure for SDK calls:

Singapore: https://dashscope-intl.aliyuncs.com/compatible-mode/v1
China (Beijing): https://dashscope.aliyuncs.com/compatible-mode/v1

The endpoint to configure for HTTP calls:

Singapore: POST https://dashscope-intl.aliyuncs.com/compatible-mode/v1/embeddings
China (Beijing): POST https://dashscope.aliyuncs.com/compatible-mode/v1/embeddings

Request body	Input string Python import os from openai import OpenAI client = OpenAI( # If you use a model in the China (Beijing) region, you must use an API key for that region. You can obtain one at: https://bailian.console.alibabacloud.com/?tab=model#/api-key api_key=os.getenv("DASHSCOPE_API_KEY"), # If you have not configured the environment variable, replace the placeholder with your API key. # If you use a model in the China (Beijing) region, replace the URL with: https://dashscope.aliyuncs.com/compatible-mode/v1 base_url="https://dashscope-intl.aliyuncs.com/compatible-mode/v1" ) completion = client.embeddings.create( model="text-embedding-v4", input='The clothes are of good quality and look good, definitely worth the wait. I love them.', dimensions=1024, encoding_format="float" ) print(completion.model_dump_json()) Java import java.net.URI; import java.net.http.HttpClient; import java.net.http.HttpRequest; import java.net.http.HttpResponse; import java.util.HashMap; import java.util.Map; import com.alibaba.dashscope.utils.JsonUtils; public final class Main { public static void main(String[] args) { // If you use a model in the China (Beijing) region, you must use an API key for that region. You can obtain one at: https://bailian.console.alibabacloud.com/?tab=model#/api-key String apiKey = System.getenv("DASHSCOPE_API_KEY"); if (apiKey == null) { System.out.println("DASHSCOPE_API_KEY not found in environment variables"); return; } // If you use a model in the China (Beijing) region, replace the URL with: https://dashscope.aliyuncs.com/compatible-mode/v1/embeddings String baseUrl = "https://dashscope-intl.aliyuncs.com/compatible-mode/v1/embeddings"; HttpClient client = HttpClient.newHttpClient(); Map<String, Object> requestBody = new HashMap<>(); requestBody.put("model", "text-embedding-v4"); requestBody.put("input", "The wind is strong, the sky is high, and the apes cry mournfully. The islet is clear, the sand is white, and the birds fly back. The boundless forest sheds its leaves shower by shower. The endless river rolls on wave after wave."); requestBody.put("dimensions", 1024); requestBody.put("encoding_format", "float"); try { String requestBodyString = JsonUtils.toJson(requestBody); HttpRequest request = HttpRequest.newBuilder() .uri(URI.create(baseUrl)) .header("Content-Type", "application/json") .header("Authorization", "Bearer " + apiKey) .POST(HttpRequest.BodyPublishers.ofString(requestBodyString)) .build(); HttpResponse<String> response = client.send(request, HttpResponse.BodyHandlers.ofString()); if (response.statusCode() == 200) { System.out.println("Response: " + response.body()); } else { System.out.printf("Failed to retrieve response, status code: %d, response: %s%n", response.statusCode(), response.body()); } } catch (Exception e) { System.err.println("Error: " + e.getMessage()); } } } curl If you use a model in the China (Beijing) region, use an API key for that region and replace the URL with: https://dashscope.aliyuncs.com/compatible-mode/v1/embeddings curl --location 'https://dashscope-intl.aliyuncs.com/compatible-mode/v1/embeddings' \ --header "Authorization: Bearer $DASHSCOPE_API_KEY" \ --header 'Content-Type: application/json' \ --data '{ "model": "text-embedding-v4", "input": "The wind is strong, the sky is high, and the apes cry mournfully. The islet is clear, the sand is white, and the birds fly back. The boundless forest sheds its leaves shower by shower. The endless river rolls on wave after wave.", "dimensions": "1024", "encoding_format": "float" }' Input string list Python import os from openai import OpenAI client = OpenAI( # If you use a model in the China (Beijing) region, you must use an API key for that region. You can obtain one at: https://bailian.console.alibabacloud.com/?tab=model#/api-key api_key=os.getenv("DASHSCOPE_API_KEY"), # If you have not configured the environment variable, replace the placeholder with your API key. # If you use a model in the China (Beijing) region, replace the URL with: https://dashscope.aliyuncs.com/compatible-mode/v1 base_url="https://dashscope-intl.aliyuncs.com/compatible-mode/v1" ) completion = client.embeddings.create( model="text-embedding-v4", input=['The wind is strong, the sky is high, and the apes cry mournfully.', 'The islet is clear, the sand is white, and the birds fly back.', 'The boundless forest sheds its leaves shower by shower.', 'The endless river rolls on wave after wave.'], dimensions=1024, encoding_format="float" ) print(completion.model_dump_json()) Java import java.net.URI; import java.net.http.HttpClient; import java.net.http.HttpRequest; import java.net.http.HttpResponse; import java.util.HashMap; import java.util.Map; import java.util.List; import java.util.Arrays; import com.alibaba.dashscope.utils.JsonUtils; public final class Main { public static void main(String[] args) { /** Get the API key from an environment variable. If not configured, replace the placeholder with your API key. / // If you use a model in the China (Beijing) region, you must use an API key for that region. You can obtain one at: https://bailian.console.alibabacloud.com/?tab=model#/api-key String apiKey = System.getenv("DASHSCOPE_API_KEY"); if (apiKey == null) { System.out.println("DASHSCOPE_API_KEY not found in environment variables"); return; } // If you use a model in the China (Beijing) region, replace the URL with: https://dashscope.aliyuncs.com/compatible-mode/v1/embeddings String baseUrl = "https://dashscope-intl.aliyuncs.com/compatible-mode/v1/embeddings"; HttpClient client = HttpClient.newHttpClient(); Map<String, Object> requestBody = new HashMap<>(); requestBody.put("model", "text-embedding-v4"); List<String> inputList = Arrays.asList("The wind is strong, the sky is high, and the apes cry mournfully.", "The islet is clear, the sand is white, and the birds fly back.", "The boundless forest sheds its leaves shower by shower.", "The endless river rolls on wave after wave."); requestBody.put("input", inputList); requestBody.put("encoding_format", "float"); try { /* Convert the request body to a JSON string. / String requestBodyString = JsonUtils.toJson(requestBody); /* Build the HTTP request. / HttpRequest request = HttpRequest.newBuilder() .uri(URI.create(baseUrl)) .header("Content-Type", "application/json") .header("Authorization", "Bearer " + apiKey) .POST(HttpRequest.BodyPublishers.ofString(requestBodyString)) .build(); /* Send the request and receive the response. / HttpResponse<String> response = client.send(request, HttpResponse.BodyHandlers.ofString()); if (response.statusCode() == 200) { System.out.println("Response: " + response.body()); } else { System.out.printf("Failed to retrieve response, status code: %d, response: %s%n", response.statusCode(), response.body()); } } catch (Exception e) { /* Catch and print the exception. / System.err.println("Error: " + e.getMessage()); } } } curl If you use a model in the China (Beijing) region, use an API key for that region and replace the URL with: https://dashscope.aliyuncs.com/compatible-mode/v1/embeddings curl --location 'https://dashscope-intl.aliyuncs.com/compatible-mode/v1/embeddings' \ --header "Authorization: Bearer $DASHSCOPE_API_KEY" \ --header 'Content-Type: application/json' \ --data '{ "model": "text-embedding-v4", "input": [ "The wind is strong, the sky is high, and the apes cry mournfully.", "The islet is clear, the sand is white, and the birds fly back.", "The boundless forest sheds its leaves shower by shower.", "The endless river rolls on wave after wave." ], "dimensions": 1024, "encoding_format": "float" }' Input file Python import os from openai import OpenAI client = OpenAI( # If you use a model in the China (Beijing) region, you must use an API key for that region. You can obtain one at: https://bailian.console.alibabacloud.com/?tab=model#/api-key api_key=os.getenv("DASHSCOPE_API_KEY"), # If you have not configured the environment variable, replace the placeholder with your API key. # If you use a model in the China (Beijing) region, replace the URL with: https://dashscope.aliyuncs.com/compatible-mode/v1 base_url="https://dashscope-intl.aliyuncs.com/compatible-mode/v1" ) # Make sure to replace 'texts_to_embedding.txt' with your own file name or path. with open('texts_to_embedding.txt', 'r', encoding='utf-8') as f: completion = client.embeddings.create( model="text-embedding-v4", input=f, encoding_format="float" ) print(completion.model_dump_json()) Java import java.net.URI; import java.net.http.HttpClient; import java.net.http.HttpRequest; import java.net.http.HttpResponse; import java.util.HashMap; import java.util.Map; import java.io.BufferedReader; import java.io.FileReader; import java.io.IOException; import com.alibaba.dashscope.utils.JsonUtils; public class Main { public static void main(String[] args) { /* Get the API key from an environment variable. If not configured, replace the placeholder with your API key. / // If you use a model in the China (Beijing) region, you must use an API key for that region. You can obtain one at: https://bailian.console.alibabacloud.com/?tab=model#/api-key String apiKey = System.getenv("DASHSCOPE_API_KEY"); if (apiKey == null) { System.out.println("DASHSCOPE_API_KEY not found in environment variables"); return; } // If you use a model in the China (Beijing) region, replace the URL with: https://dashscope.aliyuncs.com/compatible-mode/v1/embeddings String baseUrl = "https://dashscope-intl.aliyuncs.com/compatible-mode/v1/embeddings"; HttpClient client = HttpClient.newHttpClient(); /* Read the input file. / StringBuilder inputText = new StringBuilder(); try (BufferedReader reader = new BufferedReader(new FileReader("<path_to_your_content_root>"))) { String line; while ((line = reader.readLine()) != null) { inputText.append(line).append("\n"); } } catch (IOException e) { System.err.println("Error reading input file: " + e.getMessage()); return; } Map<String, Object> requestBody = new HashMap<>(); requestBody.put("model", "text-embedding-v4"); requestBody.put("input", inputText.toString().trim()); requestBody.put("dimensions", 1024); requestBody.put("encoding_format", "float"); try { String requestBodyString = JsonUtils.toJson(requestBody); /* Build the HTTP request. */ HttpRequest request = HttpRequest.newBuilder() .uri(URI.create(baseUrl)) .header("Content-Type", "application/json") .header("Authorization", "Bearer " + apiKey) .POST(HttpRequest.BodyPublishers.ofString(requestBodyString)) .build(); HttpResponse<String> response = client.send(request, HttpResponse.BodyHandlers.ofString()); if (response.statusCode() == 200) { System.out.println("Response: " + response.body()); } else { System.out.printf("Failed to retrieve response, status code: %d, response: %s%n", response.statusCode(), response.body()); } } catch (Exception e) { System.err.println("Error: " + e.getMessage()); } } } curl If you use a model in the China (Beijing) region, use an API key for that region and replace the URL with: https://dashscope.aliyuncs.com/compatible-mode/v1/embeddings Replace 'texts_to_embedding.txt' with your file name or path. `FILE_CONTENT=$(cat texts_to_embedding.txt \| jq -Rs .) curl --location 'https://dashscope-intl.aliyuncs.com/compatible-mode/v1/embeddings' \ --header "Authorization: Bearer $DASHSCOPE_API_KEY" \ --header 'Content-Type: application/json' \ --data '{ "model": "text-embedding-v4", "input": ['"$FILE_CONTENT"'], "dimensions": 1024, "encoding_format": "float" }'`
model `string` Required The name of the model to call. Select a model name from the Model overview table.
input `array<string> or string or file` Required The input text. The value can be a string, a list of strings, or a file. Each line in the file is treated as a separate text to be embedded. Text limits: Number of texts: A string can contain up to 8,192 tokens. A list of strings can contain up to 10 strings. Each string can contain up to 8,192 tokens. A file can contain up to 10 lines. Each line can contain up to 8,192 tokens.
dimensions `integer` Optional Specifies the output vector dimensions. This parameter applies only to the text-embedding-v3 and text-embedding-v4 models. You can set this parameter to 2048 (for text-embedding-v4 only), 1536 (for text-embedding-v4 only), 1024, 768, 512, 256, 128, or 64. The default value is 1024.
encoding_format `string` Optional Specifies the format of the returned embedding. Currently, only the `float` format is supported.

Response object	Successful response `{ "data": [ { "embedding": [ -0.0695386752486229, 0.030681096017360687, ... ], "index": 0, "object": "embedding" }, ... { "embedding": [ -0.06348952651023865, 0.060446035116910934, ... ], "index": 5, "object": "embedding" } ], "model": "text-embedding-v3", "object": "list", "usage": { "prompt_tokens": 184, "total_tokens": 184 }, "id": "73591b79-d194-9bca-8bb5-xxxxxxxxxxxx" }` Error response `{ "error": { "message": "Incorrect API key provided. ", "type": "invalid_request_error", "param": null, "code": "invalid_api_key" } }`
data `array` The output data for the task. Properties embedding `list` The embedding vector, which is an array of floating-point numbers. index `integer` The index of the input text in the input array that corresponds to the result in this structure. object string The type of the object returned by the call. The default value is embedding.
model `string` The name of the model that was called.
object string The type of the data returned by the call. The default value is list.
usage `object` Properties prompt_tokens integer The number of tokens in the input text. total_tokens integer The total number of tokens in the request. The token count is calculated based on how the model's tokenizer parses the input string.
id string The unique request ID. You can use this ID to trace and troubleshoot requests.

DashScope

The base_url to configure for SDK calls:

Singapore: https://dashscope-intl.aliyuncs.com/api/v1
China (Beijing): https://dashscope.aliyuncs.com/api/v1

The endpoint to configure for HTTP calls:

Singapore: POST https://dashscope-intl.aliyuncs.com/api/v1/services/embeddings/text-embedding/text-embedding
China (Beijing): POST https://dashscope.aliyuncs.com/api/v1/services/embeddings/text-embedding/text-embedding

Request body	Input string Python import dashscope from http import HTTPStatus # If you use a model in the China (Beijing) region, replace the URL with: https://dashscope.aliyuncs.com/api/v1 dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1' resp = dashscope.TextEmbedding.call( model=dashscope.TextEmbedding.Models.text_embedding_v4, input='The wind is strong, the sky is high, and the apes cry mournfully. The islet is clear, the sand is white, and the birds fly back. The boundless forest sheds its leaves shower by shower. The endless river rolls on wave after wave.', dimension=1024, output_type="dense&sparse" ) print(resp) if resp.status_code == HTTPStatus.OK else print(resp) Java import java.util.Arrays; import java.util.concurrent.Semaphore; import com.alibaba.dashscope.common.ResultCallback; import com.alibaba.dashscope.embeddings.TextEmbedding; import com.alibaba.dashscope.embeddings.TextEmbeddingParam; import com.alibaba.dashscope.embeddings.TextEmbeddingResult; import com.alibaba.dashscope.exception.ApiException; import com.alibaba.dashscope.exception.NoApiKeyException; import com.alibaba.dashscope.utils.Constants; public final class Main { static { // If you use a model in the China (Beijing) region, replace the URL with: https://dashscope.aliyuncs.com/api/v1 Constants.baseHttpApiUrl="https://dashscope-intl.aliyuncs.com/api/v1"; } public static void basicCall() throws ApiException, NoApiKeyException{ TextEmbeddingParam param = TextEmbeddingParam .builder() .model(TextEmbedding.Models.TEXT_EMBEDDING_V4) .texts(Arrays.asList("The wind is strong, the sky is high, and the apes cry mournfully.", "The islet is clear, the sand is white, and the birds fly back.", "The boundless forest sheds its leaves shower by shower.", "The endless river rolls on wave after wave.")).build(); TextEmbedding textEmbedding = new TextEmbedding(); TextEmbeddingResult result = textEmbedding.call(param); System.out.println(result); } public static void callWithCallback() throws ApiException, NoApiKeyException, InterruptedException{ TextEmbeddingParam param = TextEmbeddingParam .builder() .model(TextEmbedding.Models.TEXT_EMBEDDING_V3) .texts(Arrays.asList("The wind is strong, the sky is high, and the apes cry mournfully.", "The islet is clear, the sand is white, and the birds fly back.", "The boundless forest sheds its leaves shower by shower.", "The endless river rolls on wave after wave.")).build(); TextEmbedding textEmbedding = new TextEmbedding(); Semaphore sem = new Semaphore(0); textEmbedding.call(param, new ResultCallback<TextEmbeddingResult>() { @Override public void onEvent(TextEmbeddingResult message) { System.out.println(message); } @Override public void onComplete(){ sem.release(); } @Override public void onError(Exception err){ System.out.println(err.getMessage()); err.printStackTrace(); sem.release(); } }); sem.acquire(); } public static void main(String[] args){ try{ callWithCallback(); }catch(ApiException\|NoApiKeyException\|InterruptedException e){ e.printStackTrace(); System.out.println(e); } try { basicCall(); } catch (ApiException \| NoApiKeyException e) { System.out.println(e.getMessage()); } System.exit(0); } } curl If you use a model in the China (Beijing) region, use an API key for that region and replace the URL with: https://dashscope.aliyuncs.com/api/v1/services/embeddings/text-embedding/text-embedding curl --location 'https://dashscope-intl.aliyuncs.com/api/v1/services/embeddings/text-embedding/text-embedding' \ --header "Authorization: Bearer $DASHSCOPE_API_KEY" \ --header 'Content-Type: application/json' \ --data '{ "model": "text-embedding-v4", "input": { "texts": [ "The wind is strong, the sky is high, and the apes cry mournfully. The islet is clear, the sand is white, and the birds fly back. The boundless forest sheds its leaves shower by shower. The endless river rolls on wave after wave." ] }, "parameters": { "dimension": 1024, "output_type": "dense" } }' Input string list Python import dashscope from http import HTTPStatus # If you use a model in the China (Beijing) region, replace the URL with: https://dashscope.aliyuncs.com/api/v1 dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1' DASHSCOPE_MAX_BATCH_SIZE = 25 inputs = ['The wind is strong, the sky is high, and the apes cry mournfully.', 'The islet is clear, the sand is white, and the birds fly back.', 'The boundless forest sheds its leaves shower by shower.', 'The endless river rolls on wave after wave.'] result = None batch_counter = 0 for i in range(0, len(inputs), DASHSCOPE_MAX_BATCH_SIZE): batch = inputs[i:i + DASHSCOPE_MAX_BATCH_SIZE] resp = dashscope.TextEmbedding.call( model=dashscope.TextEmbedding.Models.text_embedding_v4, input=batch, dimension=1024 ) if resp.status_code == HTTPStatus.OK: if result is None: result = resp else: for emb in resp.output['embeddings']: emb['text_index'] += batch_counter result.output['embeddings'].append(emb) result.usage['total_tokens'] += resp.usage['total_tokens'] else: print(resp) batch_counter += len(batch) print(result) Java import java.util.Arrays; import java.util.List; import com.alibaba.dashscope.embeddings.TextEmbedding; import com.alibaba.dashscope.embeddings.TextEmbeddingParam; import com.alibaba.dashscope.embeddings.TextEmbeddingResult; import com.alibaba.dashscope.exception.ApiException; import com.alibaba.dashscope.exception.NoApiKeyException; import com.alibaba.dashscope.utils.Constants; public final class Main { static { // If you use a model in the China (Beijing) region, replace the URL with: https://dashscope.aliyuncs.com/api/v1 Constants.baseHttpApiUrl="https://dashscope-intl.aliyuncs.com/api/v1"; } private static final int DASHSCOPE_MAX_BATCH_SIZE = 25; public static void main(String[] args) { List<String> inputs = Arrays.asList( "The wind is strong, the sky is high, and the apes cry mournfully.", "The islet is clear, the sand is white, and the birds fly back.", "The boundless forest sheds its leaves shower by shower.", "The endless river rolls on wave after wave." ); TextEmbeddingResult result = null; int batchCounter = 0; for (int i = 0; i < inputs.size(); i += DASHSCOPE_MAX_BATCH_SIZE) { List<String> batch = inputs.subList(i, Math.min(i + DASHSCOPE_MAX_BATCH_SIZE, inputs.size())); TextEmbeddingParam param = TextEmbeddingParam.builder() .model(TextEmbedding.Models.TEXT_EMBEDDING_V4) .texts(batch) .build(); TextEmbedding textEmbedding = new TextEmbedding(); try { TextEmbeddingResult resp = textEmbedding.call(param); if (resp != null) { if (result == null) { result = resp; } else { for (var emb : resp.getOutput().getEmbeddings()) { emb.setTextIndex(emb.getTextIndex() + batchCounter); result.getOutput().getEmbeddings().add(emb); } result.getUsage().setTotalTokens(result.getUsage().getTotalTokens() + resp.getUsage().getTotalTokens()); } } else { System.out.println(resp); } } catch (ApiException \| NoApiKeyException e) { e.printStackTrace(); } batchCounter += batch.size(); } System.out.println(result); } } curl If you use a model in the China (Beijing) region, use an API key for that region and replace the URL with: https://dashscope.aliyuncs.com/api/v1/services/embeddings/text-embedding/text-embedding curl --location 'https://dashscope-intl.aliyuncs.com/api/v1/services/embeddings/text-embedding/text-embedding' \ --header "Authorization: Bearer $DASHSCOPE_API_KEY" \ --header 'Content-Type: application/json' \ --data '{ "model": "text-embedding-v4", "input": { "texts": [ "The wind is strong, the sky is high, and the apes cry mournfully.", "The islet is clear, the sand is white, and the birds fly back.", "The boundless forest sheds its leaves shower by shower.", "The endless river rolls on wave after wave." ] }, "parameters": { "dimension": 1024, "output_type": "dense" } }' Input file Python import dashscope from http import HTTPStatus from dashscope import TextEmbedding # If you use a model in the China (Beijing) region, replace the URL with: https://dashscope.aliyuncs.com/api/v1 dashscope.base_http_api_url = 'https://dashscope-intl.aliyuncs.com/api/v1' # Make sure to replace 'texts_to_embedding.txt' with your own file name or path. with open('texts_to_embedding.txt', 'r', encoding='utf-8') as f: resp = TextEmbedding.call( model=TextEmbedding.Models.text_embedding_v4, input=f, dimension=1024 ) if resp.status_code == HTTPStatus.OK: print(resp) else: print(resp) Java import java.io.BufferedReader; import java.io.FileReader; import java.io.IOException; import com.alibaba.dashscope.embeddings.TextEmbedding; import com.alibaba.dashscope.embeddings.TextEmbeddingParam; import com.alibaba.dashscope.embeddings.TextEmbeddingResult; import com.alibaba.dashscope.exception.ApiException; import com.alibaba.dashscope.exception.NoApiKeyException; import com.alibaba.dashscope.utils.Constants; public final class Main { static { // If you use a model in the China (Beijing) region, replace the URL with: https://dashscope.aliyuncs.com/api/v1 Constants.baseHttpApiUrl="https://dashscope-intl.aliyuncs.com/api/v1"; } public static void main(String[] args) { // Make sure to replace 'tests_to_embedding.txt' with the full path of your file. try (BufferedReader reader = new BufferedReader(new FileReader("tests_to_embedding.txt"))) { StringBuilder content = new StringBuilder(); String line; while ((line = reader.readLine()) != null) { content.append(line).append("\n"); } TextEmbeddingParam param = TextEmbeddingParam.builder() .model(TextEmbedding.Models.TEXT_EMBEDDING_V4) .text(content.toString()) .build(); TextEmbedding textEmbedding = new TextEmbedding(); TextEmbeddingResult result = textEmbedding.call(param); if (result != null) { System.out.println(result); } else { System.out.println("Failed to get embedding: " + result); } } catch (IOException \| ApiException \| NoApiKeyException e) { e.printStackTrace(); } } } curl If you use a model in the China (Beijing) region, use an API key for that region and replace the URL with: https://dashscope.aliyuncs.com/api/v1/services/embeddings/text-embedding/text-embedding Replace 'texts_to_embedding.txt' with your file name or path. `FILE_CONTENT=$(cat texts_to_embedding.txt \| jq -Rs .) curl --location 'https://dashscope-intl.aliyuncs.com/api/v1/services/embeddings/text-embedding/text-embedding' \ --header "Authorization: Bearer $DASHSCOPE_API_KEY" \ --header 'Content-Type: application/json' \ --data '{ "model": "text-embedding-v4", "input": { "texts": ['"$FILE_CONTENT"'] }, "parameters": { "dimension": 1024, "output_type": "dense" } }'`
model `string` Required The name of the model to call. Select a model name from the Model overview table.
input `string` or `array<string>` Required The input text. The value can be a string, a list of strings, or a file. Each line in the file is treated as a separate text to be embedded. Text limits: Number of texts: A string can contain up to 8,192 tokens. A list of strings can contain up to 10 strings. Each string can contain up to 8,192 tokens. A file can contain up to 10 lines. Each line can contain up to 8,192 tokens.
text_type `string` Optional After text is converted to a vector, it can be used for downstream tasks such as retrieval, clustering, and classification. For asymmetric tasks such as retrieval, we recommend distinguishing between query text (query) and document text (document) to achieve better results. For symmetric tasks such as clustering and classification, you can use the default value `document`.
dimension `integer` Optional Specifies the output vector dimensions. This parameter applies only to the text-embedding-v3 and text-embedding-v4 models. You can set this parameter to 2048 (for text-embedding-v4 only), 1536 (for text-embedding-v4 only), 1024, 768, 512, 256, 128, or 64. The default value is 1024.
output_type `string` Optional Specifies whether to output a sparse vector representation. This parameter applies only to the text_embedding_v3 and text_embedding_v4 models. Valid values are dense, sparse, and dense&sparse. The default value is dense, which outputs only a dense vector.
instruct `string` Optional Adds a custom task description. This parameter takes effect only when you use the `text-embedding-v4` model and set `text_type` to `query`. We recommend writing the description in English, because it can typically improve performance by 1% to 5%.

Response object

Successful response

{   "status_code": 200, 
    "request_id": "1ba94ac8-e058-99bc-9cc1-7fdb37940a46", 
    "code": "", 
    "message": "",
    "output":{
        "embeddings": [
          {  
             "sparse_embedding":[
               {"index":7149,"value":0.829,"token":"feng"},
               .....
               {"index":111290,"value":0.9004,"token":"ai"}],
             "embedding": [-0.006929283495992422,-0.005336422007530928, ...],
             "text_index": 0
          }, 
          {
             "sparse_embedding":[
               {"index":246351,"value":1.0483,"token":"zhu"},
               .....
               {"index":2490,"value":0.8579,"token":"hui"}],
             "embedding": [-0.006929283495992422,-0.005336422007530928, ...],
             "text_index": 1
          },
          {
             "sparse_embedding":[
               {"index":3759,"value":0.7065,"token":"wu"},
               .....
               {"index":1130,"value":0.815,"token":"xia"}],
             "embedding": [-0.006929283495992422,-0.005336422007530928, ...],
             "text_index": 2
          },
          {
             "sparse_embedding":[
               {"index":562,"value":0.6752,"token":"bu"},
               .....
               {"index":1589,"value":0.7097,"token":"lai"}],
             "embedding": [-0.001945948973298072,-0.005336422007530928, ...],
             "text_index": 3
          }
        ]
    },
    "usage":{
        "total_tokens":27
    },
    "request_id":"xxxxxxxx"
}

Error response

{
    "code":"InvalidApiKey",
    "message":"Invalid API-key provided.",
    "request_id":"xxxxxxxx"
}

status_code string

The status code. A value of 200 indicates a successful request.

request_id string

The unique request ID. You can use this ID to trace and troubleshoot requests.

code string

If the request fails, this parameter indicates the error code. If the request is successful, this parameter is empty.

message string

If the request fails, this parameter indicates the detailed error message. If the request is successful, this parameter is empty.

output object

The output data for the task.

Properties

embeddings array

The output for the current request. It is an array of structures, where each structure contains the output for the corresponding input text.

Properties

sparse_embedding array

The sparse vector representation for the corresponding string. The sparse_embedding parameter applies only to text_embedding_v3 and text_embedding_v4.

Properties

index integer

The index of the word or character in the vocabulary.

value float

The weight or importance score of the token. A higher value indicates that the token is more important or relevant in the current text context.

token string

The actual text unit or word from the vocabulary.

embedding array

The dense vector representation for the corresponding string (dense embedding).

text_index integer

The index of the input text in the input array that corresponds to the result in this structure.

usage object

Properties

total_tokens integer

The total number of tokens in the request. The token count is calculated based on how the model's tokenizer parses the input string.

request_id string

The unique request ID. You can use this ID to trace and troubleshoot requests.

Error codes

If a model call fails and an error message is returned, see Error messages for information about how to resolve the issue.

Model overview

Singapore

Beijing

Prerequisites

OpenAI compatible

Request body

Input string

Python

Java

curl

Input string list

Python

Java

curl

Input file

Python

Java

curl

Response object

Successful response

Error response

DashScope

Request body

Input string

Python

Java

curl

Input string list

Python

Java

curl

Input file

Python

Java

curl

Response object

Successful response

Error response

Error codes