- 1.121.0 (latest)
- 1.120.0
- 1.119.0
- 1.118.0
- 1.117.0
- 1.116.0
- 1.115.0
- 1.114.0
- 1.113.0
- 1.112.0
- 1.111.0
- 1.110.0
- 1.109.0
- 1.108.0
- 1.107.0
- 1.106.0
- 1.105.0
- 1.104.0
- 1.103.0
- 1.102.0
- 1.101.0
- 1.100.0
- 1.99.0
- 1.98.0
- 1.97.0
- 1.96.0
- 1.95.1
- 1.94.0
- 1.93.1
- 1.92.0
- 1.91.0
- 1.90.0
- 1.89.0
- 1.88.0
- 1.87.0
- 1.86.0
- 1.85.0
- 1.84.0
- 1.83.0
- 1.82.0
- 1.81.0
- 1.80.0
- 1.79.0
- 1.78.0
- 1.77.0
- 1.76.0
- 1.75.0
- 1.74.0
- 1.73.0
- 1.72.0
- 1.71.1
- 1.70.0
- 1.69.0
- 1.68.0
- 1.67.1
- 1.66.0
- 1.65.0
- 1.63.0
- 1.62.0
- 1.60.0
- 1.59.0
- 1.58.0
- 1.57.0
- 1.56.0
- 1.55.0
- 1.54.1
- 1.53.0
- 1.52.0
- 1.51.0
- 1.50.0
- 1.49.0
- 1.48.0
- 1.47.0
- 1.46.0
- 1.45.0
- 1.44.0
- 1.43.0
- 1.39.0
- 1.38.1
- 1.37.0
- 1.36.4
- 1.35.0
- 1.34.0
- 1.33.1
- 1.32.0
- 1.31.1
- 1.30.1
- 1.29.0
- 1.28.1
- 1.27.1
- 1.26.1
- 1.25.0
- 1.24.1
- 1.23.0
- 1.22.1
- 1.21.0
- 1.20.0
- 1.19.1
- 1.18.3
- 1.17.1
- 1.16.1
- 1.15.1
- 1.14.0
- 1.13.1
- 1.12.1
- 1.11.0
- 1.10.0
- 1.9.0
- 1.8.1
- 1.7.1
- 1.6.2
- 1.5.0
- 1.4.3
- 1.3.0
- 1.2.0
- 1.1.1
- 1.0.1
- 0.9.0
- 0.8.0
- 0.7.1
- 0.6.0
- 0.5.1
- 0.4.0
- 0.3.1
LocalEndpoint(
    serving_container_image_uri: str,
    artifact_uri: Optional[str] = None,
    serving_container_predict_route: Optional[str] = None,
    serving_container_health_route: Optional[str] = None,
    serving_container_command: Optional[Sequence[str]] = None,
    serving_container_args: Optional[Sequence[str]] = None,
    serving_container_environment_variables: Optional[Dict[str, str]] = None,
    serving_container_ports: Optional[Sequence[int]] = None,
    credential_path: Optional[str] = None,
    host_port: Optional[str] = None,
    gpu_count: Optional[int] = None,
    gpu_device_ids: Optional[List[str]] = None,
    gpu_capabilities: Optional[List[List[str]]] = None,
    container_ready_timeout: Optional[int] = None,
    container_ready_check_interval: Optional[int] = None,
)Class that represents a local endpoint.
Inheritance
builtins.object > LocalEndpointMethods
LocalEndpoint
LocalEndpoint(
    serving_container_image_uri: str,
    artifact_uri: Optional[str] = None,
    serving_container_predict_route: Optional[str] = None,
    serving_container_health_route: Optional[str] = None,
    serving_container_command: Optional[Sequence[str]] = None,
    serving_container_args: Optional[Sequence[str]] = None,
    serving_container_environment_variables: Optional[Dict[str, str]] = None,
    serving_container_ports: Optional[Sequence[int]] = None,
    credential_path: Optional[str] = None,
    host_port: Optional[str] = None,
    gpu_count: Optional[int] = None,
    gpu_device_ids: Optional[List[str]] = None,
    gpu_capabilities: Optional[List[List[str]]] = None,
    container_ready_timeout: Optional[int] = None,
    container_ready_check_interval: Optional[int] = None,
)Creates a local endpoint instance.
| Name | Description | 
| serving_container_image_uri | strRequired. The URI of the Model serving container. | 
| artifact_uri | strOptional. The path to the directory containing the Model artifact and any of its supporting files. The path is either a GCS uri or the path to a local directory. If this parameter is set to a GCS uri: (1)  | 
| serving_container_predict_route | strOptional. An HTTP path to send prediction requests to the container, and which must be supported by it. If not specified a default HTTP path will be used by Vertex AI. | 
| serving_container_health_route | strOptional. An HTTP path to send health check requests to the container, and which must be supported by it. If not specified a standard HTTP path will be used by Vertex AI. | 
| serving_container_command | Sequence[str]Optional. The command with which the container is run. Not executed within a shell. The Docker image's ENTRYPOINT is used if this is not provided. Variable references $(VAR_NAME) are expanded using the container's environment. If a variable cannot be resolved, the reference in the input string will be unchanged. The $(VAR_NAME) syntax can be escaped with a double $$, ie: $$(VAR_NAME). Escaped references will never be expanded, regardless of whether the variable exists or not. | 
| serving_container_environment_variables | Dict[str, str]Optional. The environment variables that are to be present in the container. Should be a dictionary where keys are environment variable names and values are environment variable values for those names. | 
| serving_container_ports | Sequence[int]Optional. Declaration of ports that are exposed by the container. This field is primarily informational, it gives Vertex AI information about the network connections the container uses. Listing or not a port here has no impact on whether the port is actually exposed, any port listening on the default "0.0.0.0" address inside a container will be accessible from the network. | 
| credential_path | strOptional. The path to the credential key that will be mounted to the container. If it's unset, the environment variable,  | 
| host_port | strOptional. The port on the host that the port,  | 
| gpu_count | intOptional. Number of devices to request. Set to -1 to request all available devices. To use GPU, set either  | 
| gpu_device_ids | List[str]Optional. This parameter corresponds to  | 
| gpu_capabilities | List[List[str]]Optional. This parameter corresponds to  | 
| container_ready_timeout | intOptional. The timeout in second used for starting the container or succeeding the first health check. | 
| container_ready_check_interval | intOptional. The time interval in second to check if the container is ready or the first health check succeeds. | 
| Type | Description | 
| ValueError | If both gpu_countandgpu_device_idsare set. | 
__del__
__del__()Stops the container when the instance is about to be destroyed.
__enter__
__enter__()Enters the runtime context related to this object.
__exit__
__exit__(exc_type, exc_value, exc_traceback)Exits the runtime context related to this object.
get_container_status
get_container_status()Gets the container status.
predict
predict(
    request: Optional[Any] = None,
    request_file: Optional[str] = None,
    headers: Optional[Dict] = None,
    verbose: bool = True,
)Executes a prediction.
| Name | Description | 
| request | AnyOptional. The request sent to the container. | 
| request_file | strOptional. The path to a request file sent to the container. | 
| headers | DictOptional. The headers in the prediction request. | 
| verbose | boolRequired. Whether or not print logs if any. | 
| Type | Description | 
| RuntimeError | If the local endpoint has been stopped. | 
| ValueError | If both requestandrequest_fileare specified, bothrequestandrequest_fileare not provided, orrequest_fileis specified but does not exist. | 
| requests.exception.RequestException | If the request fails with an exception. | 
print_container_logs
print_container_logs(show_all: bool = False, message: Optional[str] = None)Prints container logs.
| Name | Description | 
| show_all | boolRequired. If True, prints all logs since the container starts. | 
| message | strOptional. The message to be printed before printing the logs. | 
print_container_logs_if_container_is_not_running
print_container_logs_if_container_is_not_running(
    show_all: bool = False, message: Optional[str] = None
)Prints container logs if the container is not in "running" status.
| Name | Description | 
| show_all | boolRequired. If True, prints all logs since the container starts. | 
| message | strOptional. The message to be printed before printing the logs. | 
run_health_check
run_health_check(verbose: bool = True)Runs a health check.
| Name | Description | 
| verbose | boolRequired. Whether or not print logs if any. | 
| Type | Description | 
| RuntimeError | If the local endpoint has been stopped. | 
| requests.exception.RequestException | If the request fails with an exception. | 
serve
serve()Starts running the container and serves the traffic locally.
An environment variable, GOOGLE_CLOUD_PROJECT, will be set to the project in the global config.
This is required if the credentials file does not have project specified and used to
recognize the project by the Cloud Storage client.
| Type | Description | 
| DockerError | If the container is not ready or health checks do not succeed after the timeout. | 
stop
stop()Explicitly stops the container.