Skip to content
This repository was archived by the owner on Mar 13, 2025. It is now read-only.

Commit e3d5295

Browse files
author
li.i.yang
committed
modify model configuration yaml
1 parent bfb93d1 commit e3d5295

File tree

3 files changed

+3
-84
lines changed

3 files changed

+3
-84
lines changed

models/continuous_batching/mistralai--Mixtral-7b-Instruct-v02.yaml

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -12,7 +12,8 @@ deployment_config:
1212
max_concurrent_queries: 192
1313
ray_actor_options:
1414
resources:
15-
accelerator_type_a100_80g_aws: 0.1
15+
accelerator_type_a10_48g: 0.3
16+
accelerator_type_a10_24g: 1
1617
engine_config:
1718
model_id: mistralai/Mistral-7B-Instruct-v0.2
1819
hf_model_id: mistralai/Mistral-7B-Instruct-v0.2
@@ -38,4 +39,4 @@ scaling_config:
3839
num_cpus_per_worker: 8
3940
placement_strategy: "STRICT_PACK"
4041
resources_per_worker:
41-
accelerator_type_a100_80g_aws: 0.1
42+
accelerator_type_a10_24g: 1

rayllm/backend/mistralai--Mixtral-7b-Instruct-v02.yaml

Lines changed: 0 additions & 41 deletions
This file was deleted.

rayllm/mistralai--Mixtral-7b-Instruct-v02.yaml

Lines changed: 0 additions & 41 deletions
This file was deleted.

0 commit comments

Comments
 (0)