@@ -17,6 +17,18 @@ This document provides detailed examples for using the MLPerf Inference Harness
1717 python harness_main.py --model-category llama3.1-8b --model RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8 --dataset-path cnn_eval.json --dataset-name llama3.1-8b --server-config backendserver/simple.yaml --scenario Server --test-mode performance --batch-size 13368 --num-samples 13368 --output-dir TEST-SERVER --lg-model-name llama3_1-8b --server-target-qps 40
1818 ```
1919
20+ ### Sample command for Deepseek
21+ ``` bash
22+ python harness_main.py --model-category deepseek-r1 --model deepseek-ai/DeepSeek-R1-0528 --dataset-path mlperf_deepseek_r1_dataset_4388_fp8_eval.pkl --dataset-name deepseek-r1 --scenario Offline --test-mode performance --batch-size 4388 --num-samples 4388 --output-dir < output_dir> --lg-model-name deepseek-r1 --server-config backendserver/deepseek.yaml --mlflow-experiment-name testing-stuff --mlflow-host ip --enable-metrics
23+ ```
24+
25+ ### Sample command with MLflow auto-upload
26+ ``` bash
27+ python harness_main.py --model-category llama3.1-8b --model RedHatAI/Meta-Llama-3.1-8B-Instruct-FP8 --dataset-path cnn_eval.json --dataset-name llama3.1-8b --server-config backendserver/simple.yaml --scenario Server --test-mode performance --batch-size 13368 --num-samples 13368 --output-dir TEST-SERVER-2 --lg-model-name llama3_1-8b --server-target-qps 40 --mlflow-experiment-name testing-stuff --mlflow-host 150.239.115.202
28+ ```
29+
30+ ** Note:** When ` --mlflow-experiment-name ` and ` --mlflow-host ` are provided, results are automatically uploaded to MLflow after the test completes successfully. No separate upload step is needed.
31+
2032## Basic Usage
2133
2234### Example 1: Simplest Case - Using Model Name Auto-Detection
@@ -81,86 +93,7 @@ python harness/harness_main.py \
8193### Example 5: DeepSeek R1 Model
8294
8395``` bash
84- python language/deepseek-r1/harness_deepseek_r1.py \
85- --model deepseek-ai/DeepSeek-R1-0528 \
86- --dataset-path ./deepseek_dataset.pkl \
87- --dataset-name deepseek-r1
88- ```
89-
90- ### Example 6: Using Model Config File
91-
92- ``` bash
93- # Model config is auto-loaded from configs/models/llama3.1-8b.yaml
94- python harness/harness_main.py \
95- --model llama3.1-8b \
96- --dataset-path ./cnn_eval.json
97- ```
98-
99- ## Dataset Configuration
100-
101- ### Example 7: Using Dataset Config File
102-
103- ``` bash
104- # Specify exact config file to use
105- python harness/harness_main.py \
106- --model meta-llama/Llama-3.1-8B-Instruct \
107- --dataset-path ./my_dataset.pkl \
108- --dataset-config-file configs/datasets/my-dataset.yaml
109- ```
110-
111- ### Example 8: Overriding Column Names
112-
113- ``` bash
114- # Override column mappings without creating config file
115- python harness/harness_main.py \
116- --model my-model/MyModel \
117- --dataset-path ./dataset.pkl \
118- --input-column prompt \
119- --input-ids-column token_ids \
120- --output-column target
121- ```
122-
123- ### Example 9: Combining Config and Overrides
124-
125- ``` bash
126- # Use config file but override specific column
127- python harness/harness_main.py \
128- --model meta-llama/Llama-3.1-8B-Instruct \
129- --dataset-path ./cnn_eval.json \
130- --dataset-config-file configs/datasets/llama3.1-8b.yaml \
131- --input-column custom_input # Overrides config's input_column
132- ```
133-
134- ### Example 10: Different Dataset Formats
135-
136- ``` bash
137- # JSON dataset
138- python harness/harness_main.py \
139- --model llama3.1-8b \
140- --dataset-path ./cnn_eval.json \
141- --dataset-name llama3.1-8b
142-
143- # Pickle dataset
144- python harness/harness_main.py \
145- --model deepseek-r1 \
146- --dataset-path ./deepseek_dataset.pkl \
147- --dataset-name deepseek-r1
148-
149- # CSV dataset (needs config with column mappings)
150- python harness/harness_main.py \
151- --model my-model \
152- --dataset-path ./my_dataset.csv \
153- --dataset-name my-dataset
154- ```
155-
156- ## Endpoint Configuration
157-
158- ### Example 11: Using Completions Endpoint (Default)
159-
160- ``` bash
161- # Explicitly specify completions endpoint
162- python harness/harness_main.py \
163- --model llama3.1-8b \
96+ model llama3.1-8b \
16497 --dataset-path ./cnn_eval.json \
16598 --api-server-url http://localhost:8000 \
16699 --endpoint-type completions
0 commit comments