[URGENT] Power issues #1581

@psyhtest

We've stumbled upon new submission checker issues with the latest power workflow.

Whitespace lines at the end of spl.txt files

For some reason, spl.txt files now end with a whitespace-only line (not entirely blank; probably a single space, as len(line) == 1). Such a line has no comma-separated fields, so line.split(",")[1] fails and the checker reports exceptions like "exception in check_power_dir: list index out of range".

We can work around this with the following patch:

krai@aus655-pci-bowie:~/work_collection/mlperf_inference_git_master$ git diff
diff --git a/tools/submission/power/power_checker.py b/tools/submission/power/power_checker.py
index 5adcd19..ac9cb0a 100755
--- a/tools/submission/power/power_checker.py
+++ b/tools/submission/power/power_checker.py
@@ -408,6 +408,7 @@ def phases_check(

         with open(spl_fname) as f:
             for line in f:
+                if not line.startswith("Time"): continue
                 timestamp = (
                     datetime.strptime(line.split(",")[1], datetime_format)
                 ).replace(tzinfo=timezone.utc)
diff --git a/tools/submission/submission_checker.py b/tools/submission/submission_checker.py
index e61590e..3b2ffe6 100755
--- a/tools/submission/submission_checker.py
+++ b/tools/submission/submission_checker.py
@@ -2230,6 +2230,7 @@ def get_power_metric(config, scenario_fixed, log_path, is_valid, res):
     power_list = []
     with open(spl_fname) as f:
         for line in f:
+            if not line.startswith("Time"): continue
             timestamp = (
                 datetime.datetime.strptime(line.split(",")[1], datetime_format)
                 + server_timezone

I suggest making a PR with this change, because we've already collected a number of logs with this issue, and their checksums cover the trailing whitespace line, so we cannot simply strip it from the files.
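For reference, here is a minimal standalone sketch of what the guard in the patch does. The function name parse_spl_timestamps, the sample lines, and the DATETIME_FORMAT value are assumptions made for illustration; the real checker uses its own datetime_format and surrounding logic.

```python
from datetime import datetime, timezone

# Assumed timestamp layout for illustration only; the checker defines
# its own datetime_format.
DATETIME_FORMAT = "%m-%d-%Y %H:%M:%S.%f"


def parse_spl_timestamps(lines):
    """Return UTC timestamps from sample lines, skipping everything else."""
    timestamps = []
    for line in lines:
        # A whitespace-only line splits into a single field, so indexing
        # field 1 would raise IndexError without this guard.
        if not line.startswith("Time"):
            continue
        stamp = datetime.strptime(line.split(",")[1], DATETIME_FORMAT)
        timestamps.append(stamp.replace(tzinfo=timezone.utc))
    return timestamps


# Hypothetical file content: two sample lines and a trailing one-space line.
sample_lines = [
    "Time,02-07-2024 10:15:30.250,Watts,101.5\n",
    "Time,02-07-2024 10:15:31.250,Watts,102.0\n",
    " ",
]

parsed = parse_spl_timestamps(sample_lines)

# Without the guard, the trailing line reproduces the reported failure.
try:
    " ".split(",")[1]
    unguarded_error = None
except IndexError as exc:
    unguarded_error = str(exc)
```

Filtering on the "Time" prefix, rather than only skipping blank lines, also ignores any other non-sample line that might appear, while leaving the collected log files (and their checksums) untouched.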

tools/submission/power/sources_checksums.json is 6 months old

Despite multiple changes to mlcommons/power-dev since the last round, the checksum file has not been updated since 2 August 2023.
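As a rough illustration of how a stale checksum file could be detected, the sketch below compares recorded hashes against the current source files. It assumes the JSON maps file names to SHA-1 hex digests; that layout, the function name stale_entries, and the file names client.py and server.py are assumptions for this example, not confirmed details of sources_checksums.json.

```python
import hashlib
import tempfile
from pathlib import Path


def stale_entries(checksums, source_dir):
    """Return the names whose current SHA-1 differs from the recorded one.

    checksums is assumed to map a file name to a SHA-1 hex digest.
    Files that are missing from source_dir are reported as stale too.
    """
    stale = []
    for name, recorded in sorted(checksums.items()):
        path = Path(source_dir) / name
        if not path.is_file():
            stale.append(name)
            continue
        current = hashlib.sha1(path.read_bytes()).hexdigest()
        if current != recorded:
            stale.append(name)
    return stale


# Hypothetical demo: one file changed since its hash was recorded,
# one recorded file no longer present, one file unchanged.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "client.py").write_bytes(b"print('v2')\n")
    (Path(tmp) / "lib.py").write_bytes(b"VALUE = 1\n")
    recorded = {
        "client.py": hashlib.sha1(b"print('v1')\n").hexdigest(),
        "lib.py": hashlib.sha1(b"VALUE = 1\n").hexdigest(),
        "server.py": hashlib.sha1(b"").hexdigest(),
    }
    result = stale_entries(recorded, tmp)
```

A check along these lines could be run against a power-dev checkout before each round to confirm whether the checksum file needs regenerating.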
