Eagerly write to tuning database #2025

mirza-halilcevic · 2025-10-09T10:47:15Z

Motivation

Refactor of tuningRunner.py so that the winning configs are immediately written to the output file instead of waiting for the entire tuning process to finish, so we avoid situations where a crash forces us to start all over.

Technical Details

Code changes to the tuningRunner.py script to implement said functionality.

Resolves https://github.com/ROCm/rocMLIR-internal/issues/2017

Test Plan

Manually tested.

Test Result

Tuning results persist after the process is interrupted.

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

mlir/utils/performance/tuningRunner.py

dhernandez0 · 2025-10-09T11:00:08Z

mlir/utils/performance/tuningRunner.py


-    winners, allData = tuneMLIRKernels(configs, confClass, paths, options)
-
-    if winners is None:


this was a weird check, maybe we should still check len(entries ) > 0 inside tuneMLIRKernels()?

I think we should keep this check if winner is none inside tuneMLIRKernels.

In your opinion, should we abort the entire loop in case this happens, or just continue with the rest of the configs? Same goes for when verification fails?

I wonder if it is possible to continue with rest of tuning and print error message about config that is failing to pick any winning perfConfig. That would be better IMO

mlir/utils/performance/tuningRunner.py

umangyadav · 2025-10-10T12:41:31Z

mlir/utils/performance/tuningRunner.py

        if options.tflops:
-            winners[testVector] = (winningConfig,maxTFlops)
+            if not headerWritten:
+                print(f"# arch\tnumCUs\ttestVector\tperfConfig\tTFlops ({options.tuningSpaceKind})", file=outFile)


It is better to create list out of header columns and then do pd.DataFrame.to_csv(sep='\t)` instead of directly putting print inside the file itself.

umangyadav · 2025-10-10T12:44:52Z

mlir/utils/performance/tuningRunner.py

+            if not headerWritten:
+                print(f"# arch\tnumCUs\ttestVector\tperfConfig\tTFlops ({options.tuningSpaceKind})", file=outFile)
+                headerWritten = True
+            print(f"{options.arch}\t{options.numCU}\t{testVector}\t{winningConfig}\t{maxTFlops}", file=outFile)


It is better to create list and use pd.DataFrame.to_csv here too.

Copilot

Pull Request Overview

This PR refactors the tuning process to write winning configurations to the output file immediately after each test vector is tuned, rather than waiting for the entire tuning process to complete. This prevents loss of tuning results if the process crashes or is interrupted.

Modified tuneMLIRKernels to accept output file handles and write results immediately
Changed getWinningConfig to return collected entries instead of appending to a global list
Restructured main function to open files upfront and use try/finally for proper cleanup

_{Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.}

mlir/utils/performance/tuningRunner.py

mirza-halilcevic requested a review from causten as a code owner October 9, 2025 10:47

mirza-halilcevic requested review from dhernandez0, djramic, pabloantoniom and umangyadav October 9, 2025 10:47

dhernandez0 reviewed Oct 9, 2025

View reviewed changes

umangyadav reviewed Oct 10, 2025

View reviewed changes

mlir/utils/performance/tuningRunner.py Outdated Show resolved Hide resolved

umangyadav reviewed Oct 10, 2025

View reviewed changes

umangyadav requested a review from Copilot October 10, 2025 13:15

Copilot AI reviewed Oct 10, 2025

View reviewed changes

mlir/utils/performance/tuningRunner.py Outdated Show resolved Hide resolved

mlir/utils/performance/tuningRunner.py Outdated Show resolved Hide resolved

mirza-halilcevic added 2 commits October 10, 2025 16:34

Incrementally write tuning results to disk.

202a6cd

Address code review comments.

82e038c

mirza-halilcevic force-pushed the tuning-persistence branch from fa63939 to 82e038c Compare October 10, 2025 16:34

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Eagerly write to tuning database #2025

Eagerly write to tuning database #2025

mirza-halilcevic commented Oct 9, 2025

Uh oh!

Uh oh!

dhernandez0 Oct 9, 2025

Uh oh!

umangyadav Oct 10, 2025 •

edited

Loading

Uh oh!

mirza-halilcevic Oct 10, 2025

Uh oh!

umangyadav Oct 10, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

umangyadav Oct 10, 2025

Uh oh!

umangyadav Oct 10, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		winners, allData = tuneMLIRKernels(configs, confClass, paths, options)

		if winners is None:

Eagerly write to tuning database #2025

Are you sure you want to change the base?

Eagerly write to tuning database #2025

Conversation

mirza-halilcevic commented Oct 9, 2025

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Uh oh!

Uh oh!

dhernandez0 Oct 9, 2025

Choose a reason for hiding this comment

Uh oh!

umangyadav Oct 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mirza-halilcevic Oct 10, 2025

Choose a reason for hiding this comment

Uh oh!

umangyadav Oct 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

umangyadav Oct 10, 2025

Choose a reason for hiding this comment

Uh oh!

umangyadav Oct 10, 2025

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

umangyadav Oct 10, 2025 •

edited

Loading

umangyadav Oct 10, 2025 •

edited

Loading