Commit 969c5bb
[BACKEND] Hotfix for perf regression (triton-lang#2822)
When a PTX file is annotated with `.target sm_80, debug`, `ptxas` cannot
apply compiler optimizations.
To verify this, add `-O3` to the compilation command: `ptxas` reports a
conflict between the `debug` constraint and the optimization flag.
To fix the problem, this PR converts `.target sm_<arch>, debug` to
`.target sm_<arch>` before invoking `ptxas`.

1 parent 56e7a3a, commit 969c5bb
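The rewrite described in the commit message can be sketched as a small text substitution on the PTX source. This is only an illustration, not the code from this commit: the helper name `strip_debug_target` and the regular expression are hypothetical, and the sketch assumes `debug` is the only option following the `sm_<arch>` target.

```python
import re

# Hypothetical pattern (not taken from the commit): matches a line of the
# form ".target sm_<arch>, debug" and captures the part before ", debug".
_TARGET_DEBUG = re.compile(r"^([ \t]*\.target[ \t]+sm_\w+)[ \t]*,[ \t]*debug\b",
                           re.MULTILINE)


def strip_debug_target(ptx: str) -> str:
    """Rewrite `.target sm_<arch>, debug` to `.target sm_<arch>`.

    Assumes `debug` directly follows the architecture; other target
    options are not handled in this sketch.
    """
    return _TARGET_DEBUG.sub(r"\1", ptx)


if __name__ == "__main__":
    ptx = ".version 8.0\n.target sm_80, debug\n.address_size 64\n"
    # The cleaned text would then be handed to ptxas instead of the original.
    print(strip_debug_target(ptx))
```

Anchoring the pattern on the `.target` directive, rather than removing every occurrence of `debug`, keeps the substitution from touching identifiers or comments elsewhere in the PTX.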
1 file changed, 2 additions and 0 deletions: two lines are added after
original line 186, as lines 187-188 of the new file.