[offload] Redesign the ELF format for device-side binaries. #139037

StevenYangCC · 2025-05-08T07:28:48Z

Redesign the ELF format for device-side binaries.
The current binary format is not well suited to device-side binaries.
For example, a redesigned format could:
Provide a callgraph section to enable binary optimizations at link time, as well as support for lazy loading.
Provide a prototype section to support function pointers.
Provide a section for recording metadata for each function, such as register kind, register number, stack size, etc. This metadata should be extensible, as different architectures may require different metadata, and the attributes of kernel functions and regular functions may also vary.
Moderately reference the CUDA cubin format for inspiration.
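
To make the callgraph item concrete, here is a minimal sketch of how a consumer might use such a section, assuming it records direct caller-to-callee edges. The function names and the dictionary layout are hypothetical and do not come from any existing ELF or cubin format. A breadth-first walk from a kernel entry point yields the set of functions that must be kept or loaded; everything else is a candidate for pruning at link time or for deferred loading.

```python
from collections import deque

# Hypothetical call graph, as a callgraph section might record it:
# each caller maps to the functions it calls directly.
CALL_EDGES = {
    "kernel_main": ["helper_a", "helper_b"],
    "helper_a": ["helper_c"],
    "helper_b": [],
    "helper_c": [],
    "unused_fn": ["helper_c"],
}


def reachable_from(entry, edges):
    """Return every function reachable from `entry` via direct calls (BFS)."""
    seen = {entry}
    queue = deque([entry])
    while queue:
        fn = queue.popleft()
        for callee in edges.get(fn, []):
            if callee not in seen:
                seen.add(callee)
                queue.append(callee)
    return seen


# Functions the kernel actually needs, and those that could be dropped
# or loaded lazily.
needed = reachable_from("kernel_main", CALL_EDGES)
droppable = sorted(set(CALL_EDGES) - needed)
```

This sketch only follows direct calls. Calls through function pointers would have to be handled conservatively, for instance by treating every address-taken function with a matching prototype as reachable.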

llvmbot · 2025-05-08T14:07:13Z

@llvm/issue-subscribers-offload

Author: None (StevenYangCC)

jhuber6 · 2025-05-08T14:14:34Z

I also do not understand this issue. Most of what you're describing is handled by the GPU runtime's loader, which is not something we have access to from offload/. We already have metadata for things like this in AMDGPU at least, unless you're talking about using that information with a call graph to generate the launch packet metadata?

Artem-B · 2025-05-08T16:25:36Z

@StevenYangCC if you could elaborate on the issue(s) that prompt this request, it would be very helpful to figure out how those issues should be addressed.

StevenYangCC · 2025-05-09T02:19:04Z

The ELF format generated by LLVM compilation is very rigid and cannot be flexibly handled on different architectures and does not meet the needs of heterogeneous computing architectures.

jhuber6 · 2025-05-09T02:41:17Z

> The ELF format generated by LLVM compilation is very rigid and cannot be flexibly handled on different architectures and does not meet the needs of heterogeneous computing architectures.

This is completely vague so I'm just going to close this.

StevenYangCC · 2025-05-09T02:50:09Z

@jhuber6 What I mean is that the device-side ELF format should be redesigned instead of using the same ELF format as the host-side, since the needs of the two are not consistent. For example, the metadata in the file in the device side of the ELF for AMDGPUs is currently a big, overarching structure, and a lot of unneeded attributes take up space as well. We can design a binary ELF format that is applicable to the device side of all heterogeneous architectures, and each architecture can flexibly add sections, symbols, or attributes.
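
One way to picture "each architecture can flexibly add attributes" without unused fields taking up space is a tag/length/value encoding, where a function carries only the attributes it needs and a reader skips tags it does not recognize. The following is a minimal sketch under that assumption; the tag numbers, field widths, and helper names are hypothetical and are not taken from the AMDGPU metadata format or any other real specification.

```python
import struct

# Hypothetical tag numbers, chosen for illustration only.
TAG_REG_COUNT = 1
TAG_STACK_SIZE = 2
TAG_VENDOR_EXT = 0x8000  # stands in for an architecture-specific attribute


def encode_attrs(attrs):
    """Pack {tag: payload bytes} as records of u16 tag, u16 length, payload."""
    out = bytearray()
    for tag, payload in attrs.items():
        out += struct.pack("<HH", tag, len(payload))
        out += payload
    return bytes(out)


def decode_attrs(blob, known_tags):
    """Return {tag: payload} for known tags, skipping unknown ones by length."""
    result = {}
    offset = 0
    while offset < len(blob):
        tag, length = struct.unpack_from("<HH", blob, offset)
        offset += 4
        if tag in known_tags:
            result[tag] = blob[offset:offset + length]
        offset += length
    return result


# A function record that carries two common attributes and one extension.
blob = encode_attrs({
    TAG_REG_COUNT: struct.pack("<I", 32),
    TAG_STACK_SIZE: struct.pack("<I", 256),
    TAG_VENDOR_EXT: b"\x01\x02\x03",
})

# A reader that only understands the common tags ignores the extension.
decoded = decode_attrs(blob, {TAG_REG_COUNT, TAG_STACK_SIZE})
reg_count = struct.unpack("<I", decoded[TAG_REG_COUNT])[0]
stack_size = struct.unpack("<I", decoded[TAG_STACK_SIZE])[0]
```

The trade-off in this design is a few bytes of header per attribute in exchange for omitting absent attributes entirely and letting old readers tolerate new tags.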

jhuber6 · 2025-05-09T02:56:48Z

> @jhuber6 What I mean is that the device-side ELF format should be redesigned instead of using the same ELF format as the host-side, since the needs of the two are not consistent. For example, the metadata in the file in the device side of the ELF for AMDGPUs is currently a big, overarching structure, and a lot of unneeded attributes take up space as well. We can design a binary ELF format that is applicable to the device side of all heterogeneous architectures, and each architecture can flexibly add sections, symbols, or attributes.

I do not know what this means, feel free to write up a design document and contribute patches.

StevenYangCC · 2025-05-09T03:02:48Z

@jhuber6 You can express your doubts in detail and I will try my best to express them clearly.

Artem-B · 2025-05-09T23:37:44Z

> Moderately reference the CUDA cubin format for inspiration.

That request and responses read like a chat with an LLM. I wonder if we're being curl'ed here? https://arstechnica.com/gadgets/2025/05/open-source-project-curl-is-sick-of-users-submitting-ai-slop-vulnerabilities/

jhuber6 · 2025-05-10T00:02:38Z

> > Moderately reference the CUDA cubin format for inspiration.
>
> That request and responses read like a chat with an LLM. I wonder if we're being curl'ed here? https://arstechnica.com/gadgets/2025/05/open-source-project-curl-is-sick-of-users-submitting-ai-slop-vulnerabilities/

I said as much in #139039 which is why I closed the issue.

llvmbot added the new issue label May 8, 2025

EugeneZelenko added offload and removed new issue labels May 8, 2025

jhuber6 mentioned this issue May 8, 2025

[clang] Automatically generate builtin documentation from a .td File. #139039

Open

jhuber6 closed this as completed May 9, 2025

EugeneZelenko added the question A question, not bug report. Check out https://llvm.org/docs/GettingInvolved.html instead! label May 9, 2025
