Skip to content

[FEA] Support of FP32 scales in FP8 scaledMM #456

@sanchitintel

Description

@sanchitintel

Summary

Some models use Float scales even for FP8 weights. FP8 ScaledMM should be able to support FP32 weights

Details

Currently, FP16 scales are being used in FP8 scaledMM.
I tried to use FP32 scales by using an FP32 copy-atom for scales, which is similar to the FP16 copy-atom being used to copy scales, but it didn't help

https://github.com/codeplaysoftware/cutlass-sycl/compare/sycl-develop...sanchitintel:enable_float_scales_fp8_scaledmm?expand=1

Please advise how to go about adding support for Float32 scales in FP8 scaledMM.

Thank you!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions