Question about dtype check in marlin_qqq validation for w4a8 functionality #2115

Open

xxw11 opened this issue Apr 23, 2025 · 1 comment

xxw11 commented Apr 23, 2025

Hi torchao developers,

Recently, while experimenting with the w4a8 functionality in torchao, I noticed that the marlin_qqq check function requires

input_tensor.dtype == torch.float16

This seems potentially problematic, as most modern models typically use bf16 or fp32 for activations. Forcing a conversion to float16 might introduce precision loss, or even NaN issues in some cases due to fp16's much smaller dynamic range.
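For reference, a minimal sketch of the kind of guard and cast this check implies (the helper name is hypothetical and this is not torchao's actual code), assuming the Marlin QQQ kernel path only accepts fp16 activations:

```python
import torch

def _cast_activation_for_marlin_qqq(input_tensor: torch.Tensor) -> torch.Tensor:
    # Hypothetical helper illustrating the constraint: the kernel path is gated
    # on fp16 activations, so bf16/fp32 inputs would have to be cast first.
    if input_tensor.dtype == torch.float16:
        return input_tensor
    # Casting down to fp16 shrinks the exponent range relative to bf16/fp32
    # (max finite value ~65504), so large activations can overflow to inf/NaN;
    # fp32 -> fp16 additionally loses mantissa precision.
    return input_tensor.to(torch.float16)

# Example: bf16 activations would be silently narrowed before hitting the kernel.
x = torch.randn(2, 4096, dtype=torch.bfloat16)
x_fp16 = _cast_activation_for_marlin_qqq(x)  # now torch.float16
```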

Could you clarify if this dtype check is strictly necessary? Are there specific constraints or optimizations that depend on float16 here?

Thank you for your insights!

[Screenshot: the float16 dtype check in the marlin_qqq input validation code]

@jerryzh168 (Contributor) commented

cc @HandH1998: can the Marlin QQQ kernel be extended to support bfloat16 as well?
