tilelang.language.fp8
Functions
- determine_fp8_type(fp8_format='e4m3'): Select the correct FP8 dtype string for the current platform.
- determine_torch_fp8_type(fp8_format='e4m3')
Module Contents
- tilelang.language.fp8.determine_fp8_type(fp8_format='e4m3')
Select the correct FP8 dtype string for the current platform.
- CUDA defaults to FP8 E4M3FN / E5M2.
- ROCm uses FNUZ, except on gfx950 (OCP), which prefers non-FNUZ when available.
- Parameters:
fp8_format (Literal['e4m3', 'e5m2'])
- Return type:
str
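The selection rule described above can be sketched as a small standalone function. This is an illustration only, not tilelang's implementation: the helper name `pick_fp8_dtype_name`, its `is_rocm` and `gcn_arch` parameters, and the returned strings (which follow PyTorch's float8 dtype naming) are all assumptions made for the example. It also ignores the "when available" check and simply treats gfx950 as non-FNUZ.

```python
from typing import Literal

Fp8Format = Literal["e4m3", "e5m2"]


def pick_fp8_dtype_name(
    fp8_format: Fp8Format = "e4m3",
    *,
    is_rocm: bool = False,   # hypothetical flag: True on a ROCm build
    gcn_arch: str = "",      # hypothetical arch string, e.g. "gfx942" or "gfx950"
) -> str:
    """Hypothetical sketch of the platform rule; not the library's code."""
    if fp8_format not in ("e4m3", "e5m2"):
        raise ValueError(f"unsupported fp8_format: {fp8_format!r}")

    # ROCm uses the FNUZ variants, except on gfx950 (OCP formats).
    use_fnuz = is_rocm and not gcn_arch.startswith("gfx950")

    if fp8_format == "e4m3":
        return "float8_e4m3fnuz" if use_fnuz else "float8_e4m3fn"
    return "float8_e5m2fnuz" if use_fnuz else "float8_e5m2"


# CUDA-style platform: non-FNUZ names.
cuda_e4m3 = pick_fp8_dtype_name("e4m3")
# ROCm on a non-gfx950 architecture: FNUZ names.
rocm_e5m2 = pick_fp8_dtype_name("e5m2", is_rocm=True, gcn_arch="gfx942")
# ROCm on gfx950: back to the non-FNUZ (OCP) names.
gfx950_e4m3 = pick_fp8_dtype_name("e4m3", is_rocm=True, gcn_arch="gfx950")
```

In real code, call `determine_fp8_type` itself rather than re-deriving the rule, so that kernels pick up whatever platform detection the library performs.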
- tilelang.language.fp8.determine_torch_fp8_type(fp8_format='e4m3')
- Parameters:
fp8_format (Literal['e4m3', 'e5m2'])
- Return type:
torch.dtype