tilelang.intrinsics.mfma_layout
Attributes
Functions
Return (m, n) = (row, col) for the 32x32 MFMA output register layout.
Module Contents
Return (m, n) = (row, col) for the 32x32 MFMA output register layout.
For v_mfma_i32_32x32x32_i8 (gfx950), each wave-64 lane holds 16 output i32 values. The column (N-dimension) is indexed by
thread_id % 32 and the row (M-dimension) is given by the interleaved formula below. This function returns (m_idx, n_idx) matching the (row, col) convention expected by stmatrix.
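The lane-to-element mapping described above can be sketched in plain Python. This is a minimal illustration, not the module's code: the function name mfma_32x32_output_coords and the parameter local_id (the index of the value within a lane's 16 outputs) are hypothetical, and the row formula is an assumption based on the interleaving commonly documented for 32x32 MFMA accumulators, since this page does not reproduce it. Only the column rule, thread_id % 32, comes from the text.

```python
from typing import Tuple


def mfma_32x32_output_coords(thread_id: int, local_id: int) -> Tuple[int, int]:
    """Hypothetical helper: map (lane, per-lane value index) to (row, col).

    thread_id: lane within the wave, 0..63.
    local_id:  index of the output value held by that lane, 0..15.
    """
    # Column rule stated in the documentation: N index is the lane modulo 32.
    n_idx = thread_id % 32
    # ASSUMED row rule: each group of 4 values covers 4 consecutive rows,
    # groups are 8 rows apart, and the upper half of the wave
    # (lanes 32..63) is offset by 4 rows.
    m_idx = 8 * (local_id // 4) + 4 * (thread_id // 32) + (local_id % 4)
    return m_idx, n_idx


def covers_tile_exactly_once() -> bool:
    """Check that 64 lanes x 16 values hit every element of the 32x32 tile once."""
    coords = [
        mfma_32x32_output_coords(t, r) for t in range(64) for r in range(16)
    ]
    full_tile = {(m, n) for m in range(32) for n in range(32)}
    return len(coords) == len(set(coords)) and set(coords) == full_tile
```

Under this assumed mapping, the 64 x 16 = 1024 (lane, value) pairs cover each of the 1024 elements of the 32x32 output tile exactly once, which is a useful sanity check for any candidate layout function.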
- tilelang.intrinsics.mfma_layout.make_mfma_swizzle_layout(shared_buf, vecSize=8)