👋 Welcome to Tile Language#
Tile Language (tile-lang) is a concise domain-specific language designed to streamline the development of high-performance GPU/CPU kernels (e.g., GEMM, Dequant GEMM, FlashAttention, LinearAttention). By employing a Pythonic syntax with an underlying compiler infrastructure on top of TVM, tile-lang allows developers to focus on productivity without sacrificing the low-level optimizations necessary for state-of-the-art performance.
GET STARTED
TUTORIALS
DEEP LEARNING OPERATORS
- ElementWise Operators
- General Matrix-Vector Multiplication (GEMV)
- Triton implementation
- Naive Implementation in TileLang
- More concurrency
- Vectorized Reads
tvm_thread_allreduce
Instead ofatomicAdd
- Autotune
- Conclusion
- General Matrix-Matrix Multiplication with Tile Library
- General Matrix-Matrix Multiplication with Dequantization
- Flash Attention
- Flash Linear Attention
- Convolution
- TMAC: Look Up Table Based Mixed Precision Computing
LANGUAGE REFERENCE
API Reference
Privacy