tilelang.autotuner.paramΒΆ

The auto-tune parameters.

AttributesΒΆ

ClassesΒΆ

CompileArgs

Compile arguments for the auto-tuner. Detailed description can be found in tilelang.jit.compile.

ProfileArgs

Profile arguments for the auto-tuner.

AutotuneResult

Results from auto-tuning process.

Module ContentsΒΆ

tilelang.autotuner.param.BEST_CONFIG_PATH = 'best_config.json'ΒΆ
tilelang.autotuner.param.FUNCTION_PATH = 'function.pkl'ΒΆ
tilelang.autotuner.param.LATENCY_PATH = 'latency.json'ΒΆ
tilelang.autotuner.param.KERNEL_PATH = 'kernel.cu'ΒΆ
tilelang.autotuner.param.WRAPPED_KERNEL_PATH = 'wrapped_kernel.cu'ΒΆ
tilelang.autotuner.param.KERNEL_LIB_PATH = 'kernel_lib.so'ΒΆ
tilelang.autotuner.param.PARAMS_PATH = 'params.pkl'ΒΆ
class tilelang.autotuner.param.CompileArgsΒΆ

Compile arguments for the auto-tuner. Detailed description can be found in tilelang.jit.compile. .. attribute:: out_idx

List of output tensor indices.

execution_backendΒΆ

Execution backend to use for kernel execution (default: β€œcython”).

targetΒΆ

Compilation target, either as a string or a TVM Target object (default: β€œauto”).

target_hostΒΆ

Target host for cross-compilation (default: None).

verboseΒΆ

Whether to enable verbose output (default: False).

pass_configsΒΆ

Additional keyword arguments to pass to the Compiler PassContext.

Refer to `tilelang.PassConfigKey` for supported options.
out_idx: list[int] | int | None = NoneΒΆ
execution_backend: Literal['dlpack', 'ctypes', 'cython'] = 'cython'ΒΆ
target: Literal['auto', 'cuda', 'hip'] = 'auto'ΒΆ
target_host: str | tvm.target.Target = NoneΒΆ
verbose: bool = FalseΒΆ
pass_configs: dict[str, Any] | None = NoneΒΆ
compile_program(program)ΒΆ
Parameters:

program (tvm.tir.PrimFunc)

__hash__()ΒΆ
class tilelang.autotuner.param.ProfileArgsΒΆ

Profile arguments for the auto-tuner.

warmupΒΆ

Number of warmup iterations.

repΒΆ

Number of repetitions for timing.

timeoutΒΆ

Maximum time per configuration.

supply_typeΒΆ

Type of tensor supply mechanism.

ref_progΒΆ

Reference program for correctness validation.

supply_progΒΆ

Supply program for input tensors.

out_idxΒΆ

Union[List[int], int] = -1

supply_typeΒΆ

tilelang.TensorSupplyType = tilelang.TensorSupplyType.Auto

ref_progΒΆ

Callable = None

supply_progΒΆ

Callable = None

rtolΒΆ

float = 1e-2

atolΒΆ

float = 1e-2

max_mismatched_ratioΒΆ

float = 0.01

skip_checkΒΆ

bool = False

manual_check_progΒΆ

Callable = None

cache_input_tensorsΒΆ

bool = True

warmup: int = 25ΒΆ
rep: int = 100ΒΆ
timeout: int = 30ΒΆ
supply_type: tilelang.TensorSupplyTypeΒΆ
ref_prog: Callable = NoneΒΆ
supply_prog: Callable = NoneΒΆ
rtol: float = 0.01ΒΆ
atol: float = 0.01ΒΆ
max_mismatched_ratio: float = 0.01ΒΆ
skip_check: bool = FalseΒΆ
manual_check_prog: Callable = NoneΒΆ
cache_input_tensors: bool = TrueΒΆ
__hash__()ΒΆ
class tilelang.autotuner.param.AutotuneResultΒΆ

Results from auto-tuning process.

latencyΒΆ

Best achieved execution latency.

configΒΆ

Configuration that produced the best result.

ref_latencyΒΆ

Reference implementation latency.

libcodeΒΆ

Generated library code.

funcΒΆ

Optimized function.

kernelΒΆ

Compiled kernel function.

latency: float | None = NoneΒΆ
config: dict | None = NoneΒΆ
ref_latency: float | None = NoneΒΆ
libcode: str | None = NoneΒΆ
func: Callable | None = NoneΒΆ
kernel: Callable | None = NoneΒΆ
save_to_disk(path, verbose=False)ΒΆ
Parameters:
  • path (pathlib.Path)

  • verbose (bool)

classmethod load_from_disk(path, compile_args)ΒΆ
Parameters:
Return type:

AutotuneResult