tilelang.autotuner.param
The auto-tune parameters.
Attributes
Classes
- CompileArgs: Compile arguments for the auto-tuner. Detailed description can be found in tilelang.jit.compile.
- ProfileArgs: Profile arguments for the auto-tuner.
- AutotuneResult: Results from auto-tuning process.
Module Contents
- tilelang.autotuner.param.BEST_CONFIG_PATH = 'best_config.json'
- tilelang.autotuner.param.FUNCTION_PATH = 'function.pkl'
- tilelang.autotuner.param.LATENCY_PATH = 'latency.json'
- tilelang.autotuner.param.KERNEL_PATH = 'kernel.cu'
- tilelang.autotuner.param.WRAPPED_KERNEL_PATH = 'wrapped_kernel.cu'
- tilelang.autotuner.param.KERNEL_LIB_PATH = 'kernel_lib.so'
- tilelang.autotuner.param.PARAMS_PATH = 'params.pkl'
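
The constants above look like file names rather than full paths, so a caller would presumably join them with a directory of its own choosing. The following is a minimal sketch under that assumption; `artifact_paths`, `missing_artifacts`, and the directory argument are hypothetical helpers for illustration, not part of tilelang.

```python
from pathlib import Path

# File names copied from the module constants listed above.
BEST_CONFIG_PATH = "best_config.json"
FUNCTION_PATH = "function.pkl"
LATENCY_PATH = "latency.json"
KERNEL_PATH = "kernel.cu"
WRAPPED_KERNEL_PATH = "wrapped_kernel.cu"
KERNEL_LIB_PATH = "kernel_lib.so"
PARAMS_PATH = "params.pkl"

ARTIFACT_NAMES = [
    BEST_CONFIG_PATH, FUNCTION_PATH, LATENCY_PATH, KERNEL_PATH,
    WRAPPED_KERNEL_PATH, KERNEL_LIB_PATH, PARAMS_PATH,
]


def artifact_paths(result_dir: Path) -> dict[str, Path]:
    """Map each artifact file name to its location under result_dir (hypothetical helper)."""
    return {name: result_dir / name for name in ARTIFACT_NAMES}


def missing_artifacts(result_dir: Path) -> list[str]:
    """Return the artifact file names that do not exist under result_dir."""
    return [name for name, p in artifact_paths(result_dir).items() if not p.exists()]
```

A check like `missing_artifacts(some_dir)` could be run before attempting to reload a saved tuning result, so that an incomplete directory is detected early.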
- class tilelang.autotuner.param.CompileArgs
Compile arguments for the auto-tuner. Detailed description can be found in tilelang.jit.compile.
- out_idx
List of output tensor indices.
- execution_backend
Execution backend to use for kernel execution (default: "cython").
- target
Compilation target, either as a string or a TVM Target object (default: "auto").
- target_host
Target host for cross-compilation (default: None).
- verbose
Whether to enable verbose output (default: False).
- pass_configs
Additional keyword arguments to pass to the Compiler PassContext. Refer to `tilelang.PassConfigKey` for supported options.
- out_idx: list[int] | int | None = None
- execution_backend: Literal['dlpack', 'ctypes', 'cython'] = 'cython'
- target: Literal['auto', 'cuda', 'hip'] = 'auto'
- target_host: str | tvm.target.Target = None
- verbose: bool = False
- pass_configs: dict[str, Any] | None = None
- compile_program(program)
- Parameters:
program (tvm.tir.PrimFunc)
- __hash__()
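
A custom `__hash__()` on an argument container is typically there so that instances can serve as cache keys. The sketch below illustrates that idea with a stand-in dataclass that mirrors the documented fields; `CompileArgsSketch` and its hashing scheme are assumptions for illustration and do not reproduce tilelang's actual implementation.

```python
from dataclasses import dataclass
from typing import Any, Optional, Union


@dataclass
class CompileArgsSketch:
    """Hypothetical stand-in with the documented fields; not tilelang's class."""
    out_idx: Union[list, int, None] = None
    execution_backend: str = "cython"
    target: str = "auto"
    target_host: Optional[str] = None
    verbose: bool = False
    pass_configs: Optional[dict] = None

    def __hash__(self) -> int:
        # Lists and dicts are unhashable, so convert them to tuples first.
        out_idx = tuple(self.out_idx) if isinstance(self.out_idx, list) else self.out_idx
        # repr() keeps the key hashable even when a config value is not.
        configs = tuple(sorted((k, repr(v)) for k, v in (self.pass_configs or {}).items()))
        return hash((out_idx, self.execution_backend, self.target,
                     self.target_host, self.verbose, configs))


# Two equal argument sets hash identically, so either one can look up a cached entry.
first = CompileArgsSketch(out_idx=[-1], target="cuda")
second = CompileArgsSketch(out_idx=[-1], target="cuda")
kernel_cache = {first: "compiled kernel placeholder"}
```

Because the dataclass defines `__hash__` explicitly, `@dataclass` leaves it in place, and the generated `__eq__` makes `kernel_cache[second]` resolve to the entry stored under `first`.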
- class tilelang.autotuner.param.ProfileArgs
Profile arguments for the auto-tuner.
- warmup
Number of warmup iterations.
- rep
Number of repetitions for timing.
- timeout
Maximum time per configuration.
- supply_type
Type of tensor supply mechanism (tilelang.TensorSupplyType, default: tilelang.TensorSupplyType.Auto).
- ref_prog
Reference program for correctness validation (Callable, default: None).
- supply_prog
Supply program for input tensors (Callable, default: None).
- out_idx
Union[List[int], int], default: -1.
- rtol
float, default: 1e-2.
- atol
float, default: 1e-2.
- max_mismatched_ratio
float, default: 0.01.
- skip_check
bool, default: False.
- manual_check_prog
Callable, default: None.
- cache_input_tensors
bool, default: True.
- warmup: int = 25
- rep: int = 100
- timeout: int = 30
- supply_type: tilelang.TensorSupplyType
- ref_prog: Callable = None
- supply_prog: Callable = None
- rtol: float = 0.01
- atol: float = 0.01
- max_mismatched_ratio: float = 0.01
- skip_check: bool = False
- manual_check_prog: Callable = None
- cache_input_tensors: bool = True
- __hash__()
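
The page does not spell out how `rtol`, `atol`, and `max_mismatched_ratio` interact. One plausible reading, sketched below, is a NumPy-style element test `|actual - expected| <= atol + rtol * |expected|` combined with a cap on the fraction of elements allowed to fail it. The function `within_tolerance` is hypothetical and the exact rule tilelang applies may differ.

```python
from typing import Sequence


def within_tolerance(
    actual: Sequence[float],
    expected: Sequence[float],
    rtol: float = 1e-2,
    atol: float = 1e-2,
    max_mismatched_ratio: float = 0.01,
) -> bool:
    """Return True if the fraction of mismatched elements stays within the cap.

    Assumed rule (not confirmed by the source): an element mismatches when
    |actual - expected| > atol + rtol * |expected|.
    """
    if len(actual) != len(expected):
        raise ValueError("actual and expected must have the same length")
    if not expected:
        return True  # nothing to compare
    mismatched = sum(
        1 for a, e in zip(actual, expected) if abs(a - e) > atol + rtol * abs(e)
    )
    return mismatched / len(expected) <= max_mismatched_ratio
```

Under this reading, the defaults tolerate up to 1% of elements falling outside the per-element bound, which allows for a few outliers in low-precision kernels without accepting a broadly wrong result.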
- class tilelang.autotuner.param.AutotuneResult
Results from auto-tuning process.
- latency
Best achieved execution latency.
- config
Configuration that produced the best result.
- ref_latency
Reference implementation latency.
- libcode
Generated library code.
- func
Optimized function.
- kernel
Compiled kernel function.
- latency: float | None = None
- config: dict | None = None
- ref_latency: float | None = None
- libcode: str | None = None
- func: Callable | None = None
- kernel: Callable | None = None
- save_to_disk(path, verbose=False)
- Parameters:
path (pathlib.Path)
verbose (bool)
- classmethod load_from_disk(path, compile_args)
- Parameters:
path (pathlib.Path)
compile_args (CompileArgs)
- Return type:
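
The save and load methods above take a `pathlib.Path`, and the module constants suggest that the best configuration and the latencies end up in `best_config.json` and `latency.json`. The sketch below shows what such a round trip could look like for the JSON-serializable fields only. The helper names, the JSON layout, and the example config keys are assumptions, not the library's actual on-disk format.

```python
import json
from pathlib import Path
from typing import Optional

# File names copied from the module constants documented on this page.
BEST_CONFIG_PATH = "best_config.json"
LATENCY_PATH = "latency.json"


def save_summary(path: Path, config: dict, latency: float,
                 ref_latency: Optional[float] = None) -> None:
    """Write the config and latencies as JSON under path (hypothetical layout)."""
    path.mkdir(parents=True, exist_ok=True)
    (path / BEST_CONFIG_PATH).write_text(json.dumps(config))
    (path / LATENCY_PATH).write_text(
        json.dumps({"latency": latency, "ref_latency": ref_latency})
    )


def load_summary(path: Path) -> tuple:
    """Read back (config, latency, ref_latency) written by save_summary."""
    config = json.loads((path / BEST_CONFIG_PATH).read_text())
    timing = json.loads((path / LATENCY_PATH).read_text())
    return config, timing["latency"], timing["ref_latency"]
```

The compiled kernel and the function object are not JSON-serializable, which would explain why the module also names `.pkl`, `.cu`, and `.so` files, and why `load_from_disk` takes a `CompileArgs` argument.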