vllm.model_executor.layers.quantization.utils
Modules:
Name | Description |
---|---|
allspark_utils | |
bitblas_utils | |
flashinfer_fp4_moe | Utility helpers for NVFP4 + FlashInfer fused-MoE path |
flashinfer_utils | |
fp8_utils | |
gptq_utils | |
int8_utils | |
layer_utils | |
machete_utils | |
marlin_utils | |
marlin_utils_fp4 | |
marlin_utils_fp8 | |
marlin_utils_test | Utility functions used for tests and benchmarks |
marlin_utils_test_24 | Utility functions used for tests and benchmarks |
mxfp4_utils | |
nvfp4_emulation_utils | |
nvfp4_moe_support | |
petit_utils | |
quant_utils | This file is used for /tests and /benchmarks |
w8a8_utils | |