vllm.model_executor.layers.quantization.kernels.mixed_precision.MPLinearKernel
MPLinearKernel
Bases: ABC
Source code in vllm/model_executor/layers/quantization/kernels/mixed_precision/MPLinearKernel.py
__init__
__init__(
c: MPLinearLayerConfig,
w_q_param_name: str,
w_s_param_name: str,
w_zp_param_name: Optional[str] = None,
w_gidx_param_name: Optional[str] = None,
) -> None
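The constructor takes a layer config plus the names of the layer attributes that hold the quantized weight, its scales, and (optionally) zero points and a group index. The sketch below only illustrates that name-based pattern. `ToyConfig`, `ToyKernel`, the stored attribute names, and `get_weight_params` are hypothetical stand-ins written for this example, not vLLM's actual implementation.

```python
from dataclasses import dataclass
from types import SimpleNamespace
from typing import Any, Optional


@dataclass
class ToyConfig:
    # Hypothetical stand-in for MPLinearLayerConfig; these fields are assumptions.
    group_size: int
    zero_points: bool


class ToyKernel:
    # Mirrors the documented __init__ signature; attribute names are assumptions.
    def __init__(
        self,
        c: ToyConfig,
        w_q_param_name: str,
        w_s_param_name: str,
        w_zp_param_name: Optional[str] = None,
        w_gidx_param_name: Optional[str] = None,
    ) -> None:
        self.config = c
        self.w_q_name = w_q_param_name
        self.w_s_name = w_s_param_name
        self.w_zp_name = w_zp_param_name
        self.w_gidx_name = w_gidx_param_name

    def get_weight_params(self, layer: Any) -> tuple:
        # Look up each parameter on the layer by its stored name;
        # optional names that were not supplied resolve to None.
        def lookup(name: Optional[str]) -> Any:
            return getattr(layer, name) if name is not None else None

        return (
            lookup(self.w_q_name),
            lookup(self.w_s_name),
            lookup(self.w_zp_name),
            lookup(self.w_gidx_name),
        )


# A plain namespace stands in for a torch.nn.Module with named parameters.
layer = SimpleNamespace(qweight=[1, 2], scales=[0.5])
kernel = ToyKernel(ToyConfig(group_size=128, zero_points=False), "qweight", "scales")
w_q, w_s, w_zp, w_gidx = kernel.get_weight_params(layer)
```

Passing names rather than tensors lets one kernel class serve layers that register their parameters under different attribute names.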
_get_weight_params
_transform_param
apply_weights abstractmethod
can_implement abstractmethod classmethod
can_implement(
c: MPLinearLayerConfig,
) -> tuple[bool, Optional[str]]
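Because `can_implement` is a classmethod returning `tuple[bool, Optional[str]]`, a caller can ask each kernel class whether it supports a config before instantiating anything, and collect the failure reason when it does not. The following is a minimal sketch of that pattern under stated assumptions: `ToyConfig`, `ToyKernel`, `GroupedKernel`, `FallbackKernel`, and `choose_kernel` are hypothetical names invented for illustration, and the group-size rule is made up.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import Optional


@dataclass
class ToyConfig:
    # Hypothetical stand-in for MPLinearLayerConfig; the field is an assumption.
    group_size: int


class ToyKernel(ABC):
    @classmethod
    @abstractmethod
    def can_implement(cls, c: ToyConfig) -> tuple[bool, Optional[str]]:
        """Return (True, None) if supported, else (False, reason)."""


class GroupedKernel(ToyKernel):
    @classmethod
    def can_implement(cls, c: ToyConfig) -> tuple[bool, Optional[str]]:
        # Made-up constraint, purely for illustration.
        if c.group_size not in (64, 128):
            return False, f"group_size {c.group_size} not supported"
        return True, None


class FallbackKernel(ToyKernel):
    @classmethod
    def can_implement(cls, c: ToyConfig) -> tuple[bool, Optional[str]]:
        return True, None


def choose_kernel(c: ToyConfig, candidates: list) -> tuple:
    # Return the first candidate that supports the config, together with
    # the reasons given by the candidates that were rejected before it.
    reasons = []
    for kernel_cls in candidates:
        ok, reason = kernel_cls.can_implement(c)
        if ok:
            return kernel_cls, reasons
        reasons.append(f"{kernel_cls.__name__}: {reason}")
    raise ValueError("no kernel can implement this config: " + "; ".join(reasons))


candidates = [GroupedKernel, FallbackKernel]
chosen_a, reasons_a = choose_kernel(ToyConfig(group_size=128), candidates)
chosen_b, reasons_b = choose_kernel(ToyConfig(group_size=32), candidates)
```

Returning a reason string alongside the boolean keeps the capability check side-effect free while still giving the caller something useful to log or raise.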