vllm.v1.attention.backends.linear_attn
LinearAttentionBackend
Bases: AttentionBackend
Source code in vllm/v1/attention/backends/linear_attn.py
LinearAttentionMetadata dataclass
LinearAttentionMetadataBuilder
Bases: AttentionMetadataBuilder[LinearAttentionMetadata]
__init__
__init__(
    kv_cache_spec: AttentionSpec,
    layer_names: list[str],
    vllm_config: VllmConfig,
    device: device,
)
build
build(
    common_prefix_len: int,
    common_attn_metadata: CommonAttentionMetadata,
    fast_build: bool = False,
) -> LinearAttentionMetadata
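The builder pattern implied by the build signature can be illustrated with a minimal, self-contained sketch. Everything here is a stand-in: ToyCommonMetadata, ToyLinearMetadata, ToyMetadataBuilder, and all of their fields are hypothetical names chosen for illustration, and the rule that a request with one query token counts as a decode is an assumption, not something this page confirms about vLLM. Only the parameter names of build mirror the signature above.

```python
from dataclasses import dataclass


@dataclass
class ToyCommonMetadata:
    # Hypothetical stand-in for CommonAttentionMetadata; fields are illustrative.
    query_start_loc: list[int]  # cumulative query offsets, length num_reqs + 1
    seq_lens: list[int]         # total sequence length per request


@dataclass
class ToyLinearMetadata:
    # Hypothetical stand-in for LinearAttentionMetadata.
    num_prefills: int
    num_decodes: int
    query_start_loc: list[int]
    seq_lens: list[int]


class ToyMetadataBuilder:
    """Mirrors only the shape of the documented build() signature."""

    def build(
        self,
        common_prefix_len: int,
        common_attn_metadata: ToyCommonMetadata,
        fast_build: bool = False,
    ) -> ToyLinearMetadata:
        starts = common_attn_metadata.query_start_loc
        # Per-request query length is the difference of adjacent offsets.
        query_lens = [end - start for start, end in zip(starts, starts[1:])]
        # Assumption: a request with exactly one query token is a decode.
        num_decodes = sum(1 for n in query_lens if n == 1)
        return ToyLinearMetadata(
            num_prefills=len(query_lens) - num_decodes,
            num_decodes=num_decodes,
            query_start_loc=list(starts),
            seq_lens=list(common_attn_metadata.seq_lens),
        )


builder = ToyMetadataBuilder()
common = ToyCommonMetadata(query_start_loc=[0, 1, 2, 7], seq_lens=[10, 12, 5])
meta = builder.build(common_prefix_len=0, common_attn_metadata=common)
```

In this sketch common_prefix_len and fast_build are accepted but unused; they are kept only so the call shape matches the documented signature.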