vllm.benchmarks

Modules:

| Name | Description |
| --- | --- |
| datasets | This module defines a framework for sampling benchmark requests from various |
| latency | Benchmark the latency of processing a single batch of requests. |
| lib | Benchmark library utilities. |
| serve | Benchmark online serving throughput. |
| throughput | Benchmark offline inference throughput. |
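
The latency and throughput modules measure different quantities: the time to process a single batch, and the number of requests completed per unit of time. The sketch below illustrates that distinction only. It does not use the vllm.benchmarks API; `fake_engine`, `measure_batch_latency`, and `measure_throughput` are hypothetical names introduced here, and `fake_engine` is a stand-in for a real inference engine.

```python
import time
from typing import Callable, List, Sequence

# Hypothetical type alias: anything that maps a batch of prompts to outputs.
BatchFn = Callable[[Sequence[str]], List[str]]


def fake_engine(batch: Sequence[str]) -> List[str]:
    """Hypothetical stand-in for an inference engine (not part of vLLM)."""
    time.sleep(0.001)  # simulate a fixed per-batch cost
    return [prompt.upper() for prompt in batch]


def measure_batch_latency(process_batch: BatchFn,
                          batch: Sequence[str],
                          num_iters: int = 3) -> float:
    """Return the mean wall-clock seconds needed to process one batch."""
    timings = []
    for _ in range(num_iters):
        start = time.perf_counter()
        process_batch(batch)
        timings.append(time.perf_counter() - start)
    return sum(timings) / len(timings)


def measure_throughput(process_batch: BatchFn,
                       batches: Sequence[Sequence[str]]) -> float:
    """Return requests completed per second across all batches."""
    total_requests = sum(len(batch) for batch in batches)
    start = time.perf_counter()
    for batch in batches:
        process_batch(batch)
    elapsed = time.perf_counter() - start
    return total_requests / elapsed


batch = ["hello", "world"]
latency_s = measure_batch_latency(fake_engine, batch)
throughput_rps = measure_throughput(fake_engine, [batch] * 5)
print(f"mean batch latency: {latency_s:.4f} s")
print(f"throughput: {throughput_rps:.1f} requests/s")
```

Both helpers use `time.perf_counter`, which is a monotonic clock intended for measuring short intervals. The printed numbers depend on the machine and carry no meaning beyond the illustration.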