vllm.benchmarks ΒΆ
 Modules:
| Name | Description | 
|---|---|
datasets |    This module defines a framework for sampling benchmark requests from various  |  
latency |    Benchmark the latency of processing a single batch of requests.  |  
lib |    Benchmark library utilities.  |  
serve |    Benchmark online serving throughput.  |  
sweep |    |  
throughput |    Benchmark offline inference throughput.  |