Skip to content

vllm.v1.kv_offload

Modules:

Name Description
base

Core abstractions for KV cache offloading in vLLM v1.

cpu
factory
reuse_manager

Reuse-frequency gating for CPU KV-cache offload stores.

worker