vllm.distributed.kv_transfer.kv_connector.v1.offloading.common
OffloadingWorkerMetadata dataclass
Bases: KVConnectorWorkerMetadata
Worker -> Scheduler metadata for completed transfer jobs.
Each worker reports {job_id: 1} for each newly completed transfer job (load or store). aggregate() sums these counts across workers within a step. The scheduler accumulates the counts across steps and processes a transfer completion only when a job's count reaches num_workers.
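The counting scheme described above can be illustrated with a minimal sketch. It is not the vLLM implementation: `WorkerReport`, `CompletionTracker`, and their fields are hypothetical names chosen for illustration, and only the behavior stated in the docstring (per-worker `{job_id: 1}` reports, summation within a step, accumulation across steps until `num_workers` is reached) is modeled.

```python
from collections import Counter
from dataclasses import dataclass, field


@dataclass
class WorkerReport:
    """Hypothetical stand-in for the per-worker metadata: {job_id: count}."""

    completed_jobs: dict = field(default_factory=dict)

    def aggregate(self, other: "WorkerReport") -> "WorkerReport":
        # Sum the per-job counts of two reports from the same step.
        merged = Counter(self.completed_jobs)
        merged.update(other.completed_jobs)
        return WorkerReport(dict(merged))


class CompletionTracker:
    """Hypothetical scheduler-side accumulator that spans multiple steps."""

    def __init__(self, num_workers: int) -> None:
        self.num_workers = num_workers
        self.counts = Counter()

    def update(self, report: WorkerReport) -> list:
        # Add this step's counts; return job IDs that every worker has reported.
        finished = []
        for job_id, n in report.completed_jobs.items():
            self.counts[job_id] += n
            if self.counts[job_id] >= self.num_workers:
                del self.counts[job_id]
                finished.append(job_id)
        return finished


# Two workers: job 7 finishes on both in step 1, job 8 only on one of them.
tracker = CompletionTracker(num_workers=2)
step1 = WorkerReport({7: 1}).aggregate(WorkerReport({7: 1, 8: 1}))
done_step1 = tracker.update(step1)
# The second worker finishes job 8 in a later step.
done_step2 = tracker.update(WorkerReport({8: 1}))
```

In this sketch a job that finishes on different workers in different steps is still reported as complete exactly once, in the step where the last worker's count arrives.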
Source code in vllm/distributed/kv_transfer/kv_connector/v1/offloading/common.py
TransferJob dataclass
A transfer job that bundles request context with a transfer spec.
Used for both loads and stores, keyed by scheduler-assigned job ID. The worker reports the job ID back when the transfer finishes, and the scheduler processes the completion.
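As a rough sketch of the job-ID round trip described above, the following keeps pending jobs in a dictionary keyed by an ID assigned at submission time. The names `SketchTransferJob` and `JobRegistry`, and the fields `req_id`, `is_store`, and `transfer_spec`, are assumptions made for illustration; the actual fields of `TransferJob` are not listed in this section.

```python
from dataclasses import dataclass
from itertools import count


@dataclass
class SketchTransferJob:
    # Hypothetical fields; the real TransferJob may differ.
    req_id: str            # request context
    is_store: bool         # True for a store, False for a load
    transfer_spec: object  # opaque description of what to transfer


class JobRegistry:
    """Hypothetical scheduler-side table of in-flight jobs, keyed by job ID."""

    def __init__(self) -> None:
        self._next_id = count()
        self._jobs = {}

    def submit(self, job: SketchTransferJob) -> int:
        # Assign a fresh ID and remember the job until a worker reports it back.
        job_id = next(self._next_id)
        self._jobs[job_id] = job
        return job_id

    def complete(self, job_id: int) -> SketchTransferJob:
        # Called once the reported job ID is considered finished.
        return self._jobs.pop(job_id)


registry = JobRegistry()
load_id = registry.submit(SketchTransferJob("req-0", False, None))
store_id = registry.submit(SketchTransferJob("req-1", True, None))
finished = registry.complete(store_id)
```

Keying by job ID rather than by request lets the same table hold both loads and stores, and lets a single request have more than one transfer in flight.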