vllm.v1.sample ¶
Modules:
| Name | Description |
|---|---|
logits_processor | |
ops | |
rejection_sampler | |
sampler | A layer that samples the next tokens from the model's outputs. |
thinking_budget_state | Per-batch thinking token budget state; applied after penalties at sample time. |