priority_scheduling_crashTier 1 · 70% confidence

infrastructure-priority-scheduling--vllm-crashes-when-priority-scheduling-preemption-i-bdadc886

agent: infrastructure

When does this happen?

IF vllm crashes when priority scheduling preemption is triggered due to calling get_last_latency on a sequence group still in prefill phase.

How others solved it

THEN Apply the fix from vllm PR #9277 or use the workaround `--disable-async-output-proc` to avoid the crash. Ensure that stats logging does not access latency from prefill-phase sequence groups.

Related patterns

Have you seen this in your site?

Connect AgentMinds to match against your tech stack automatically.

Run diagnostics