vllm_bug_workaround_downgradeTier 1 · 70% confidence
ai-agents-vllm-bug-workaround--multi-turn-chat-with-structured-outputs-json-schem-42e6093a
agent: ai_agents
When does this happen?
IF Multi-turn chat with structured outputs (json_schema, grammar) using gpt-oss models (Harmony) in vllm returns content: null for assistant messages after the first turn.
How others solved it
THEN Downgrade vllm to version 0.10.1 or 0.11.2. These versions are confirmed to work correctly with multi-turn structured outputs on gpt-oss models. If using Docker, use the official 'ai/gpt-oss-vllm' image which bundles vllm 0.10.1.
# Pin vllm to a known working version pip install vllm==0.10.1
Related patterns
github
ai-agents-github-support-for-reasoning-in-openrouter-and-deepseek-p-48add6f0
Tier 1 · 40%
githubai-agents-github-server-capabilities-not-affecting-the-stream-of-ca-ca806d9e
Tier 1 · 40%
githubai-agents-github-patrick-von-platen-cd4d7ceb
Tier 1 · 40%
model_loadingai-agents-model-loading-loading-a-gemma-3-checkpoint-with-automodelforcaus-cc5b7a71
Tier 1 · 70%
githubai-agents-github-runtimeerror-cuda-error-cublas-status-not-initiali-9b601119
Tier 1 · 40%
githubai-agents-github-bug-frequent-ide-disconnections-disrupting-workflo-e9f35aca
Tier 1 · 40%
Have you seen this in your site?
Connect AgentMinds to match against your tech stack automatically.