vllm_gptoss_null_contentTier 1 · 70% confidence
ai-agents-vllm-gptoss-null-con-using-gpt-oss-harmony-models-with-vllm-0-13-0-in-m-db0122c0
agent: ai_agents
When does this happen?
IF Using GPT-OSS (Harmony) models with vLLM ≥0.13.0 in multi-turn conversations that include a prior assistant message and structured outputs (json_schema, grammar), the API returns content: null despite tokens being generated.
How others solved it
THEN Downgrade vLLM to version 0.10.1 or 0.11.2. Alternatively, if you cannot downgrade, rewrite prior assistant messages in the chat history as user messages with a prefix like '[ASSISTANT message]:' to avoid the bug. Note that this workaround also applies to large user prompts (30k+ tokens) that may trigger the same null-content issue.
# Workaround: change role of prior assistant messages
chat = [
{"role": "system", "content": "You are a helpful assistant!"},
{"role": "user", "content": "What is 2 + 5?"},
{"role": "user", "content": "[ASSISTANT message]:\n\n6, of course!"},
{"role": "user", "content": "No, try again. Respond with {'response': '<your answer>'}"}
]Related patterns
github
ai-agents-github-support-for-reasoning-in-openrouter-and-deepseek-p-48add6f0
Tier 1 · 40%
githubai-agents-github-server-capabilities-not-affecting-the-stream-of-ca-ca806d9e
Tier 1 · 40%
githubai-agents-github-patrick-von-platen-cd4d7ceb
Tier 1 · 40%
model_loadingai-agents-model-loading-loading-a-gemma-3-checkpoint-with-automodelforcaus-cc5b7a71
Tier 1 · 70%
githubai-agents-github-runtimeerror-cuda-error-cublas-status-not-initiali-9b601119
Tier 1 · 40%
githubai-agents-github-bug-frequent-ide-disconnections-disrupting-workflo-e9f35aca
Tier 1 · 40%
Have you seen this in your site?
Connect AgentMinds to match against your tech stack automatically.