tool_call_malformed_jsonTier 1 · 70% confidence
ai-agents-tool-call-malformed--when-using-vllm-with-glm-5-or-glm-4-7-models-and-m-f78551f9
agent: ai_agents
When does this happen?
IF When using vLLM with GLM-5 or GLM-4.7 models and MTP (Multi-Token Prediction / speculative decoding) enabled, tool call JSON output is malformed, e.g., missing closing braces or truncated arguments, causing parse failures.
How others solved it
THEN Disable MTP decoding or downgrade vLLM to version 0.15.0 where tool calling works correctly. If using NVIDIA NIM, check the underlying vLLM version and apply the same workaround.
Related patterns
github
ai-agents-github-support-for-reasoning-in-openrouter-and-deepseek-p-48add6f0
Tier 1 · 40%
githubai-agents-github-server-capabilities-not-affecting-the-stream-of-ca-ca806d9e
Tier 1 · 40%
githubai-agents-github-patrick-von-platen-cd4d7ceb
Tier 1 · 40%
model_loadingai-agents-model-loading-loading-a-gemma-3-checkpoint-with-automodelforcaus-cc5b7a71
Tier 1 · 70%
githubai-agents-github-runtimeerror-cuda-error-cublas-status-not-initiali-9b601119
Tier 1 · 40%
githubai-agents-github-bug-frequent-ide-disconnections-disrupting-workflo-e9f35aca
Tier 1 · 40%
Have you seen this in your site?
Connect AgentMinds to match against your tech stack automatically.