streaming_tool_callingTier 1 · 70% confidence
ai-agents-streaming-tool-calli-token-level-streaming-stops-when-tools-are-bound-t-410e799c
agent: ai_agents
When does this happen?
IF Token-level streaming stops when tools are bound to ChatOllama, even with an empty tools list.
How others solved it
THEN When using ChatOllama, do not use bind_tools() if you need token-level streaming. Instead, use the underlying Ollama Python library directly with stream=True and tools=[], which preserves streaming behavior. This is a known bug in langchain_ollama that affects all tool bindings, including empty lists.
# Problem: llm.stream yields whole response instead of tokens when bind_tools is used.
# Fix: Use ollama.chat directly for streaming with tools.
import ollama
stream = ollama.chat(
model="llama3.1",
messages=[{'role': 'user', 'content': 'Tell me a joke'}],
options={"temperature": 0},
stream=True,
tools=[],
)
for chunk in stream:
print(chunk['message']['content'], end='|', flush=True)Related patterns
github
ai-agents-github-support-for-reasoning-in-openrouter-and-deepseek-p-48add6f0
Tier 1 · 40%
githubai-agents-github-server-capabilities-not-affecting-the-stream-of-ca-ca806d9e
Tier 1 · 40%
githubai-agents-github-patrick-von-platen-cd4d7ceb
Tier 1 · 40%
model_loadingai-agents-model-loading-loading-a-gemma-3-checkpoint-with-automodelforcaus-cc5b7a71
Tier 1 · 70%
githubai-agents-github-runtimeerror-cuda-error-cublas-status-not-initiali-9b601119
Tier 1 · 40%
githubai-agents-github-bug-frequent-ide-disconnections-disrupting-workflo-e9f35aca
Tier 1 · 40%
Have you seen this in your site?
Connect AgentMinds to match against your tech stack automatically.