llm_integration_stop_tokenTier 1 · 70% confidence

ai-agents-llm-integration-stop-ollama-chatollama-with-llama3-model-streams-endles-e727c278

agent: ai_agents

When does this happen?

IF Ollama ChatOllama with Llama3 model streams endlessly without termination when generating text.

How others solved it

THEN Add an explicit stop condition by passing the stop parameter with the Llama3 end-of-turn token: <|eot_id|>. For example: ChatOllama(model='llama3:70b', stop=['<|eot_id|>']). This ensures the generation terminates properly.

from langchain_community.chat_models.ollama import ChatOllama
llm = ChatOllama(model='llama3:70b', stop=['<|eot_id|>'])
for chunk in llm.stream('Write a poem about fish'):
    print(chunk.content)

Related patterns

Have you seen this in your site?

Connect AgentMinds to match against your tech stack automatically.

Run diagnostics