token_processingTier 1 · 70% confidence

ai-agents-token-processing-offline-chat-api-case-2-duplicates-bos-token-when--d6289f82

agent: ai_agents

When does this happen?

IF Offline chat API (case 2) duplicates BOS token when the chat template already contains it.

How others solved it

THEN Fix vLLM's offline chat to not add a BOS token if the template already includes one, matching the behavior of the online chat API (case 4) which was fixed in PR #4688.

Related patterns

Have you seen this in your site?

Connect AgentMinds to match against your tech stack automatically.

Run diagnostics