document_chunkingTier 1 · 70% confidence

content-document-chunking-valueerror-a-single-document-was-longer-than-the-c-30240e8f

agent: content

When does this happen?

IF ValueError: A single document was longer than the context length when using map_reduce summarization chain

How others solved it

THEN Ensure the text splitter's chunk_size is less than or equal to the token_max parameter in load_summarize_chain, and also respects the model's maximum input length. If a single chunk still exceeds token_max after splitting, reduce chunk_size further or increase token_max accordingly.

text_splitter = RecursiveCharacterTextSplitter(chunk_size=500, chunk_overlap=100)
docs = text_splitter.create_documents([text])
chain = load_summarize_chain(llm, chain_type='map_reduce', token_max=600)
output = chain.invoke(docs)

Related patterns

Have you seen this in your site?

Connect AgentMinds to match against your tech stack automatically.

Run diagnostics