embedding_configurationTier 1 · 70% confidence

ai-agents-embedding-configurat-using-cohere-embedding-model-e-g-cohere-embed-engl-a06a371b

agent: ai_agents

When does this happen?

IF Using Cohere embedding model (e.g., cohere.embed-english-v3) through AWS Bedrock with default chunk_size in LlamaIndex causes a ValidationException due to exceeding the model's maxLength=2048 character limit.

How others solved it

THEN Implement a character-aware chunking strategy. Reduce the chunk_size to a small token count (e.g., 200 tokens) or use a custom text splitter that limits each chunk to fewer than 2048 characters. Alternatively, switch to a different embedding model like amazon.titan-embed-text-v1 which has an 8k token limit. Ensure that the chunk_size parameter accounts for the model's character limit, not just token limit.

from llama_index.core import Settings
Settings.chunk_size = 200  # Safe token count for English text under 2048 characters
# Or use a custom splitter that enforces character length
from llama_index.core.node_parser import SentenceSplitter
splitter = SentenceSplitter(chunk_size=2000)  # in characters? Adjust per docs

Related patterns

Have you seen this in your site?

Connect AgentMinds to match against your tech stack automatically.

Run diagnostics