text_splitting
Tier 1 · 70% confidence
content-text-splitting-charactertextsplitter-with-chunk-size-and-chunk-ov-b9d7c459
agent: content
When does this happen?
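IF CharacterTextSplitter is configured with chunk_size and chunk_overlap but the output chunks do not respect those sizes. It splits only on a single separator (by default a blank line, "\n\n"), so any stretch of text without that separator is emitted as one chunk even when it is longer than chunk_size. Users expect correctly sized chunks but get differently sized ones.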
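The failure mode can be reproduced with a simplified pure-Python model. This is an illustrative sketch, not LangChain's actual implementation: `split_on_separator` is a hypothetical name, and it assumes the splitter cuts only at one separator and then greedily merges pieces up to chunk_size.

```python
def split_on_separator(text: str, separator: str, chunk_size: int) -> list[str]:
    """Split only at `separator`, then greedily merge pieces up to chunk_size."""
    chunks: list[str] = []
    current = ""
    for piece in text.split(separator):
        candidate = piece if not current else current + separator + piece
        if len(candidate) <= chunk_size:
            current = candidate
        else:
            if current:
                chunks.append(current)
            # A piece longer than chunk_size is kept whole: this model has
            # no finer separator to fall back on.
            current = piece
    if current:
        chunks.append(current)
    return chunks


# 50 characters with no blank line inside, then a short paragraph.
text = "a" * 50 + "\n\n" + "short"
chunks = split_on_separator(text, "\n\n", chunk_size=30)
print([len(c) for c in chunks])  # the first chunk exceeds chunk_size=30
```

The first chunk keeps all 50 characters because there is no blank line inside it to split on, which matches the symptom described above.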
How others solved it
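THEN Replace CharacterTextSplitter with RecursiveCharacterTextSplitter, which tries a list of progressively finer separators (by default paragraph breaks, line breaks, spaces, then individual characters) until each chunk fits within chunk_size, and applies chunk_overlap between neighboring chunks. Chunks come out at most chunk_size long, not exactly chunk_size. If you must use CharacterTextSplitter, remove the chunk_size and chunk_overlap parameters to avoid misleading configuration.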
from langchain.text_splitter import RecursiveCharacterTextSplitter

# docs is assumed to be a list of Document objects loaded earlier
splitter = RecursiveCharacterTextSplitter(chunk_size=30, chunk_overlap=10)
chunks = splitter.split_documents(docs)
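To make the fallback idea concrete, here is a minimal pure-Python sketch of recursive separator fallback. It is an assumption-laden illustration rather than the library's code: `recursive_split` is a hypothetical helper, chunk_overlap is omitted for brevity, and length is measured in characters with `len`.

```python
def recursive_split(text: str, separators: list[str], chunk_size: int) -> list[str]:
    """Split on the first separator; retry oversized pieces with finer ones."""
    if len(text) <= chunk_size:
        return [text] if text else []
    if not separators or separators[0] == "":
        # Last resort: hard character slices of at most chunk_size.
        return [text[i:i + chunk_size] for i in range(0, len(text), chunk_size)]
    sep, finer = separators[0], separators[1:]
    chunks: list[str] = []
    current = ""
    for piece in text.split(sep):
        candidate = piece if not current else current + sep + piece
        if len(candidate) <= chunk_size:
            current = candidate
            continue
        if current:
            chunks.append(current)
            current = ""
        if len(piece) <= chunk_size:
            current = piece
        else:
            # Still too long: recurse with the next, finer separator.
            chunks.extend(recursive_split(piece, finer, chunk_size))
    if current:
        chunks.append(current)
    return chunks


text = "a" * 50 + "\n\n" + "several short words in one line"
chunks = recursive_split(text, ["\n\n", "\n", " ", ""], chunk_size=30)
# Sanity check worth running on real splitter output as well.
assert all(len(c) <= 30 for c in chunks)
```

The same final check applies to real output: after `split_documents`, compare `len(doc.page_content)` against chunk_size for each returned document to confirm the configuration is actually honored.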