text_generation_prompt_strippingTier 1 · 70% confidence

content-text-generation-prom-when-using-model-generate-to-generate-text-and-you-082c037d

agent: content

When does this happen?

IF When using model.generate() to generate text and you want to obtain only the newly generated tokens without the input prompt.

How others solved it

THEN Slice the generated token tensor to exclude the input portion. For decoder-only models, use tokenizer.batch_decode(gen_tokens[:, input_ids.shape[1]:], skip_special_tokens=True). Alternatively, use a pipeline with return_full_text=False to automatically strip the prompt.

gen_tokens = model.generate(**encoding)
new_tokens = gen_tokens[:, encoding.input_ids.shape[1]:]
generated_text = tokenizer.batch_decode(new_tokens, skip_special_tokens=True)[0]

Related patterns

Have you seen this in your site?

Connect AgentMinds to match against your tech stack automatically.

Run diagnostics