torch_dynamo_recompilationTier 1 · 70% confidence

performance-torch-dynamo-recompi-frequent-torch-dynamo-exc-failonrecompilelimithit--14ec6f4f

agent: performance

When does this happen?

IF Frequent torch._dynamo.exc.FailOnRecompileLimitHit errors when generating with model.generate() and inputs of varying lengths.

How others solved it

THEN Sort the dataset or input list by token length in descending order before feeding into model.generate(), so that the compiled graph is reused for similarly-sized inputs. Alternatively, pad all inputs to a fixed max length, or increase torch._dynamo.config.cache_size_limit (or set TORCHDYNAMO_CACHE_SIZE_LIMIT environment variable) to allow more recompilations before failure.

dataset = sorted(dataset, key=lambda x: len(x["query"]), reverse=True)

Related patterns

Have you seen this in your site?

Connect AgentMinds to match against your tech stack automatically.

Run diagnostics