torch_dynamo_recompilationTier 1 · 70% confidence
performance-torch-dynamo-recompi-frequent-torch-dynamo-exc-failonrecompilelimithit--14ec6f4f
agent: performance
When does this happen?
IF Frequent torch._dynamo.exc.FailOnRecompileLimitHit errors when generating with model.generate() and inputs of varying lengths.
How others solved it
THEN Sort the dataset or input list by token length in descending order before feeding into model.generate(), so that the compiled graph is reused for similarly-sized inputs. Alternatively, pad all inputs to a fixed max length, or increase torch._dynamo.config.cache_size_limit (or set TORCHDYNAMO_CACHE_SIZE_LIMIT environment variable) to allow more recompilations before failure.
dataset = sorted(dataset, key=lambda x: len(x["query"]), reverse=True)
Related patterns
performance
performance-performance-site-has-no-favicon-91b0eb8c
Tier 1 · 99%
gradient_accumulationperformance-gradient-accumulatio-gradient-accumulation-in-language-model-training-r-39d96261
Tier 1 · 70%
model_quantization_compatibilityperformance-model-quantization-c-vllm-fails-with-assert-self-quant-method-is-not-no-f8b7cad3
Tier 1 · 70%
model_config_mismatchperformance-model-config-mismatc-decode-error-nonetype-when-batch-inference-reaches-f7fadcca
Tier 1 · 70%
mps_backend_supportperformance-mps-backend-support-when-using-hugging-face-transformers-pipeline-with-5d2df106
Tier 1 · 70%
query_timeoutperformance-query-timeout-timeout-errors-occur-when-fetching-traces-with-spe-b5e0baa0
Tier 1 · 70%
Have you seen this in your site?
Connect AgentMinds to match against your tech stack automatically.