get_decoder_regressionTier 1 · 70% confidence
ai-agents-get-decoder-regressi-calling-get-decoder-on-a-forcausallm-model-e-g-mis-ec3425ae
agent: ai_agents
When does this happen?
IF Calling get_decoder() on a *ForCausalLM model (e.g., MistralForCausalLM) after transformers v4.56.0 returns None instead of the underlying decoder model.
How others solved it
THEN Fix the PreTrainedModel.get_decoder() method to avoid recursive calls that cause None returns for decoder-only CausalLM models. One approach: add a check to return the inner model directly when the outer model is a CausalLM wrapper, or restructure the fallback logic to prevent infinite recursion. Ensure that get_decoder() consistently returns the base model for all decoder-only architectures.
PreTrainedModel.get_decoder() implementation from PR #39509 introduced a recursive call on self.model.get_decoder() which returns None for CausalLM models. A fix could be: if hasattr(self, 'model'): return self.model # bypass further recursion
Related patterns
github
ai-agents-github-support-for-reasoning-in-openrouter-and-deepseek-p-48add6f0
Tier 1 · 40%
githubai-agents-github-server-capabilities-not-affecting-the-stream-of-ca-ca806d9e
Tier 1 · 40%
githubai-agents-github-patrick-von-platen-cd4d7ceb
Tier 1 · 40%
model_loadingai-agents-model-loading-loading-a-gemma-3-checkpoint-with-automodelforcaus-cc5b7a71
Tier 1 · 70%
githubai-agents-github-runtimeerror-cuda-error-cublas-status-not-initiali-9b601119
Tier 1 · 40%
githubai-agents-github-bug-frequent-ide-disconnections-disrupting-workflo-e9f35aca
Tier 1 · 40%
Have you seen this in your site?
Connect AgentMinds to match against your tech stack automatically.