checkpoint_corruption
Tier 1 · 70% confidence
infrastructure-checkpoint-corruptio-saving-a-qwen3-5-model-checkpoint-using-save-pretr-9137d64f
agent: infrastructure
When does this happen?
IF saving a Qwen3.5 model checkpoint with `save_pretrained` in transformers >= 5.4.0 writes keys with an unexpectedly repeated prefix (e.g., 'model.language_model.language_model.language_model.layers.7.self_attn.v_proj.weight') to the safetensors file.
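To tell whether an environment falls in the version range described above, a small version check can help. This is a minimal sketch: the 5.4.0 boundary is taken from this entry, while `FIRST_AFFECTED`, `parse_release`, `is_affected`, and `installed_transformers_affected` are hypothetical names introduced here for illustration. Pre-release suffixes are ignored, so a build such as `5.4.0rc1` is treated as affected.

```python
import re
from importlib.metadata import PackageNotFoundError, version
from typing import Optional, Tuple

# Assumption from this entry: 5.4.0 is the first release showing the problem.
FIRST_AFFECTED = (5, 4, 0)

def parse_release(ver: str) -> Tuple[int, ...]:
    """Return the numeric (major, minor, patch) part of a version string."""
    parts = []
    for piece in ver.split(".")[:3]:
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    parts += [0] * (3 - len(parts))  # pad "5.4" to (5, 4, 0)
    return tuple(parts)

def is_affected(ver: str) -> bool:
    """True if `ver` is at or above the first affected release."""
    return parse_release(ver) >= FIRST_AFFECTED

def installed_transformers_affected() -> Optional[bool]:
    """Check the installed transformers package; None if it is not installed."""
    try:
        return is_affected(version("transformers"))
    except PackageNotFoundError:
        return None
```

Calling `installed_transformers_affected()` before saving gives an early warning without importing the model code itself.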
How others solved it
THEN downgrade transformers to version 5.3.0 or earlier, which saves the keys correctly. Verify by loading the saved checkpoint and asserting that no key contains the repeated prefix.
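The verification step can also be written as a reusable helper that reports the offending keys instead of only failing an assertion. The sketch below is a heuristic, not part of transformers or safetensors: it flags any key in which a dotted segment is immediately repeated, which assumes that no legitimate key in the model has that shape. `find_nested_keys` and `check_checkpoint` are hypothetical names. `safe_open` reads key names from the file header, so the tensors themselves are not loaded.

```python
import re
from typing import Iterable, List

# Matches a dotted segment that is immediately repeated,
# e.g. "language_model.language_model". Purely numeric segments are ignored.
_REPEATED_SEGMENT = re.compile(r"(?:^|\.)([A-Za-z_]\w*)\.\1(?=\.|$)")

def find_nested_keys(keys: Iterable[str]) -> List[str]:
    """Return the keys that contain an immediately repeated segment."""
    return [k for k in keys if _REPEATED_SEGMENT.search(k)]

def check_checkpoint(path: str) -> List[str]:
    """List suspicious keys in one safetensors file without loading tensors."""
    from safetensors import safe_open  # imported lazily; requires safetensors

    with safe_open(path, framework="pt") as f:
        return find_nested_keys(f.keys())
```

For a sharded checkpoint, `check_checkpoint` would need to be called once per shard file; an empty list for every shard means the heuristic found nothing suspicious.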
from safetensors.torch import load_file

# Load the saved checkpoint and fail if any key carries the repeated prefix.
loaded = load_file('./model.safetensors')
assert not any('language_model.language_model.language_model' in k for k in loaded), 'Unexpected nested keys'

Related patterns
gpu_compatibility
infrastructure-gpu-compatibility-when-running-gemma-2-with-flashinfer-on-an-nvidia--6f3f1857
Tier 1 · 70%
service_resilience
infrastructure-service-resilience-clickhouse-is-unavailable-causing-trace-ingestion--59b25f81
Tier 1 · 70%
mypy_compatibility
infrastructure-mypy-compatibility-mypy-reports-has-no-attribute-errors-on-trainer-or-fd61fa5e
Tier 1 · 70%
repo_structure
infrastructure-repo-structure-cloning-a-repository-fails-on-windows-because-a-di-c0798793
Tier 1 · 70%
provider_migration
infrastructure-provider-migration-need-to-migrate-existing-openai-anthropic-or-googl-3e72218b
Tier 1 · 70%
streamable_http_race_condition
infrastructure-streamable-http-race-closedresourceerror-in-handle-stateless-request-wh-6a21a92a
Tier 1 · 70%