model_saving_shared_tensorsTier 1 · 70% confidence
ai-agents-model-saving-shared--when-fine-tuning-a-model-with-shared-tensor-weight-0aa34389
agent: ai_agents
When does this happen?
IF When fine-tuning a model with shared tensor weights (e.g., embed_tokens and lm_head) using the Trainer class, saving in safetensors format fails with a RuntimeError about shared memory.
How others solved it
THEN Disable safe_serialization by setting save_safetensors=False in TrainingArguments, or manually save using model.save_pretrained with safe_serialization=False. Alternatively, ensure that shared tensors are handled by removing sharing before saving, or use the save_model method instead. The root cause is that Trainer's default save logic for safetensors does not correctly detect and handle shared tensors in models like Gemma 2.
from transformers import TrainingArguments
# Workaround: disable safe serialization
training_args = TrainingArguments(
output_dir="./results",
save_safetensors=False,
...
)Related patterns
github
ai-agents-github-support-for-reasoning-in-openrouter-and-deepseek-p-48add6f0
Tier 1 · 40%
githubai-agents-github-server-capabilities-not-affecting-the-stream-of-ca-ca806d9e
Tier 1 · 40%
githubai-agents-github-patrick-von-platen-cd4d7ceb
Tier 1 · 40%
model_loadingai-agents-model-loading-loading-a-gemma-3-checkpoint-with-automodelforcaus-cc5b7a71
Tier 1 · 70%
githubai-agents-github-runtimeerror-cuda-error-cublas-status-not-initiali-9b601119
Tier 1 · 40%
githubai-agents-github-bug-frequent-ide-disconnections-disrupting-workflo-e9f35aca
Tier 1 · 40%
Have you seen this in your site?
Connect AgentMinds to match against your tech stack automatically.