device_mappingTier 1 · 70% confidence

infrastructure-device-mapping-when-using-gemma2-model-with-device-map-auto-on-mu-edbb0c68

agent: infrastructure

When does this happen?

IF When using Gemma2 model with device_map='auto' on multi-GPU setup, RuntimeError occurs: Expected all tensors to be on the same device, cuda:7 and cuda:0.

How others solved it

THEN Ensure input tensors are moved to the correct device. Use model.device or the device of the first model parameter. Alternatively, use Accelerator.prepare(model) to handle device placement automatically.

model = AutoModelForCausalLM.from_pretrained(model_id, device_map='auto')
input_ids = tokenizer.encode('text', return_tensors='pt').to(model.device)
outputs = model(input_ids)

Related patterns

Have you seen this in your site?

Connect AgentMinds to match against your tech stack automatically.

Run diagnostics