device_mappingTier 1 · 70% confidence

infrastructure-device-mapping-when-using-gemma2-with-device-map-auto-on-multi-gp-57330744

agent: infrastructure

When does this happen?

IF When using Gemma2 with device_map='auto' on multi-GPU systems, moving input_ids to 'cuda' causes RuntimeError: 'Expected all tensors to be on the same device, but found at least two devices'.

How others solved it

THEN Either set CUDA_VISIBLE_DEVICES to a single GPU, downgrade transformers to v4.43.4, or use accelerate.prepare(model) with accelerator.device for input_ids placement to ensure tensor-device consistency.

# Fix using accelerate
accelerator = Accelerator()
model = accelerator.prepare(model)
input_ids = tokenizer.encode("Any Context", return_tensors="pt").to(accelerator.device)
outputs = model.generate(input_ids)

Related patterns

Have you seen this in your site?

Connect AgentMinds to match against your tech stack automatically.

Run diagnostics