Calling the GLM model hits a bug in modeling_glm.py: attention_mask is initialized without a device #186

luo-li-ba-suo · 2023-07-13T04:16:15Z

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! (when checking argument for argument index in method wrapper_CUDA__index_select)

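The cause is this line in the GLMModel class:
if attention_mask is None: attention_mask = torch.zeros(batch_size)
The default attention_mask is never moved to the correct device: torch.zeros without a device argument allocates on the default device (normally the CPU), while the rest of the model's tensors are on cuda:0.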
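A minimal sketch of one possible fix, assuming the default mask should live on the same device as `input_ids`. The helper name `default_attention_mask` is hypothetical and is not part of modeling_glm.py; it only isolates the one-line change (passing `device=` to `torch.zeros`).

```python
import torch


def default_attention_mask(input_ids, attention_mask=None):
    """Hypothetical helper mirroring the GLMModel fallback branch.

    When no mask is given, build the default one on the same device as
    input_ids instead of on the default (CPU) device.
    """
    if attention_mask is None:
        batch_size = input_ids.size(0)
        # device= is the only change relative to the line quoted above
        attention_mask = torch.zeros(batch_size, device=input_ids.device)
    return attention_mask


# Runs on the GPU when one is available, otherwise on the CPU.
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")
input_ids = torch.ones(2, 8, dtype=torch.long, device=device)
mask = default_attention_mask(input_ids)
```

With this change the mask follows `input_ids`, so later indexing operations no longer mix a CPU tensor with CUDA tensors.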


luo-li-ba-suo · 2023-07-13T04:31:37Z

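Hmm, it looks like the error actually comes from something else.
Leaving this particular issue unfixed seems to be harmless.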

luo-li-ba-suo closed this as completed Jul 13, 2023

luo-li-ba-suo reopened this Jul 13, 2023
