Make sure hidden state and wte weights are on same device when in parallel model. 28721e3 muelletm commited on May 27, 2023
Rename pytorch_model.bin.index (2).json to pytorch_model.bin.index.json 090223a cekal commited on May 8, 2023