runtime error
Exit code: 1. Reason:
| 608M/4.78G [00:01<00:07, 590MB/s]
model-00009-of-00010.safetensors: 43%|█████ | 2.08G/4.78G [00:02<00:02, 1.10GB/s]
model-00009-of-00010.safetensors: 74%|████████ | 3.52G/4.78G [00:03<00:01, 1.25GB/s]
model-00009-of-00010.safetensors: 100%|██████████| 4.78G/4.78G [00:03<00:00, 1.27GB/s]
Downloading shards: 90%|█████████ | 9/10 [00:39<00:04, 4.16s/it]
model-00010-of-00010.safetensors: 0%| | 0.00/3.90G [00:00<?, ?B/s]
model-00010-of-00010.safetensors: 12%|██ | 472M/3.90G [00:01<00:07, 471MB/s]
model-00010-of-00010.safetensors: 50%|█████ | 1.96G/3.90G [00:02<00:01, 1.06GB/s]
model-00010-of-00010.safetensors: 100%|██████████| 3.90G/3.90G [00:03<00:00, 1.29GB/s]
Downloading shards: 100%|██████████| 10/10 [00:42<00:00, 3.89s/it]
Downloading shards: 100%|██████████| 10/10 [00:42<00:00, 4.30s/it]
Loading checkpoint shards: 0%| | 0/10 [00:00<?, ?it/s]
Loading checkpoint shards: 50%|█████ | 5/10 [00:01<00:01, 4.21it/s]
Loading checkpoint shards: 100%|██████████| 10/10 [00:02<00:00, 4.40it/s]
Loading checkpoint shards: 100%|██████████| 10/10 [00:02<00:00, 4.37it/s]
generation_config.json: 0%| | 0.00/154 [00:00<?, ?B/s]
generation_config.json: 100%|██████████| 154/154 [00:00<00:00, 958kB/s]
Traceback (most recent call last):
  File "/home/user/app/app.py", line 203, in <module>
    model, tokenizer = initialize_model()
  File "/home/user/app/app.py", line 147, in initialize_model
    ).to("cuda")
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 3149, in to
    raise ValueError(
ValueError: `.to` is not supported for `8-bit` bitsandbytes models. Please use the model as it is, since the model has already been set to the correct devices and casted to the correct `dtype`.
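The traceback shows `.to("cuda")` being called on a model that was loaded in 8-bit through bitsandbytes; such a model is already placed on its devices at load time, so the extra move is rejected. The contents of `app.py` are not visible in this log, so the following is only a minimal sketch of one way to guard the call. The helper name `place_on_device` is hypothetical, and it assumes the model object exposes the `is_loaded_in_8bit` / `is_loaded_in_4bit` flags that transformers sets on quantized models.

```python
def place_on_device(model, device="cuda"):
    """Move `model` to `device` unless it was loaded as a bitsandbytes quantized model.

    Hypothetical helper, not part of transformers. It assumes quantized models
    carry an `is_loaded_in_8bit` or `is_loaded_in_4bit` attribute set to True.
    """
    quantized = getattr(model, "is_loaded_in_8bit", False) or getattr(
        model, "is_loaded_in_4bit", False
    )
    if quantized:
        # Already on the correct device(s) and dtype; calling .to() would raise.
        return model
    return model.to(device)


# Possible use inside initialize_model() (model_id is a placeholder, and the
# actual loading arguments in app.py are unknown):
#
#   model = AutoModelForCausalLM.from_pretrained(
#       model_id,
#       quantization_config=BitsAndBytesConfig(load_in_8bit=True),
#       device_map="auto",   # let the loader place the weights
#   )
#   model = place_on_device(model)   # no-op for the 8-bit model
```

If the model is always loaded in 8-bit, simply deleting the trailing `.to("cuda")` at line 147 of `app.py` should have the same effect; the guard is mainly useful when the same code path also handles unquantized models.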