runtime error

Exit code: 1. Reason:
| 0.00/2.20G [00:00<?, ?B/s]
model-00002-of-00002.safetensors: 1%| | 21.0M/2.20G [00:01<01:56, 18.7MB/s]
model-00002-of-00002.safetensors: 3%|▎ | 62.9M/2.20G [00:02<01:10, 30.4MB/s]
model-00002-of-00002.safetensors: 49%|████▊ | 1.07G/2.20G [00:03<00:02, 411MB/s]
model-00002-of-00002.safetensors: 99%|█████████▊| 2.17G/2.20G [00:04<00:00, 658MB/s]
model-00002-of-00002.safetensors: 100%|█████████▉| 2.20G/2.20G [00:09<00:00, 240MB/s]
Downloading shards: 100%|██████████| 2/2 [00:20<00:00, 10.22s/it]
Downloading shards: 100%|██████████| 2/2 [00:20<00:00, 10.42s/it]
Sliding Window Attention is enabled but not implemented for `sdpa`; unexpected results may be encountered.
Loading checkpoint shards: 0%| | 0/2 [00:00<?, ?it/s]
Loading checkpoint shards: 100%|██████████| 2/2 [00:00<00:00, 47393.27it/s]
generation_config.json: 0%| | 0.00/242 [00:00<?, ?B/s]
generation_config.json: 100%|██████████| 242/242 [00:00<00:00, 1.85MB/s]
Traceback (most recent call last):
  File "/home/user/app/app.py", line 564, in <module>
    chat_model_state = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype="auto", device_map="auto")
  File "/usr/local/lib/python3.10/site-packages/transformers/models/auto/auto_factory.py", line 564, in from_pretrained
    return model_class.from_pretrained(
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 262, in _wrapper
    return func(*args, **kwargs)
  File "/usr/local/lib/python3.10/site-packages/transformers/modeling_utils.py", line 4397, in from_pretrained
    dispatch_model(model, **device_map_kwargs)
  File "/usr/local/lib/python3.10/site-packages/accelerate/big_modeling.py", line 498, in dispatch_model
    raise ValueError(
ValueError: You are trying to offload the whole model to the disk. Please use the `disk_offload` function instead.
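
The traceback shows `dispatch_model` rejecting the device map that `device_map="auto"` produced. One plausible cause, not confirmed by the log, is that no GPU was visible at startup and automatic placement assigned every module to disk. A minimal sketch of one possible workaround is below. It assumes the model fits in CPU RAM when no GPU is present. `choose_load_kwargs` is a hypothetical helper, not part of `transformers` or `accelerate`.

```python
from typing import Any, Dict


def choose_load_kwargs(has_cuda: bool) -> Dict[str, Any]:
    """Build keyword arguments for from_pretrained (hypothetical helper)."""
    kwargs: Dict[str, Any] = {"torch_dtype": "auto"}
    if has_cuda:
        # A GPU is visible: keep the automatic placement used in app.py.
        kwargs["device_map"] = "auto"
    else:
        # Assumption: the model fits in CPU RAM, so pin everything to the CPU
        # rather than letting automatic placement fall through to disk.
        kwargs["device_map"] = "cpu"
    return kwargs


if __name__ == "__main__":
    # Show the arguments that would be used on a CPU-only machine.
    print(choose_load_kwargs(has_cuda=False))
```

In `app.py` the result could be passed as `AutoModelForCausalLM.from_pretrained(model_name, **choose_load_kwargs(torch.cuda.is_available()))`. If the model does not fit in CPU RAM either, this approach would fail with an out-of-memory error, and the `disk_offload` function named in the error message would be the route to investigate.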
