bigstupidhats
/

Llama-3.2-1B-Instruct-sft_metamath

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

Llama-3.2-1B-Instruct-sft_metamath / last-checkpoint /global_step200

1 contributor

History: 1 commit

minghaowu's picture

Training in progress, step 200, checkpoint

0aaab9b verified about 2 months ago

bf16_zero_pp_rank_0_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "torch.FloatStorage"
How to fix it?
7.41 GB
LFS

Training in progress, step 200, checkpoint about 2 months ago
bf16_zero_pp_rank_1_mp_rank_00_optim_states.pt
Detected Pickle imports (6)
- "deepspeed.runtime.fp16.loss_scaler.LossScaler",
- "deepspeed.utils.tensor_fragment.fragment_address",
- "deepspeed.runtime.zero.config.ZeroStageEnum",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "torch.FloatStorage"
How to fix it?
7.41 GB
LFS

Training in progress, step 200, checkpoint about 2 months ago
mp_rank_00_model_states.pt
Detected Pickle imports (5)
- "torch.BFloat16Storage",
- "torch.Size",
- "torch._utils._rebuild_tensor_v2",
- "collections.OrderedDict",
- "__builtin__.set"
How to fix it?
2.47 GB
LFS

Training in progress, step 200, checkpoint about 2 months ago