YAML Metadata
Warning:
empty or missing yaml metadata in repo card
(https://huggingface.co/docs/hub/model-cards#model-card-metadata)
Checkpoint Upload
This model checkpoint was automatically uploaded from a distributed training run.
Model Details
- Training step: 21
- Architecture: Llama-style model
- Hidden size: 2048
- Layers: 36
- Vocabulary size: 151,936
Checkpoint Information
- Originally saved as distributed checkpoint across 4 ranks
- Consolidated into single checkpoint for easier use
- Contains model weights, optimizer states, and training configuration
Usage
import torch
# Load the checkpoint
checkpoint = torch.load('pytorch_model.bin', map_location='cpu')
# The checkpoint contains the model state dict
# You'll need to initialize the appropriate model architecture
# and load these weights
Note
This is a raw training checkpoint. For inference, you may need to:
- Initialize the correct model architecture
- Load the weights properly
- Convert to the desired format (e.g., Hugging Face Transformers format)
- Downloads last month
- 24
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support