Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
prosecalign
/
clm7b0129-kendall-onof-ofif-corr-max-2-simpo-max1500-decay-sft-beta1.5-gamma0.5-lr5e-6
like
0
Follow
Proactive Security Alignment
3
Transformers
Safetensors
Generated from Trainer
llama-factory
Inference Endpoints
arxiv:
2305.18290
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
clm7b0129-kendall-onof-ofif-corr-max-2-simpo-max1500-decay-sft-beta1.5-gamma0.5-lr5e-6
/
checkpoint-300
/
rng_state_7.pth
Commit History
Training in progress, step 300, checkpoint
4b5d90e
verified
ziansu
commited on
1 day ago