Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
prosecalign
/
clm7b0129-kendall-onof-ofif-corr-max-2-simpo-max1000-decay-sft-beta1.5-gamma0.5-lr5e-6
like
0
Follow
Proactive Security Alignment
3
Transformers
Safetensors
Generated from Trainer
llama-factory
Inference Endpoints
arxiv:
2305.18290
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
08c9e17
clm7b0129-kendall-onof-ofif-corr-max-2-simpo-max1000-decay-sft-beta1.5-gamma0.5-lr5e-6
/
trainer_log.jsonl
Commit History
Training in progress, step 250
08c9e17
verified
ziansu
commited on
20 days ago
Training in progress, step 200
775fd15
verified
ziansu
commited on
20 days ago
Training in progress, step 150
8039c23
verified
ziansu
commited on
20 days ago
Training in progress, step 100
a3f2788
verified
ziansu
commited on
20 days ago
Training in progress, step 50
f678ae2
verified
ziansu
commited on
20 days ago