esm2_t12_35M_UR50D-Trainerfinetuned-symmetric
This model is a fine-tuned version of facebook/esm2_t12_35M_UR50D on the None dataset. It achieves the following results on the evaluation set:
- Loss: 0.5783
- Accuracy: 0.8368
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 16
- eval_batch_size: 16
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 5
Training results
Training Loss | Epoch | Step | Validation Loss | Accuracy |
---|---|---|---|---|
1.9941 | 0.25 | 177 | 1.4609 | 0.5718 |
1.342 | 0.5 | 354 | 1.2594 | 0.6223 |
1.1903 | 0.75 | 531 | 1.1485 | 0.6560 |
1.0986 | 1.0 | 708 | 1.0412 | 0.6857 |
1.0033 | 1.25 | 885 | 0.9809 | 0.7023 |
0.9365 | 1.5 | 1062 | 0.9276 | 0.7220 |
0.8967 | 1.75 | 1239 | 0.8836 | 0.7372 |
0.8428 | 2.0 | 1416 | 0.8275 | 0.7555 |
0.79 | 2.25 | 1593 | 0.7990 | 0.7654 |
0.7402 | 2.5 | 1770 | 0.7581 | 0.7777 |
0.7339 | 2.75 | 1947 | 0.7265 | 0.7890 |
0.6676 | 3.0 | 2124 | 0.7044 | 0.7946 |
0.6457 | 3.25 | 2301 | 0.6792 | 0.8024 |
0.6408 | 3.5 | 2478 | 0.6551 | 0.8105 |
0.6183 | 3.75 | 2655 | 0.6440 | 0.8155 |
0.6202 | 4.0 | 2832 | 0.6335 | 0.8169 |
0.5827 | 4.25 | 3009 | 0.6223 | 0.8221 |
0.5866 | 4.5 | 3186 | 0.6135 | 0.8241 |
0.5749 | 4.75 | 3363 | 0.6026 | 0.8297 |
0.5775 | 5.0 | 3540 | 0.6012 | 0.8276 |
Framework versions
- Transformers 4.41.2
- Pytorch 2.3.0+cu121
- Datasets 2.20.0
- Tokenizers 0.19.1
- Downloads last month
- 95
Model tree for vrhoward/esm2_t12_35M_UR50D-Trainerfinetuned-symmetric
Base model
facebook/esm2_t12_35M_UR50D