Update README.md
README.md CHANGED

@@ -7,6 +7,6 @@ library_name: transformers
 
 This model is a ChemBERTa model trained on the augmented_canonical_pubchem_13m dataset.
 
-The model was trained for
+The model was trained for 24 epochs using NVIDIA Apex's FusedAdam optimizer with a reduce-on-plateau learning rate scheduler.
 To improve performance, mixed precision (fp16), TF32, and torch.compile were enabled. Training used gradient accumulation (16 steps) and batch size of 128 for efficient resource utilization.
 Evaluation was performed at regular intervals, with the best model selected based on validation performance.
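The two mechanics the README cites, gradient accumulation (16 steps) and a reduce-on-plateau scheduler, can be sketched in plain PyTorch. This is a minimal illustration, not the model card's actual training script: the tiny linear model and random data are placeholders, standard `AdamW` stands in for Apex's `FusedAdam`, and fp16/TF32/`torch.compile` are omitted for portability.

```python
import torch

# Placeholder model and loss; the real run trains a ChemBERTa model.
model = torch.nn.Linear(8, 1)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-3)  # FusedAdam stand-in
# Reduce-on-plateau: halve the LR when the validation metric stops improving.
scheduler = torch.optim.lr_scheduler.ReduceLROnPlateau(
    optimizer, mode="min", factor=0.5, patience=2
)
loss_fn = torch.nn.MSELoss()

accumulation_steps = 16  # effective batch = 16 x per-step batch of 128
for step in range(32):
    x, y = torch.randn(128, 8), torch.randn(128, 1)
    # Scale the loss so accumulated gradients average over the effective batch.
    loss = loss_fn(model(x), y) / accumulation_steps
    loss.backward()
    if (step + 1) % accumulation_steps == 0:
        optimizer.step()
        optimizer.zero_grad()

# The scheduler steps on a validation metric, not every optimizer step.
val_loss = loss_fn(model(torch.randn(128, 8)), torch.randn(128, 1))
scheduler.step(val_loss.item())
```

With `patience=2`, the scheduler only lowers the LR after the validation loss fails to improve for two consecutive evaluations, which matches the "evaluation at regular intervals" cadence the README describes.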