aparajitha/mbart-sci-ms-tr-hi

Files changed (5) hide show

README.md CHANGED Viewed

@@ -1,7 +1,6 @@
 ---
-library_name: transformers
 license: mit
-base_model: aparajitha/mbart-sci-ms-tr
 tags:
 - generated_from_trainer
 model-index:
@@ -14,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 # mbart-sci-ms-tr-hi
-This model is a fine-tuned version of [aparajitha/mbart-sci-ms-tr](https://huggingface.co/aparajitha/mbart-sci-ms-tr) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.3624
 ## Model description
@@ -45,19 +44,19 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 2.4977        | 1.0   | 1092 | 2.3430          |
-| 2.099         | 2.0   | 2184 | 2.2412          |
-| 1.8193        | 3.0   | 3276 | 2.2304          |
-| 1.5883        | 4.0   | 4368 | 2.2644          |
-| 1.4088        | 5.0   | 5460 | 2.3011          |
-| 1.2952        | 6.0   | 6552 | 2.3624          |
 ### Framework versions
-- Transformers 4.44.2
-- Pytorch 2.4.1+cu121
 - Datasets 2.19.1
 - Tokenizers 0.19.1

 ---
 license: mit
+base_model: aparajitha/mbart-ft-sci-ms-en
 tags:
 - generated_from_trainer
 model-index:
 # mbart-sci-ms-tr-hi
+This model is a fine-tuned version of [aparajitha/mbart-ft-sci-ms-en](https://huggingface.co/aparajitha/mbart-ft-sci-ms-en) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.7499
 ## Model description
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss |
+|:-------------:|:-----:|:-----:|:---------------:|
+| 2.3763        | 1.0   | 2904  | 2.5883          |
+| 2.1843        | 2.0   | 5808  | 2.5540          |
+| 1.9488        | 3.0   | 8712  | 2.5917          |
+| 1.7875        | 4.0   | 11616 | 2.6334          |
+| 1.6368        | 5.0   | 14520 | 2.7048          |
+| 1.5575        | 6.0   | 17424 | 2.7499          |
 ### Framework versions
+- Transformers 4.40.2
+- Pytorch 2.4.1.post300
 - Datasets 2.19.1
 - Tokenizers 0.19.1

config.json CHANGED Viewed

@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "aparajitha/mbart-sci-ms-tr",
   "_num_labels": 3,
   "activation_dropout": 0.0,
   "activation_function": "gelu",
@@ -52,7 +52,7 @@
   "static_position_embeddings": false,
   "tokenizer_class": "MBart50Tokenizer",
   "torch_dtype": "float32",
-  "transformers_version": "4.44.2",
   "use_cache": true,
   "vocab_size": 250054
 }

 {
+  "_name_or_path": "aparajitha/mbart-ft-sci-ms-en",
   "_num_labels": 3,
   "activation_dropout": 0.0,
   "activation_function": "gelu",
   "static_position_embeddings": false,
   "tokenizer_class": "MBart50Tokenizer",
   "torch_dtype": "float32",
+  "transformers_version": "4.40.2",
   "use_cache": true,
   "vocab_size": 250054
 }

generation_config.json CHANGED Viewed

@@ -8,5 +8,5 @@
   "max_length": 200,
   "num_beams": 5,
   "pad_token_id": 1,
-  "transformers_version": "4.44.2"
 }

   "max_length": 200,
   "num_beams": 5,
   "pad_token_id": 1,
+  "transformers_version": "4.40.2"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:916fc78af513f8b3ea1b7832ac0caadd373deb63fd4cf8469ae737007b691d1b
 size 2444578688

 version https://git-lfs.github.com/spec/v1
+oid sha256:15dc355095de5bc0531d8a5c51921327a035218d6ceacbb1ae2e192520484a9c
 size 2444578688

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0a160051551f2ba37a0854b3693db6fe4e173a278be8b6941230c67ade1132f9
-size 5304

 version https://git-lfs.github.com/spec/v1
+oid sha256:de4f3769d6de6c92d29d8133d95694e9e21e47e7c2a0e4feab6c5b3f6b73d3f5
+size 5112