aparajitha commited on
Commit
7423e07
·
verified ·
1 Parent(s): 12c054b

aparajitha/mbart-sci-ms-tr-hi

Browse files
Files changed (5) hide show
  1. README.md +13 -14
  2. config.json +2 -2
  3. generation_config.json +1 -1
  4. model.safetensors +1 -1
  5. training_args.bin +2 -2
README.md CHANGED
@@ -1,7 +1,6 @@
1
  ---
2
- library_name: transformers
3
  license: mit
4
- base_model: aparajitha/mbart-sci-ms-tr
5
  tags:
6
  - generated_from_trainer
7
  model-index:
@@ -14,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  # mbart-sci-ms-tr-hi
16
 
17
- This model is a fine-tuned version of [aparajitha/mbart-sci-ms-tr](https://huggingface.co/aparajitha/mbart-sci-ms-tr) on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 2.3624
20
 
21
  ## Model description
22
 
@@ -45,19 +44,19 @@ The following hyperparameters were used during training:
45
 
46
  ### Training results
47
 
48
- | Training Loss | Epoch | Step | Validation Loss |
49
- |:-------------:|:-----:|:----:|:---------------:|
50
- | 2.4977 | 1.0 | 1092 | 2.3430 |
51
- | 2.099 | 2.0 | 2184 | 2.2412 |
52
- | 1.8193 | 3.0 | 3276 | 2.2304 |
53
- | 1.5883 | 4.0 | 4368 | 2.2644 |
54
- | 1.4088 | 5.0 | 5460 | 2.3011 |
55
- | 1.2952 | 6.0 | 6552 | 2.3624 |
56
 
57
 
58
  ### Framework versions
59
 
60
- - Transformers 4.44.2
61
- - Pytorch 2.4.1+cu121
62
  - Datasets 2.19.1
63
  - Tokenizers 0.19.1
 
1
  ---
 
2
  license: mit
3
+ base_model: aparajitha/mbart-ft-sci-ms-en
4
  tags:
5
  - generated_from_trainer
6
  model-index:
 
13
 
14
  # mbart-sci-ms-tr-hi
15
 
16
+ This model is a fine-tuned version of [aparajitha/mbart-ft-sci-ms-en](https://huggingface.co/aparajitha/mbart-ft-sci-ms-en) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 2.7499
19
 
20
  ## Model description
21
 
 
44
 
45
  ### Training results
46
 
47
+ | Training Loss | Epoch | Step | Validation Loss |
48
+ |:-------------:|:-----:|:-----:|:---------------:|
49
+ | 2.3763 | 1.0 | 2904 | 2.5883 |
50
+ | 2.1843 | 2.0 | 5808 | 2.5540 |
51
+ | 1.9488 | 3.0 | 8712 | 2.5917 |
52
+ | 1.7875 | 4.0 | 11616 | 2.6334 |
53
+ | 1.6368 | 5.0 | 14520 | 2.7048 |
54
+ | 1.5575 | 6.0 | 17424 | 2.7499 |
55
 
56
 
57
  ### Framework versions
58
 
59
+ - Transformers 4.40.2
60
+ - Pytorch 2.4.1.post300
61
  - Datasets 2.19.1
62
  - Tokenizers 0.19.1
config.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "_name_or_path": "aparajitha/mbart-sci-ms-tr",
3
  "_num_labels": 3,
4
  "activation_dropout": 0.0,
5
  "activation_function": "gelu",
@@ -52,7 +52,7 @@
52
  "static_position_embeddings": false,
53
  "tokenizer_class": "MBart50Tokenizer",
54
  "torch_dtype": "float32",
55
- "transformers_version": "4.44.2",
56
  "use_cache": true,
57
  "vocab_size": 250054
58
  }
 
1
  {
2
+ "_name_or_path": "aparajitha/mbart-ft-sci-ms-en",
3
  "_num_labels": 3,
4
  "activation_dropout": 0.0,
5
  "activation_function": "gelu",
 
52
  "static_position_embeddings": false,
53
  "tokenizer_class": "MBart50Tokenizer",
54
  "torch_dtype": "float32",
55
+ "transformers_version": "4.40.2",
56
  "use_cache": true,
57
  "vocab_size": 250054
58
  }
generation_config.json CHANGED
@@ -8,5 +8,5 @@
8
  "max_length": 200,
9
  "num_beams": 5,
10
  "pad_token_id": 1,
11
- "transformers_version": "4.44.2"
12
  }
 
8
  "max_length": 200,
9
  "num_beams": 5,
10
  "pad_token_id": 1,
11
+ "transformers_version": "4.40.2"
12
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:916fc78af513f8b3ea1b7832ac0caadd373deb63fd4cf8469ae737007b691d1b
3
  size 2444578688
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:15dc355095de5bc0531d8a5c51921327a035218d6ceacbb1ae2e192520484a9c
3
  size 2444578688
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0a160051551f2ba37a0854b3693db6fe4e173a278be8b6941230c67ade1132f9
3
- size 5304
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:de4f3769d6de6c92d29d8133d95694e9e21e47e7c2a0e4feab6c5b3f6b73d3f5
3
+ size 5112