Cernophil commited on
Commit
f456be2
·
verified ·
1 Parent(s): 1c820d3

End of training

Browse files
README.md ADDED
@@ -0,0 +1,62 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ base_model: Helsinki-NLP/opus-mt-en-ro
4
+ tags:
5
+ - generated_from_trainer
6
+ metrics:
7
+ - bleu
8
+ model-index:
9
+ - name: opus-mt-en-ro-finetuned-en-to-ro
10
+ results: []
11
+ ---
12
+
13
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
+ should probably proofread and complete it, then remove this comment. -->
15
+
16
+ # opus-mt-en-ro-finetuned-en-to-ro
17
+
18
+ This model is a fine-tuned version of [Helsinki-NLP/opus-mt-en-ro](https://huggingface.co/Helsinki-NLP/opus-mt-en-ro) on an unknown dataset.
19
+ It achieves the following results on the evaluation set:
20
+ - Loss: 1.2457
21
+ - Bleu: 28.3282
22
+ - Gen Len: 33.9535
23
+
24
+ ## Model description
25
+
26
+ More information needed
27
+
28
+ ## Intended uses & limitations
29
+
30
+ More information needed
31
+
32
+ ## Training and evaluation data
33
+
34
+ More information needed
35
+
36
+ ## Training procedure
37
+
38
+ ### Training hyperparameters
39
+
40
+ The following hyperparameters were used during training:
41
+ - learning_rate: 2e-05
42
+ - train_batch_size: 128
43
+ - eval_batch_size: 128
44
+ - seed: 42
45
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
+ - lr_scheduler_type: linear
47
+ - num_epochs: 1
48
+ - mixed_precision_training: Native AMP
49
+
50
+ ### Training results
51
+
52
+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
53
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
54
+ | 0.7553 | 1.0 | 4769 | 1.2457 | 28.3282 | 33.9535 |
55
+
56
+
57
+ ### Framework versions
58
+
59
+ - Transformers 4.43.4
60
+ - Pytorch 2.3.0+cu121
61
+ - Datasets 2.20.0
62
+ - Tokenizers 0.19.1
emissions.csv ADDED
@@ -0,0 +1,2 @@
 
 
 
1
+ timestamp,project_name,run_id,duration,emissions,emissions_rate,cpu_power,gpu_power,ram_power,cpu_energy,gpu_energy,ram_energy,energy_consumed,country_name,country_iso_code,region,cloud_provider,cloud_region,os,python_version,codecarbon_version,cpu_count,cpu_model,gpu_count,gpu_model,longitude,latitude,ram_total_size,tracking_mode,on_cloud,pue
2
+ 2024-08-06T14:11:53,codecarbon,1f15ae92-3ddc-4a23-9a5a-4ad235604025,2646.9103710651398,0.3032049614015296,0.0001145505207565896,112.5,323.629,377.89435958862305,0.08269974178820845,0.496412659532951,0.2766324864531485,0.8557448877743082,Germany,DEU,north rhine-westphalia,,,Linux-5.15.0-113-generic-x86_64-with-glibc2.29,3.8.10,2.2.3,128,AMD EPYC 7542 32-Core Processor,8,8 x NVIDIA A100-PCIE-40GB,6.7958,51.0015,1007.7182922363281,machine,N,1.0
generation_config.json ADDED
@@ -0,0 +1,16 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bad_words_ids": [
3
+ [
4
+ 59542
5
+ ]
6
+ ],
7
+ "bos_token_id": 0,
8
+ "decoder_start_token_id": 59542,
9
+ "eos_token_id": 0,
10
+ "forced_eos_token_id": 0,
11
+ "max_length": 512,
12
+ "num_beams": 4,
13
+ "pad_token_id": 59542,
14
+ "renormalize_logits": true,
15
+ "transformers_version": "4.43.4"
16
+ }
runs/Aug06_13-27-36_ki-jupyternotebook-8cb7/events.out.tfevents.1722943664.ki-jupyternotebook-8cb7.171148.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:74207c56ebae5dc7dcec6a64d89e29a508ce116227b251f4d352674a14406bdb
3
- size 7786
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f6ec5ed490fe62a3ef39f6bf255bbc35176a6463fefaadb439e8258b2a1495a8
3
+ size 8510