belisards commited on
Commit
2891a9b
·
verified ·
1 Parent(s): 6b1a016

kairos-posicao-bert-custom

Browse files
Files changed (32) hide show
  1. README.md +17 -21
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
  4. wandb/debug-internal.log +41 -42
  5. wandb/debug.log +72 -72
  6. wandb/run-20241211_042633-wah4vux1/files/config.yaml +494 -0
  7. wandb/run-20241211_042633-wah4vux1/files/output.log +0 -0
  8. wandb/run-20241211_042633-wah4vux1/files/requirements.txt +578 -0
  9. wandb/run-20241211_042633-wah4vux1/files/wandb-metadata.json +38 -0
  10. wandb/run-20241211_042633-wah4vux1/files/wandb-summary.json +1 -0
  11. wandb/run-20241211_042633-wah4vux1/logs/debug-core.log +14 -0
  12. wandb/run-20241211_042633-wah4vux1/logs/debug-internal.log +29 -0
  13. wandb/run-20241211_042633-wah4vux1/logs/debug.log +48 -0
  14. wandb/run-20241211_042633-wah4vux1/run-wah4vux1.wandb +0 -0
  15. wandb/run-20241211_042903-vt1hbdpt/files/config.yaml +494 -0
  16. wandb/run-20241211_042903-vt1hbdpt/files/output.log +0 -0
  17. wandb/run-20241211_042903-vt1hbdpt/files/requirements.txt +578 -0
  18. wandb/run-20241211_042903-vt1hbdpt/files/wandb-metadata.json +38 -0
  19. wandb/run-20241211_042903-vt1hbdpt/files/wandb-summary.json +1 -0
  20. wandb/run-20241211_042903-vt1hbdpt/logs/debug-core.log +14 -0
  21. wandb/run-20241211_042903-vt1hbdpt/logs/debug-internal.log +25 -0
  22. wandb/run-20241211_042903-vt1hbdpt/logs/debug.log +42 -0
  23. wandb/run-20241211_042903-vt1hbdpt/run-vt1hbdpt.wandb +0 -0
  24. wandb/run-20241211_043031-pfhjmc9v/files/config.yaml +568 -0
  25. wandb/run-20241211_043031-pfhjmc9v/files/output.log +8 -0
  26. wandb/run-20241211_043031-pfhjmc9v/files/requirements.txt +578 -0
  27. wandb/run-20241211_043031-pfhjmc9v/files/wandb-metadata.json +38 -0
  28. wandb/run-20241211_043031-pfhjmc9v/files/wandb-summary.json +1 -0
  29. wandb/run-20241211_043031-pfhjmc9v/logs/debug-core.log +9 -0
  30. wandb/run-20241211_043031-pfhjmc9v/logs/debug-internal.log +41 -0
  31. wandb/run-20241211_043031-pfhjmc9v/logs/debug.log +73 -0
  32. wandb/run-20241211_043031-pfhjmc9v/run-pfhjmc9v.wandb +0 -0
README.md CHANGED
@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->
21
 
22
  This model is a fine-tuned version of [belisards/congretimbau](https://huggingface.co/belisards/congretimbau) on the None dataset.
23
  It achieves the following results on the evaluation set:
24
- - Loss: 0.1176
25
- - Accuracy: 0.8027
26
- - F1: 0.7358
27
- - Recall: 0.7544
28
- - Precision: 0.7236
29
 
30
  ## Model description
31
 
@@ -45,31 +45,27 @@ More information needed
45
 
46
  The following hyperparameters were used during training:
47
  - learning_rate: 1e-05
48
- - train_batch_size: 64
49
- - eval_batch_size: 64
50
  - seed: 5151
51
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
52
  - lr_scheduler_type: linear
53
  - lr_scheduler_warmup_steps: 200
54
- - num_epochs: 15
55
 
56
  ### Training results
57
 
58
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Recall | Precision |
59
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:------:|:---------:|
60
- | 0.3405 | 1.0 | 18 | 0.2083 | 0.7232 | 0.4751 | 0.5104 | 0.5393 |
61
- | 0.1467 | 2.0 | 36 | 0.1258 | 0.4107 | 0.4105 | 0.5463 | 0.5486 |
62
- | 0.1198 | 3.0 | 54 | 0.1127 | 0.6607 | 0.5988 | 0.6140 | 0.5964 |
63
- | 0.107 | 4.0 | 72 | 0.0999 | 0.6696 | 0.6339 | 0.6762 | 0.6380 |
64
- | 0.0987 | 5.0 | 90 | 0.0943 | 0.6339 | 0.6113 | 0.6745 | 0.6339 |
65
- | 0.0911 | 6.0 | 108 | 0.0930 | 0.6875 | 0.6492 | 0.6882 | 0.6492 |
66
- | 0.078 | 7.0 | 126 | 0.0953 | 0.7321 | 0.6883 | 0.7183 | 0.6805 |
67
- | 0.0671 | 8.0 | 144 | 0.0934 | 0.7232 | 0.6850 | 0.7235 | 0.6798 |
68
- | 0.0534 | 9.0 | 162 | 0.1065 | 0.8036 | 0.7441 | 0.7441 | 0.7441 |
69
- | 0.0355 | 10.0 | 180 | 0.1363 | 0.8214 | 0.7724 | 0.7786 | 0.7670 |
70
- | 0.0263 | 11.0 | 198 | 0.1411 | 0.8214 | 0.7724 | 0.7786 | 0.7670 |
71
- | 0.013 | 12.0 | 216 | 0.2712 | 0.8214 | 0.7560 | 0.7449 | 0.7710 |
72
- | 0.0074 | 13.0 | 234 | 0.3198 | 0.7946 | 0.7294 | 0.7268 | 0.7321 |
73
 
74
 
75
  ### Framework versions
 
21
 
22
  This model is a fine-tuned version of [belisards/congretimbau](https://huggingface.co/belisards/congretimbau) on the None dataset.
23
  It achieves the following results on the evaluation set:
24
+ - Loss: 0.1076
25
+ - Accuracy: 0.8503
26
+ - F1: 0.7896
27
+ - Recall: 0.7959
28
+ - Precision: 0.7839
29
 
30
  ## Model description
31
 
 
45
 
46
  The following hyperparameters were used during training:
47
  - learning_rate: 1e-05
48
+ - train_batch_size: 32
49
+ - eval_batch_size: 32
50
  - seed: 5151
51
  - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
52
  - lr_scheduler_type: linear
53
  - lr_scheduler_warmup_steps: 200
54
+ - num_epochs: 18
55
 
56
  ### Training results
57
 
58
  | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1 | Recall | Precision |
59
  |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|:------:|:---------:|
60
+ | 0.1548 | 1.0 | 35 | 0.1456 | 0.7411 | 0.4571 | 0.5112 | 0.6227 |
61
+ | 0.1572 | 2.0 | 70 | 0.1354 | 0.7411 | 0.6588 | 0.6570 | 0.6607 |
62
+ | 0.1305 | 3.0 | 105 | 0.1212 | 0.7768 | 0.6402 | 0.6251 | 0.7194 |
63
+ | 0.1069 | 4.0 | 140 | 0.1155 | 0.8393 | 0.7857 | 0.7794 | 0.7930 |
64
+ | 0.0937 | 5.0 | 175 | 0.1216 | 0.8304 | 0.7764 | 0.7734 | 0.7798 |
65
+ | 0.0639 | 6.0 | 210 | 0.1257 | 0.8482 | 0.7899 | 0.7742 | 0.8125 |
66
+ | 0.0437 | 7.0 | 245 | 0.1610 | 0.8393 | 0.7614 | 0.7345 | 0.8195 |
67
+ | 0.0254 | 8.0 | 280 | 0.2101 | 0.8482 | 0.7842 | 0.7630 | 0.8197 |
68
+ | 0.0067 | 9.0 | 315 | 0.2555 | 0.8482 | 0.7899 | 0.7742 | 0.8125 |
 
 
 
 
69
 
70
 
71
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e6e2ba5dd0e166261ec945162d8e79d2e859733d2b3200f2c280ed9a43e2d802
3
  size 1337640872
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5e98475e3643c74cadf401ba22ec56f223c7f43b5dc3d62fb197cc7f68607494
3
  size 1337640872
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:6435f8462aca2daab46d187fd20638c82a75b3f148c4cada62a5aae5ab0a2e1b
3
  size 5304
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d52eab71080d2aed6a921ca9f734c67e7deaa4b336d62f88d5819be64e81acb2
3
  size 5304
wandb/debug-internal.log CHANGED
@@ -1,42 +1,41 @@
1
- {"time":"2024-12-11T03:56:14.972563192Z","level":"INFO","msg":"using version","core version":"0.19.0"}
2
- {"time":"2024-12-11T03:56:14.972584119Z","level":"INFO","msg":"created symlink","path":"/content/wandb/run-20241211_035614-4h85t3a9/logs/debug-core.log"}
3
- {"time":"2024-12-11T03:56:15.07514169Z","level":"INFO","msg":"created new stream","id":"4h85t3a9"}
4
- {"time":"2024-12-11T03:56:15.075175758Z","level":"INFO","msg":"stream: started","id":"4h85t3a9"}
5
- {"time":"2024-12-11T03:56:15.075212228Z","level":"INFO","msg":"writer: Do: started","stream_id":"4h85t3a9"}
6
- {"time":"2024-12-11T03:56:15.07523211Z","level":"INFO","msg":"sender: started","stream_id":"4h85t3a9"}
7
- {"time":"2024-12-11T03:56:15.075321233Z","level":"INFO","msg":"handler: started","stream_id":"4h85t3a9"}
8
- {"time":"2024-12-11T03:56:15.968097784Z","level":"INFO","msg":"Starting system monitor"}
9
- {"time":"2024-12-11T03:56:16.456638208Z","level":"INFO","msg":"Pausing system monitor"}
10
- {"time":"2024-12-11T03:56:16.456680859Z","level":"INFO","msg":"Resuming system monitor"}
11
- {"time":"2024-12-11T03:56:16.456686338Z","level":"INFO","msg":"Pausing system monitor"}
12
- {"time":"2024-12-11T03:56:16.456690298Z","level":"INFO","msg":"Resuming system monitor"}
13
- {"time":"2024-12-11T03:56:16.456694053Z","level":"INFO","msg":"Pausing system monitor"}
14
- {"time":"2024-12-11T03:56:16.456697689Z","level":"INFO","msg":"Resuming system monitor"}
15
- {"time":"2024-12-11T03:56:17.11108643Z","level":"INFO","msg":"Pausing system monitor"}
16
- {"time":"2024-12-11T03:56:17.151485643Z","level":"INFO","msg":"Resuming system monitor"}
17
- {"time":"2024-12-11T04:00:46.468742444Z","level":"INFO","msg":"api: retrying error","error":"Post \"https://api.wandb.ai/graphql\": read tcp 172.28.0.12:55104->35.186.228.49:443: read: connection reset by peer"}
18
- {"time":"2024-12-11T04:05:11.747524724Z","level":"INFO","msg":"Pausing system monitor"}
19
- {"time":"2024-12-11T04:05:11.752230979Z","level":"INFO","msg":"Resuming system monitor"}
20
- {"time":"2024-12-11T04:05:13.109043385Z","level":"INFO","msg":"Pausing system monitor"}
21
- {"time":"2024-12-11T04:05:13.11322247Z","level":"INFO","msg":"Resuming system monitor"}
22
- {"time":"2024-12-11T04:05:13.113974041Z","level":"INFO","msg":"Pausing system monitor"}
23
- {"time":"2024-12-11T04:05:13.119121659Z","level":"INFO","msg":"Resuming system monitor"}
24
- {"time":"2024-12-11T04:05:13.201633635Z","level":"INFO","msg":"Pausing system monitor"}
25
- {"time":"2024-12-11T04:05:13.206478279Z","level":"INFO","msg":"Resuming system monitor"}
26
- {"time":"2024-12-11T04:05:13.209051195Z","level":"INFO","msg":"Pausing system monitor"}
27
- {"time":"2024-12-11T04:05:13.214107299Z","level":"INFO","msg":"Resuming system monitor"}
28
- {"time":"2024-12-11T04:05:16.798316096Z","level":"INFO","msg":"Pausing system monitor"}
29
- {"time":"2024-12-11T04:05:16.838607253Z","level":"INFO","msg":"Resuming system monitor"}
30
- {"time":"2024-12-11T04:05:16.838637201Z","level":"INFO","msg":"Pausing system monitor"}
31
- {"time":"2024-12-11T04:05:16.83864231Z","level":"INFO","msg":"Resuming system monitor"}
32
- {"time":"2024-12-11T04:05:16.838647523Z","level":"INFO","msg":"Pausing system monitor"}
33
- {"time":"2024-12-11T04:05:16.838651293Z","level":"INFO","msg":"Resuming system monitor"}
34
- {"time":"2024-12-11T04:05:16.838673832Z","level":"INFO","msg":"Stopping system monitor"}
35
- {"time":"2024-12-11T04:05:16.839335764Z","level":"INFO","msg":"Stopped system monitor"}
36
- {"time":"2024-12-11T04:05:17.084769004Z","level":"INFO","msg":"fileTransfer: Close: file transfer manager closed"}
37
- {"time":"2024-12-11T04:05:17.148988564Z","level":"INFO","msg":"handler: operation stats","stats":{}}
38
- {"time":"2024-12-11T04:05:18.165248123Z","level":"INFO","msg":"stream: closing","id":"4h85t3a9"}
39
- {"time":"2024-12-11T04:05:18.165271526Z","level":"INFO","msg":"handler: closed","stream_id":"4h85t3a9"}
40
- {"time":"2024-12-11T04:05:18.165281328Z","level":"INFO","msg":"writer: Close: closed","stream_id":"4h85t3a9"}
41
- {"time":"2024-12-11T04:05:18.165315071Z","level":"INFO","msg":"sender: closed","stream_id":"4h85t3a9"}
42
- {"time":"2024-12-11T04:05:18.165351695Z","level":"INFO","msg":"stream: closed","id":"4h85t3a9"}
 
1
+ {"time":"2024-12-11T04:30:31.64567442Z","level":"INFO","msg":"using version","core version":"0.19.0"}
2
+ {"time":"2024-12-11T04:30:31.645694977Z","level":"INFO","msg":"created symlink","path":"/content/wandb/run-20241211_043031-pfhjmc9v/logs/debug-core.log"}
3
+ {"time":"2024-12-11T04:30:31.748128189Z","level":"INFO","msg":"created new stream","id":"pfhjmc9v"}
4
+ {"time":"2024-12-11T04:30:31.748162278Z","level":"INFO","msg":"stream: started","id":"pfhjmc9v"}
5
+ {"time":"2024-12-11T04:30:31.748191334Z","level":"INFO","msg":"writer: Do: started","stream_id":"pfhjmc9v"}
6
+ {"time":"2024-12-11T04:30:31.748217805Z","level":"INFO","msg":"handler: started","stream_id":"pfhjmc9v"}
7
+ {"time":"2024-12-11T04:30:31.748239576Z","level":"INFO","msg":"sender: started","stream_id":"pfhjmc9v"}
8
+ {"time":"2024-12-11T04:30:34.483159987Z","level":"INFO","msg":"Starting system monitor"}
9
+ {"time":"2024-12-11T04:30:35.046627667Z","level":"INFO","msg":"Pausing system monitor"}
10
+ {"time":"2024-12-11T04:30:35.046662301Z","level":"INFO","msg":"Resuming system monitor"}
11
+ {"time":"2024-12-11T04:30:35.046670624Z","level":"INFO","msg":"Pausing system monitor"}
12
+ {"time":"2024-12-11T04:30:35.046676728Z","level":"INFO","msg":"Resuming system monitor"}
13
+ {"time":"2024-12-11T04:30:35.046682641Z","level":"INFO","msg":"Pausing system monitor"}
14
+ {"time":"2024-12-11T04:30:35.046688398Z","level":"INFO","msg":"Resuming system monitor"}
15
+ {"time":"2024-12-11T04:30:35.626440922Z","level":"INFO","msg":"Pausing system monitor"}
16
+ {"time":"2024-12-11T04:30:35.667423399Z","level":"INFO","msg":"Resuming system monitor"}
17
+ {"time":"2024-12-11T04:37:00.589693882Z","level":"INFO","msg":"Pausing system monitor"}
18
+ {"time":"2024-12-11T04:37:00.594358479Z","level":"INFO","msg":"Resuming system monitor"}
19
+ {"time":"2024-12-11T04:37:02.005522092Z","level":"INFO","msg":"Pausing system monitor"}
20
+ {"time":"2024-12-11T04:37:02.010541772Z","level":"INFO","msg":"Resuming system monitor"}
21
+ {"time":"2024-12-11T04:37:02.011288477Z","level":"INFO","msg":"Pausing system monitor"}
22
+ {"time":"2024-12-11T04:37:02.017364808Z","level":"INFO","msg":"Resuming system monitor"}
23
+ {"time":"2024-12-11T04:37:02.171915316Z","level":"INFO","msg":"Pausing system monitor"}
24
+ {"time":"2024-12-11T04:37:02.177365784Z","level":"INFO","msg":"Resuming system monitor"}
25
+ {"time":"2024-12-11T04:37:02.180571775Z","level":"INFO","msg":"Pausing system monitor"}
26
+ {"time":"2024-12-11T04:37:02.185372366Z","level":"INFO","msg":"Resuming system monitor"}
27
+ {"time":"2024-12-11T04:37:05.365253262Z","level":"INFO","msg":"Pausing system monitor"}
28
+ {"time":"2024-12-11T04:37:05.372555953Z","level":"INFO","msg":"Resuming system monitor"}
29
+ {"time":"2024-12-11T04:37:05.373998802Z","level":"INFO","msg":"Pausing system monitor"}
30
+ {"time":"2024-12-11T04:37:05.379408244Z","level":"INFO","msg":"Resuming system monitor"}
31
+ {"time":"2024-12-11T04:37:05.3922832Z","level":"INFO","msg":"Pausing system monitor"}
32
+ {"time":"2024-12-11T04:37:05.396859255Z","level":"INFO","msg":"Resuming system monitor"}
33
+ {"time":"2024-12-11T04:37:05.398980138Z","level":"INFO","msg":"Stopping system monitor"}
34
+ {"time":"2024-12-11T04:37:05.399571058Z","level":"INFO","msg":"Stopped system monitor"}
35
+ {"time":"2024-12-11T04:37:05.608796436Z","level":"INFO","msg":"fileTransfer: Close: file transfer manager closed"}
36
+ {"time":"2024-12-11T04:37:05.673605445Z","level":"INFO","msg":"handler: operation stats","stats":{}}
37
+ {"time":"2024-12-11T04:37:06.688087405Z","level":"INFO","msg":"stream: closing","id":"pfhjmc9v"}
38
+ {"time":"2024-12-11T04:37:06.68810837Z","level":"INFO","msg":"handler: closed","stream_id":"pfhjmc9v"}
39
+ {"time":"2024-12-11T04:37:06.688117223Z","level":"INFO","msg":"writer: Close: closed","stream_id":"pfhjmc9v"}
40
+ {"time":"2024-12-11T04:37:06.688143843Z","level":"INFO","msg":"sender: closed","stream_id":"pfhjmc9v"}
41
+ {"time":"2024-12-11T04:37:06.688182944Z","level":"INFO","msg":"stream: closed","id":"pfhjmc9v"}
 
wandb/debug.log CHANGED
@@ -1,73 +1,73 @@
1
- 2024-12-11 03:56:14,967 INFO MainThread:5214 [wandb_setup.py:_flush():68] Current SDK version is 0.19.0
2
- 2024-12-11 03:56:14,967 INFO MainThread:5214 [wandb_setup.py:_flush():68] Configure stats pid to 5214
3
- 2024-12-11 03:56:14,967 INFO MainThread:5214 [wandb_setup.py:_flush():68] Loading settings from /root/.config/wandb/settings
4
- 2024-12-11 03:56:14,967 INFO MainThread:5214 [wandb_setup.py:_flush():68] Loading settings from /content/wandb/settings
5
- 2024-12-11 03:56:14,967 INFO MainThread:5214 [wandb_setup.py:_flush():68] Loading settings from environment variables
6
- 2024-12-11 03:56:14,967 INFO MainThread:5214 [wandb_init.py:_log_setup():528] Logging user logs to /content/wandb/run-20241211_035614-4h85t3a9/logs/debug.log
7
- 2024-12-11 03:56:14,967 INFO MainThread:5214 [wandb_init.py:_log_setup():529] Logging internal logs to /content/wandb/run-20241211_035614-4h85t3a9/logs/debug-internal.log
8
- 2024-12-11 03:56:14,967 INFO MainThread:5214 [wandb_init.py:_jupyter_setup():474] configuring jupyter hooks <wandb.sdk.wandb_init._WandbInit object at 0x7bea66d57c70>
9
- 2024-12-11 03:56:14,968 INFO MainThread:5214 [wandb_init.py:init():639] calling init triggers
10
- 2024-12-11 03:56:14,968 INFO MainThread:5214 [wandb_init.py:init():645] wandb.init called with sweep_config: {}
11
  config: {}
12
- 2024-12-11 03:56:14,968 INFO MainThread:5214 [wandb_init.py:init():688] starting backend
13
- 2024-12-11 03:56:14,968 INFO MainThread:5214 [wandb_init.py:init():692] sending inform_init request
14
- 2024-12-11 03:56:14,970 INFO MainThread:5214 [backend.py:_multiprocessing_setup():104] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
15
- 2024-12-11 03:56:14,971 INFO MainThread:5214 [wandb_init.py:init():705] backend started and connected
16
- 2024-12-11 03:56:14,978 INFO MainThread:5214 [wandb_run.py:_label_probe_notebook():1327] probe notebook
17
- 2024-12-11 03:56:15,888 INFO MainThread:5214 [wandb_init.py:init():798] updated telemetry
18
- 2024-12-11 03:56:15,889 INFO MainThread:5214 [wandb_init.py:init():830] communicating run to backend with 90.0 second timeout
19
- 2024-12-11 03:56:15,963 INFO MainThread:5214 [wandb_init.py:init():882] starting run threads in backend
20
- 2024-12-11 03:56:16,417 INFO MainThread:5214 [wandb_run.py:_console_start():2443] atexit reg
21
- 2024-12-11 03:56:16,417 INFO MainThread:5214 [wandb_run.py:_redirect():2293] redirect: wrap_raw
22
- 2024-12-11 03:56:16,417 INFO MainThread:5214 [wandb_run.py:_redirect():2358] Wrapping output streams.
23
- 2024-12-11 03:56:16,417 INFO MainThread:5214 [wandb_run.py:_redirect():2383] Redirects installed.
24
- 2024-12-11 03:56:16,419 INFO MainThread:5214 [wandb_init.py:init():925] run started, returning control to user process
25
- 2024-12-11 03:56:16,423 INFO MainThread:5214 [jupyter.py:save_ipynb():387] not saving jupyter notebook
26
- 2024-12-11 03:56:16,423 INFO MainThread:5214 [wandb_init.py:_pause_backend():439] pausing backend
27
- 2024-12-11 03:56:16,428 INFO MainThread:5214 [wandb_init.py:_resume_backend():444] resuming backend
28
- 2024-12-11 03:56:16,429 INFO MainThread:5214 [jupyter.py:save_ipynb():387] not saving jupyter notebook
29
- 2024-12-11 03:56:16,429 INFO MainThread:5214 [wandb_init.py:_pause_backend():439] pausing backend
30
- 2024-12-11 03:56:16,439 INFO MainThread:5214 [wandb_init.py:_resume_backend():444] resuming backend
31
- 2024-12-11 03:56:16,441 INFO MainThread:5214 [jupyter.py:save_ipynb():387] not saving jupyter notebook
32
- 2024-12-11 03:56:16,441 INFO MainThread:5214 [wandb_init.py:_pause_backend():439] pausing backend
33
- 2024-12-11 03:56:16,449 INFO MainThread:5214 [wandb_init.py:_resume_backend():444] resuming backend
34
- 2024-12-11 03:56:17,110 INFO MainThread:5214 [jupyter.py:save_ipynb():387] not saving jupyter notebook
35
- 2024-12-11 03:56:17,110 INFO MainThread:5214 [wandb_init.py:_pause_backend():439] pausing backend
36
- 2024-12-11 03:56:17,115 INFO MainThread:5214 [wandb_init.py:_resume_backend():444] resuming backend
37
- 2024-12-11 03:56:17,487 INFO MainThread:5214 [wandb_run.py:_config_callback():1375] config_cb None None {'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'float32', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': True, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['BertForMaskedLM'], 'finetuning_task': None, 'id2label': {0: '0', 1: '1'}, 'label2id': {'0': 0, '1': 1}, 'tokenizer_class': None, 'prefix': None, 'bos_token_id': None, 'pad_token_id': 0, 'eos_token_id': None, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'belisards/congretimbau', '_attn_implementation_autoset': True, 'transformers_version': '4.47.0', 'directionality': 'bidi', 'model_type': 'bert', 'output_past': True, 'pooler_fc_size': 768, 'pooler_num_attention_heads': 12, 'pooler_num_fc_layers': 3, 'pooler_size_per_head': 128, 'pooler_type': 'first_token_transform', 'vocab_size': 29794, 'hidden_size': 1024, 'num_hidden_layers': 24, 'num_attention_heads': 16, 'hidden_act': 'gelu', 'intermediate_size': 4096, 'hidden_dropout_prob': 0.05, 'attention_probs_dropout_prob': 0.05, 'max_position_embeddings': 512, 'type_vocab_size': 2, 'initializer_range': 0.02, 'layer_norm_eps': 1e-12, 'position_embedding_type': 'absolute', 'use_cache': True, 'classifier_dropout': None, 'output_dir': '.', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': True, 'do_predict': False, 'eval_strategy': 'epoch', 'prediction_loss_only': False, 'per_device_train_batch_size': 64, 'per_device_eval_batch_size': 64, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 1, 'eval_accumulation_steps': None, 'eval_delay': 0, 'torch_empty_cache_steps': None, 'learning_rate': 1e-05, 'weight_decay': 0.02, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 15, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 200, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': './runs/Dec11_03-56-16_0a818e0bf512', 'logging_strategy': 'steps', 'logging_first_step': False, 'logging_steps': 10, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 102, 'save_total_limit': 1, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'restore_callback_states_from_checkpoint': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 5151, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': False, 'fp16': False, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': 51, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': 'm2-congretimbau_tunado', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': True, 'metric_for_best_model': 'f1', 'greater_is_better': True, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'adamw_torch', 'optim_args': None, 'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': None, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'include_for_metrics': [], 'eval_do_concat_batches': True, 'fp16_backend': 'auto', 'evaluation_strategy': None, 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': False, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None, 'optim_target_modules': None, 'batch_eval_metrics': False, 'eval_on_start': False, 'use_liger_kernel': False, 'eval_use_gather_object': False, 'average_tokens_across_devices': False}
38
- 2024-12-11 03:56:17,491 INFO MainThread:5214 [wandb_config.py:__setitem__():154] config set model/num_parameters = 334398466 - <bound method Run._config_callback of <wandb.sdk.wandb_run.Run object at 0x7bea30363fd0>>
39
- 2024-12-11 03:56:17,491 INFO MainThread:5214 [wandb_run.py:_config_callback():1375] config_cb model/num_parameters 334398466 None
40
- 2024-12-11 04:05:11,747 INFO MainThread:5214 [jupyter.py:save_ipynb():387] not saving jupyter notebook
41
- 2024-12-11 04:05:11,747 INFO MainThread:5214 [wandb_init.py:_pause_backend():439] pausing backend
42
- 2024-12-11 04:05:11,751 INFO MainThread:5214 [wandb_init.py:_resume_backend():444] resuming backend
43
- 2024-12-11 04:05:13,108 INFO MainThread:5214 [jupyter.py:save_ipynb():387] not saving jupyter notebook
44
- 2024-12-11 04:05:13,108 INFO MainThread:5214 [wandb_init.py:_pause_backend():439] pausing backend
45
- 2024-12-11 04:05:13,112 INFO MainThread:5214 [wandb_init.py:_resume_backend():444] resuming backend
46
- 2024-12-11 04:05:13,113 INFO MainThread:5214 [jupyter.py:save_ipynb():387] not saving jupyter notebook
47
- 2024-12-11 04:05:13,113 INFO MainThread:5214 [wandb_init.py:_pause_backend():439] pausing backend
48
- 2024-12-11 04:05:13,118 INFO MainThread:5214 [wandb_init.py:_resume_backend():444] resuming backend
49
- 2024-12-11 04:05:13,201 INFO MainThread:5214 [jupyter.py:save_ipynb():387] not saving jupyter notebook
50
- 2024-12-11 04:05:13,201 INFO MainThread:5214 [wandb_init.py:_pause_backend():439] pausing backend
51
- 2024-12-11 04:05:13,206 INFO MainThread:5214 [wandb_init.py:_resume_backend():444] resuming backend
52
- 2024-12-11 04:05:13,208 INFO MainThread:5214 [jupyter.py:save_ipynb():387] not saving jupyter notebook
53
- 2024-12-11 04:05:13,208 INFO MainThread:5214 [wandb_init.py:_pause_backend():439] pausing backend
54
- 2024-12-11 04:05:13,213 INFO MainThread:5214 [wandb_init.py:_resume_backend():444] resuming backend
55
- 2024-12-11 04:05:16,797 INFO MainThread:5214 [jupyter.py:save_ipynb():387] not saving jupyter notebook
56
- 2024-12-11 04:05:16,798 INFO MainThread:5214 [wandb_init.py:_pause_backend():439] pausing backend
57
- 2024-12-11 04:05:16,806 INFO MainThread:5214 [wandb_init.py:_resume_backend():444] resuming backend
58
- 2024-12-11 04:05:16,807 INFO MainThread:5214 [jupyter.py:save_ipynb():387] not saving jupyter notebook
59
- 2024-12-11 04:05:16,807 INFO MainThread:5214 [wandb_init.py:_pause_backend():439] pausing backend
60
- 2024-12-11 04:05:16,813 INFO MainThread:5214 [wandb_init.py:_resume_backend():444] resuming backend
61
- 2024-12-11 04:05:16,825 INFO MainThread:5214 [jupyter.py:save_ipynb():387] not saving jupyter notebook
62
- 2024-12-11 04:05:16,826 INFO MainThread:5214 [wandb_init.py:_pause_backend():439] pausing backend
63
- 2024-12-11 04:05:16,830 INFO MainThread:5214 [wandb_init.py:_resume_backend():444] resuming backend
64
- 2024-12-11 04:05:16,831 INFO MainThread:5214 [wandb_run.py:_finish():2169] finishing run adrianobf/AzMina-M2/4h85t3a9
65
- 2024-12-11 04:05:16,831 INFO MainThread:5214 [jupyter.py:save_history():459] not saving jupyter history
66
- 2024-12-11 04:05:16,831 INFO MainThread:5214 [jupyter.py:save_ipynb():387] not saving jupyter notebook
67
- 2024-12-11 04:05:16,831 INFO MainThread:5214 [wandb_init.py:_jupyter_teardown():456] cleaning up jupyter logic
68
- 2024-12-11 04:05:16,832 INFO MainThread:5214 [wandb_run.py:_atexit_cleanup():2408] got exitcode: 0
69
- 2024-12-11 04:05:16,832 INFO MainThread:5214 [wandb_run.py:_restore():2390] restore
70
- 2024-12-11 04:05:16,832 INFO MainThread:5214 [wandb_run.py:_restore():2396] restore done
71
- 2024-12-11 04:05:18,152 INFO MainThread:5214 [wandb_run.py:_footer_history_summary_info():3951] rendering history
72
- 2024-12-11 04:05:18,152 INFO MainThread:5214 [wandb_run.py:_footer_history_summary_info():3983] rendering summary
73
- 2024-12-11 04:05:18,160 INFO MainThread:5214 [wandb_run.py:_footer_sync_info():3912] logging synced files
 
1
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_setup.py:_flush():68] Current SDK version is 0.19.0
2
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_setup.py:_flush():68] Configure stats pid to 2341
3
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_setup.py:_flush():68] Loading settings from /root/.config/wandb/settings
4
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_setup.py:_flush():68] Loading settings from /content/wandb/settings
5
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_setup.py:_flush():68] Loading settings from environment variables
6
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_init.py:_log_setup():528] Logging user logs to /content/wandb/run-20241211_043031-pfhjmc9v/logs/debug.log
7
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_init.py:_log_setup():529] Logging internal logs to /content/wandb/run-20241211_043031-pfhjmc9v/logs/debug-internal.log
8
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_init.py:_jupyter_setup():474] configuring jupyter hooks <wandb.sdk.wandb_init._WandbInit object at 0x7de625ba39d0>
9
+ 2024-12-11 04:30:31,641 INFO MainThread:2341 [wandb_init.py:init():639] calling init triggers
10
+ 2024-12-11 04:30:31,641 INFO MainThread:2341 [wandb_init.py:init():645] wandb.init called with sweep_config: {}
11
  config: {}
12
+ 2024-12-11 04:30:31,641 INFO MainThread:2341 [wandb_init.py:init():688] starting backend
13
+ 2024-12-11 04:30:31,641 INFO MainThread:2341 [wandb_init.py:init():692] sending inform_init request
14
+ 2024-12-11 04:30:31,643 INFO MainThread:2341 [backend.py:_multiprocessing_setup():104] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
15
+ 2024-12-11 04:30:31,644 INFO MainThread:2341 [wandb_init.py:init():705] backend started and connected
16
+ 2024-12-11 04:30:31,652 INFO MainThread:2341 [wandb_run.py:_label_probe_notebook():1327] probe notebook
17
+ 2024-12-11 04:30:34,399 INFO MainThread:2341 [wandb_init.py:init():798] updated telemetry
18
+ 2024-12-11 04:30:34,399 INFO MainThread:2341 [wandb_init.py:init():830] communicating run to backend with 90.0 second timeout
19
+ 2024-12-11 04:30:34,477 INFO MainThread:2341 [wandb_init.py:init():882] starting run threads in backend
20
+ 2024-12-11 04:30:35,007 INFO MainThread:2341 [wandb_run.py:_console_start():2443] atexit reg
21
+ 2024-12-11 04:30:35,007 INFO MainThread:2341 [wandb_run.py:_redirect():2293] redirect: wrap_raw
22
+ 2024-12-11 04:30:35,007 INFO MainThread:2341 [wandb_run.py:_redirect():2358] Wrapping output streams.
23
+ 2024-12-11 04:30:35,007 INFO MainThread:2341 [wandb_run.py:_redirect():2383] Redirects installed.
24
+ 2024-12-11 04:30:35,009 INFO MainThread:2341 [wandb_init.py:init():925] run started, returning control to user process
25
+ 2024-12-11 04:30:35,013 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
26
+ 2024-12-11 04:30:35,013 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
27
+ 2024-12-11 04:30:35,019 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
28
+ 2024-12-11 04:30:35,020 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
29
+ 2024-12-11 04:30:35,020 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
30
+ 2024-12-11 04:30:35,028 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
31
+ 2024-12-11 04:30:35,029 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
32
+ 2024-12-11 04:30:35,030 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
33
+ 2024-12-11 04:30:35,037 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
34
+ 2024-12-11 04:30:35,626 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
35
+ 2024-12-11 04:30:35,626 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
36
+ 2024-12-11 04:30:35,630 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
37
+ 2024-12-11 04:30:36,007 INFO MainThread:2341 [wandb_run.py:_config_callback():1375] config_cb None None {'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'float32', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': True, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['BertForMaskedLM'], 'finetuning_task': None, 'id2label': {0: '0', 1: '1'}, 'label2id': {'0': 0, '1': 1}, 'tokenizer_class': None, 'prefix': None, 'bos_token_id': None, 'pad_token_id': 0, 'eos_token_id': None, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'belisards/congretimbau', '_attn_implementation_autoset': True, 'transformers_version': '4.47.0', 'directionality': 'bidi', 'model_type': 'bert', 'output_past': True, 'pooler_fc_size': 768, 'pooler_num_attention_heads': 12, 'pooler_num_fc_layers': 3, 'pooler_size_per_head': 128, 'pooler_type': 'first_token_transform', 'vocab_size': 29794, 'hidden_size': 1024, 'num_hidden_layers': 24, 'num_attention_heads': 16, 'hidden_act': 'gelu', 'intermediate_size': 4096, 'hidden_dropout_prob': 0.05, 'attention_probs_dropout_prob': 0.05, 'max_position_embeddings': 512, 'type_vocab_size': 2, 'initializer_range': 0.02, 'layer_norm_eps': 1e-12, 'position_embedding_type': 'absolute', 'use_cache': True, 'classifier_dropout': None, 'output_dir': '.', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': True, 'do_predict': False, 'eval_strategy': 'epoch', 'prediction_loss_only': False, 'per_device_train_batch_size': 32, 'per_device_eval_batch_size': 32, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 1, 'eval_accumulation_steps': None, 'eval_delay': 0, 'torch_empty_cache_steps': None, 'learning_rate': 1e-05, 'weight_decay': 0.02, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 18, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 200, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': './runs/Dec11_04-30-35_fd55a770be24', 'logging_strategy': 'steps', 'logging_first_step': False, 'logging_steps': 10, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 204, 'save_total_limit': 1, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'restore_callback_states_from_checkpoint': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 5151, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': False, 'fp16': False, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': 102, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': 'm2-congretimbau_tunado', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': True, 'metric_for_best_model': 'f1', 'greater_is_better': True, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'adamw_torch', 'optim_args': None, 'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': None, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'include_for_metrics': [], 'eval_do_concat_batches': True, 'fp16_backend': 'auto', 'evaluation_strategy': None, 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': False, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None, 'optim_target_modules': None, 'batch_eval_metrics': False, 'eval_on_start': False, 'use_liger_kernel': False, 'eval_use_gather_object': False, 'average_tokens_across_devices': False}
38
+ 2024-12-11 04:30:36,011 INFO MainThread:2341 [wandb_config.py:__setitem__():154] config set model/num_parameters = 334398466 - <bound method Run._config_callback of <wandb.sdk.wandb_run.Run object at 0x7de6143df1c0>>
39
+ 2024-12-11 04:30:36,011 INFO MainThread:2341 [wandb_run.py:_config_callback():1375] config_cb model/num_parameters 334398466 None
40
+ 2024-12-11 04:37:00,589 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
41
+ 2024-12-11 04:37:00,589 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
42
+ 2024-12-11 04:37:00,594 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
43
+ 2024-12-11 04:37:02,005 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
44
+ 2024-12-11 04:37:02,005 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
45
+ 2024-12-11 04:37:02,010 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
46
+ 2024-12-11 04:37:02,010 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
47
+ 2024-12-11 04:37:02,011 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
48
+ 2024-12-11 04:37:02,017 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
49
+ 2024-12-11 04:37:02,171 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
50
+ 2024-12-11 04:37:02,171 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
51
+ 2024-12-11 04:37:02,177 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
52
+ 2024-12-11 04:37:02,180 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
53
+ 2024-12-11 04:37:02,180 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
54
+ 2024-12-11 04:37:02,184 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
55
+ 2024-12-11 04:37:05,364 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
56
+ 2024-12-11 04:37:05,364 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
57
+ 2024-12-11 04:37:05,372 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
58
+ 2024-12-11 04:37:05,373 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
59
+ 2024-12-11 04:37:05,373 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
60
+ 2024-12-11 04:37:05,379 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
61
+ 2024-12-11 04:37:05,392 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
62
+ 2024-12-11 04:37:05,392 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
63
+ 2024-12-11 04:37:05,396 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
64
+ 2024-12-11 04:37:05,397 INFO MainThread:2341 [wandb_run.py:_finish():2169] finishing run adrianobf/AzMina-M2/pfhjmc9v
65
+ 2024-12-11 04:37:05,397 INFO MainThread:2341 [jupyter.py:save_history():459] not saving jupyter history
66
+ 2024-12-11 04:37:05,397 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
67
+ 2024-12-11 04:37:05,397 INFO MainThread:2341 [wandb_init.py:_jupyter_teardown():456] cleaning up jupyter logic
68
+ 2024-12-11 04:37:05,397 INFO MainThread:2341 [wandb_run.py:_atexit_cleanup():2408] got exitcode: 0
69
+ 2024-12-11 04:37:05,398 INFO MainThread:2341 [wandb_run.py:_restore():2390] restore
70
+ 2024-12-11 04:37:05,398 INFO MainThread:2341 [wandb_run.py:_restore():2396] restore done
71
+ 2024-12-11 04:37:06,676 INFO MainThread:2341 [wandb_run.py:_footer_history_summary_info():3951] rendering history
72
+ 2024-12-11 04:37:06,677 INFO MainThread:2341 [wandb_run.py:_footer_history_summary_info():3983] rendering summary
73
+ 2024-12-11 04:37:06,685 INFO MainThread:2341 [wandb_run.py:_footer_sync_info():3912] logging synced files
wandb/run-20241211_042633-wah4vux1/files/config.yaml ADDED
@@ -0,0 +1,494 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ _attn_implementation_autoset:
2
+ value: true
3
+ _name_or_path:
4
+ value: belisards/congretimbau
5
+ _wandb:
6
+ value:
7
+ cli_version: 0.19.0
8
+ m:
9
+ - "1": train/global_step
10
+ "6":
11
+ - 3
12
+ "7": []
13
+ python_version: 3.10.12
14
+ t:
15
+ "1":
16
+ - 1
17
+ - 2
18
+ - 3
19
+ - 5
20
+ - 11
21
+ - 12
22
+ - 41
23
+ - 49
24
+ - 51
25
+ - 53
26
+ - 55
27
+ - 71
28
+ - 98
29
+ - 100
30
+ - 105
31
+ "2":
32
+ - 1
33
+ - 2
34
+ - 3
35
+ - 5
36
+ - 11
37
+ - 12
38
+ - 41
39
+ - 49
40
+ - 51
41
+ - 53
42
+ - 55
43
+ - 71
44
+ - 98
45
+ - 100
46
+ - 105
47
+ "3":
48
+ - 7
49
+ - 13
50
+ - 19
51
+ - 23
52
+ - 55
53
+ - 66
54
+ "4": 3.10.12
55
+ "5": 0.19.0
56
+ "6": 4.47.0
57
+ "8":
58
+ - 1
59
+ - 5
60
+ - 12
61
+ "9":
62
+ "1": transformers_trainer
63
+ "12": 0.19.0
64
+ "13": linux-x86_64
65
+ accelerator_config:
66
+ value:
67
+ dispatch_batches: null
68
+ even_batches: true
69
+ gradient_accumulation_kwargs: null
70
+ non_blocking: false
71
+ split_batches: false
72
+ use_seedable_sampler: true
73
+ adafactor:
74
+ value: false
75
+ adam_beta1:
76
+ value: 0.9
77
+ adam_beta2:
78
+ value: 0.999
79
+ adam_epsilon:
80
+ value: 1e-08
81
+ add_cross_attention:
82
+ value: false
83
+ architectures:
84
+ value:
85
+ - BertForMaskedLM
86
+ attention_probs_dropout_prob:
87
+ value: 0.05
88
+ auto_find_batch_size:
89
+ value: false
90
+ average_tokens_across_devices:
91
+ value: false
92
+ bad_words_ids:
93
+ value: null
94
+ batch_eval_metrics:
95
+ value: false
96
+ begin_suppress_tokens:
97
+ value: null
98
+ bf16:
99
+ value: false
100
+ bf16_full_eval:
101
+ value: false
102
+ bos_token_id:
103
+ value: null
104
+ chunk_size_feed_forward:
105
+ value: 0
106
+ classifier_dropout:
107
+ value: null
108
+ cross_attention_hidden_size:
109
+ value: null
110
+ data_seed:
111
+ value: null
112
+ dataloader_drop_last:
113
+ value: false
114
+ dataloader_num_workers:
115
+ value: 0
116
+ dataloader_persistent_workers:
117
+ value: false
118
+ dataloader_pin_memory:
119
+ value: true
120
+ dataloader_prefetch_factor:
121
+ value: null
122
+ ddp_backend:
123
+ value: null
124
+ ddp_broadcast_buffers:
125
+ value: null
126
+ ddp_bucket_cap_mb:
127
+ value: null
128
+ ddp_find_unused_parameters:
129
+ value: null
130
+ ddp_timeout:
131
+ value: 1800
132
+ debug:
133
+ value: []
134
+ decoder_start_token_id:
135
+ value: null
136
+ deepspeed:
137
+ value: null
138
+ directionality:
139
+ value: bidi
140
+ disable_tqdm:
141
+ value: false
142
+ dispatch_batches:
143
+ value: null
144
+ diversity_penalty:
145
+ value: 0
146
+ do_eval:
147
+ value: true
148
+ do_predict:
149
+ value: false
150
+ do_sample:
151
+ value: false
152
+ do_train:
153
+ value: false
154
+ early_stopping:
155
+ value: false
156
+ encoder_no_repeat_ngram_size:
157
+ value: 0
158
+ eos_token_id:
159
+ value: null
160
+ eval_accumulation_steps:
161
+ value: null
162
+ eval_delay:
163
+ value: 0
164
+ eval_do_concat_batches:
165
+ value: true
166
+ eval_on_start:
167
+ value: false
168
+ eval_steps:
169
+ value: 24
170
+ eval_strategy:
171
+ value: epoch
172
+ eval_use_gather_object:
173
+ value: false
174
+ evaluation_strategy:
175
+ value: null
176
+ exponential_decay_length_penalty:
177
+ value: null
178
+ finetuning_task:
179
+ value: null
180
+ forced_bos_token_id:
181
+ value: null
182
+ forced_eos_token_id:
183
+ value: null
184
+ fp16:
185
+ value: false
186
+ fp16_backend:
187
+ value: auto
188
+ fp16_full_eval:
189
+ value: false
190
+ fp16_opt_level:
191
+ value: O1
192
+ fsdp:
193
+ value: []
194
+ fsdp_config:
195
+ value:
196
+ min_num_params: 0
197
+ xla: false
198
+ xla_fsdp_grad_ckpt: false
199
+ xla_fsdp_v2: false
200
+ fsdp_min_num_params:
201
+ value: 0
202
+ fsdp_transformer_layer_cls_to_wrap:
203
+ value: null
204
+ full_determinism:
205
+ value: false
206
+ gradient_accumulation_steps:
207
+ value: 1
208
+ gradient_checkpointing:
209
+ value: false
210
+ gradient_checkpointing_kwargs:
211
+ value: null
212
+ greater_is_better:
213
+ value: true
214
+ group_by_length:
215
+ value: false
216
+ half_precision_backend:
217
+ value: auto
218
+ hidden_act:
219
+ value: gelu
220
+ hidden_dropout_prob:
221
+ value: 0.05
222
+ hidden_size:
223
+ value: 1024
224
+ hub_always_push:
225
+ value: false
226
+ hub_model_id:
227
+ value: null
228
+ hub_private_repo:
229
+ value: null
230
+ hub_strategy:
231
+ value: every_save
232
+ hub_token:
233
+ value: <HUB_TOKEN>
234
+ id2label:
235
+ value:
236
+ "0": "0"
237
+ "1": "1"
238
+ ignore_data_skip:
239
+ value: false
240
+ include_for_metrics:
241
+ value: []
242
+ include_inputs_for_metrics:
243
+ value: false
244
+ include_num_input_tokens_seen:
245
+ value: false
246
+ include_tokens_per_second:
247
+ value: false
248
+ initializer_range:
249
+ value: 0.02
250
+ intermediate_size:
251
+ value: 4096
252
+ is_decoder:
253
+ value: false
254
+ is_encoder_decoder:
255
+ value: false
256
+ jit_mode_eval:
257
+ value: false
258
+ label_names:
259
+ value: null
260
+ label_smoothing_factor:
261
+ value: 0
262
+ label2id:
263
+ value:
264
+ "0": 0
265
+ "1": 1
266
+ layer_norm_eps:
267
+ value: 1e-12
268
+ learning_rate:
269
+ value: 1e-05
270
+ length_column_name:
271
+ value: length
272
+ length_penalty:
273
+ value: 1
274
+ load_best_model_at_end:
275
+ value: true
276
+ local_rank:
277
+ value: 0
278
+ log_level:
279
+ value: passive
280
+ log_level_replica:
281
+ value: warning
282
+ log_on_each_node:
283
+ value: true
284
+ logging_dir:
285
+ value: ./runs/Dec11_04-28-19_fd55a770be24
286
+ logging_first_step:
287
+ value: false
288
+ logging_nan_inf_filter:
289
+ value: true
290
+ logging_steps:
291
+ value: 10
292
+ logging_strategy:
293
+ value: steps
294
+ lr_scheduler_type:
295
+ value: linear
296
+ max_grad_norm:
297
+ value: 1
298
+ max_length:
299
+ value: 20
300
+ max_position_embeddings:
301
+ value: 512
302
+ max_steps:
303
+ value: -1
304
+ metric_for_best_model:
305
+ value: f1
306
+ min_length:
307
+ value: 0
308
+ model/num_parameters:
309
+ value: 334398466
310
+ model_type:
311
+ value: bert
312
+ mp_parameters:
313
+ value: ""
314
+ neftune_noise_alpha:
315
+ value: null
316
+ no_cuda:
317
+ value: false
318
+ no_repeat_ngram_size:
319
+ value: 0
320
+ num_attention_heads:
321
+ value: 16
322
+ num_beam_groups:
323
+ value: 1
324
+ num_beams:
325
+ value: 1
326
+ num_hidden_layers:
327
+ value: 24
328
+ num_return_sequences:
329
+ value: 1
330
+ num_train_epochs:
331
+ value: 15
332
+ optim:
333
+ value: adamw_torch
334
+ optim_args:
335
+ value: null
336
+ optim_target_modules:
337
+ value: null
338
+ output_attentions:
339
+ value: false
340
+ output_dir:
341
+ value: .
342
+ output_hidden_states:
343
+ value: false
344
+ output_past:
345
+ value: true
346
+ output_scores:
347
+ value: false
348
+ overwrite_output_dir:
349
+ value: false
350
+ pad_token_id:
351
+ value: 0
352
+ past_index:
353
+ value: -1
354
+ per_device_eval_batch_size:
355
+ value: 128
356
+ per_device_train_batch_size:
357
+ value: 128
358
+ per_gpu_eval_batch_size:
359
+ value: null
360
+ per_gpu_train_batch_size:
361
+ value: null
362
+ pooler_fc_size:
363
+ value: 768
364
+ pooler_num_attention_heads:
365
+ value: 12
366
+ pooler_num_fc_layers:
367
+ value: 3
368
+ pooler_size_per_head:
369
+ value: 128
370
+ pooler_type:
371
+ value: first_token_transform
372
+ position_embedding_type:
373
+ value: absolute
374
+ prediction_loss_only:
375
+ value: false
376
+ prefix:
377
+ value: null
378
+ problem_type:
379
+ value: null
380
+ push_to_hub:
381
+ value: false
382
+ push_to_hub_model_id:
383
+ value: null
384
+ push_to_hub_organization:
385
+ value: null
386
+ push_to_hub_token:
387
+ value: <PUSH_TO_HUB_TOKEN>
388
+ ray_scope:
389
+ value: last
390
+ remove_invalid_values:
391
+ value: false
392
+ remove_unused_columns:
393
+ value: true
394
+ repetition_penalty:
395
+ value: 1
396
+ report_to:
397
+ value:
398
+ - wandb
399
+ restore_callback_states_from_checkpoint:
400
+ value: false
401
+ resume_from_checkpoint:
402
+ value: null
403
+ return_dict:
404
+ value: true
405
+ return_dict_in_generate:
406
+ value: false
407
+ run_name:
408
+ value: m2-congretimbau_tunado
409
+ save_on_each_node:
410
+ value: false
411
+ save_only_model:
412
+ value: false
413
+ save_safetensors:
414
+ value: true
415
+ save_steps:
416
+ value: 48
417
+ save_strategy:
418
+ value: epoch
419
+ save_total_limit:
420
+ value: 1
421
+ seed:
422
+ value: 5151
423
+ sep_token_id:
424
+ value: null
425
+ skip_memory_metrics:
426
+ value: true
427
+ split_batches:
428
+ value: null
429
+ suppress_tokens:
430
+ value: null
431
+ task_specific_params:
432
+ value: null
433
+ temperature:
434
+ value: 1
435
+ tf_legacy_loss:
436
+ value: false
437
+ tf32:
438
+ value: null
439
+ tie_encoder_decoder:
440
+ value: false
441
+ tie_word_embeddings:
442
+ value: true
443
+ tokenizer_class:
444
+ value: null
445
+ top_k:
446
+ value: 50
447
+ top_p:
448
+ value: 1
449
+ torch_compile:
450
+ value: false
451
+ torch_compile_backend:
452
+ value: null
453
+ torch_compile_mode:
454
+ value: null
455
+ torch_dtype:
456
+ value: float32
457
+ torch_empty_cache_steps:
458
+ value: null
459
+ torchdynamo:
460
+ value: null
461
+ torchscript:
462
+ value: false
463
+ tpu_metrics_debug:
464
+ value: false
465
+ tpu_num_cores:
466
+ value: null
467
+ transformers_version:
468
+ value: 4.47.0
469
+ type_vocab_size:
470
+ value: 2
471
+ typical_p:
472
+ value: 1
473
+ use_bfloat16:
474
+ value: false
475
+ use_cache:
476
+ value: true
477
+ use_cpu:
478
+ value: false
479
+ use_ipex:
480
+ value: false
481
+ use_legacy_prediction_loop:
482
+ value: false
483
+ use_liger_kernel:
484
+ value: false
485
+ use_mps_device:
486
+ value: false
487
+ vocab_size:
488
+ value: 29794
489
+ warmup_ratio:
490
+ value: 0
491
+ warmup_steps:
492
+ value: 200
493
+ weight_decay:
494
+ value: 0.02
wandb/run-20241211_042633-wah4vux1/files/output.log ADDED
File without changes
wandb/run-20241211_042633-wah4vux1/files/requirements.txt ADDED
@@ -0,0 +1,578 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ multiprocess==0.70.16
2
+ accelerate==1.2.0
3
+ fsspec==2024.9.0
4
+ wandb==0.19.0
5
+ transformers==4.47.0
6
+ evaluate==0.4.3
7
+ dill==0.3.8
8
+ tokenizers==0.21.0
9
+ datasets==3.2.0
10
+ xxhash==3.5.0
11
+ google-colab==1.0.0
12
+ termcolor==2.5.0
13
+ numexpr==2.10.2
14
+ duckdb==1.1.3
15
+ slicer==0.0.8
16
+ pandas-stubs==2.2.2.240909
17
+ yarl==1.18.3
18
+ jupyter-client==6.1.12
19
+ umf==0.9.1
20
+ aiohappyeyeballs==2.4.4
21
+ tensorflow-io-gcs-filesystem==0.37.1
22
+ nibabel==5.3.2
23
+ google-api-python-client==2.151.0
24
+ gensim==4.3.3
25
+ plotnine==0.14.3
26
+ stanio==0.5.1
27
+ pyogrio==0.10.0
28
+ nvidia-curand-cu12==10.3.7.77
29
+ Bottleneck==1.4.2
30
+ openpyxl==3.1.5
31
+ audioread==3.0.1
32
+ libcudf-cu12==24.10.1
33
+ networkx==3.4.2
34
+ cupy-cuda12x==12.2.0
35
+ nx-cugraph-cu12==24.10.0
36
+ aiosignal==1.3.1
37
+ StrEnum==0.4.15
38
+ jsonpickle==4.0.0
39
+ google-cloud-firestore==2.19.0
40
+ scipy==1.13.1
41
+ colour==0.1.5
42
+ pycparser==2.22
43
+ cmake==3.30.5
44
+ fastdownload==0.0.7
45
+ google-cloud-functions==1.18.1
46
+ imagesize==1.4.1
47
+ imageio==2.36.1
48
+ frozendict==2.4.6
49
+ ndindex==1.9.2
50
+ html5lib==1.1
51
+ python-louvain==0.16
52
+ Cython==3.0.11
53
+ fastai==2.7.18
54
+ cuda-python==12.2.1
55
+ parsy==2.1
56
+ smmap==5.0.1
57
+ weasel==0.4.1
58
+ pyasn1==0.6.1
59
+ matplotlib==3.8.0
60
+ immutabledict==4.2.1
61
+ dopamine_rl==4.0.9
62
+ uc-micro-py==1.0.3
63
+ websocket-client==1.8.0
64
+ anyio==3.7.1
65
+ msgpack==1.1.0
66
+ multitasking==0.0.11
67
+ albucore==0.0.19
68
+ scikit-image==0.24.0
69
+ setproctitle==1.3.4
70
+ marisa-trie==1.2.1
71
+ nvidia-cuda-nvcc-cu12==12.6.85
72
+ sphinxcontrib-qthelp==2.0.0
73
+ tables==3.10.1
74
+ kaggle==1.6.17
75
+ google-cloud-core==2.4.1
76
+ soupsieve==2.6
77
+ opencv-contrib-python==4.10.0.84
78
+ python-box==7.2.0
79
+ fonttools==4.55.1
80
+ Markdown==3.7
81
+ googleapis-common-protos==1.66.0
82
+ tensorflow-metadata==1.13.1
83
+ google-auth-httplib2==0.2.0
84
+ autograd==1.7.0
85
+ pydantic_core==2.27.1
86
+ pexpect==4.9.0
87
+ tweepy==4.14.0
88
+ pyshp==2.3.1
89
+ sklearn-pandas==2.2.0
90
+ toml==0.10.2
91
+ pydotplus==2.0.2
92
+ click==8.1.7
93
+ optree==0.13.1
94
+ catalogue==2.0.10
95
+ google-api-core==2.19.2
96
+ en-core-web-sm==3.7.1
97
+ tensorflow-datasets==4.9.7
98
+ pynvjitlink-cu12==0.4.0
99
+ pytest==8.3.4
100
+ portpicker==1.5.2
101
+ dm-tree==0.1.8
102
+ h5py==3.12.1
103
+ pandas-gbq==0.24.0
104
+ qdldl==0.1.7.post4
105
+ partd==1.4.2
106
+ wrapt==1.17.0
107
+ blis==0.7.11
108
+ textblob==0.17.1
109
+ jupyterlab_widgets==3.0.13
110
+ filelock==3.16.1
111
+ oauth2client==4.1.3
112
+ iniconfig==2.0.0
113
+ notebook==6.5.5
114
+ pydot==3.0.3
115
+ tomli==2.2.1
116
+ py4j==0.10.9.7
117
+ langchain-text-splitters==0.3.2
118
+ typer==0.15.0
119
+ contourpy==1.3.1
120
+ aiohttp==3.11.9
121
+ langchain==0.3.9
122
+ datascience==0.17.6
123
+ etuples==0.3.9
124
+ six==1.16.0
125
+ tornado==6.3.3
126
+ torchaudio==2.5.1+cu121
127
+ httpcore==1.0.7
128
+ importlib_metadata==8.5.0
129
+ holoviews==1.20.0
130
+ google-ai-generativelanguage==0.6.10
131
+ osqp==0.6.7.post3
132
+ pymc==5.18.2
133
+ google-pasta==0.2.0
134
+ py-cpuinfo==9.0.0
135
+ colorcet==3.1.0
136
+ blosc2==2.7.1
137
+ nvidia-cuda-cupti-cu12==12.6.80
138
+ colorlover==0.3.0
139
+ pymystem3==0.2.0
140
+ MarkupSafe==3.0.2
141
+ astunparse==1.6.3
142
+ google-cloud-bigquery==3.25.0
143
+ opencv-python==4.10.0.84
144
+ Flask==3.0.3
145
+ fastjsonschema==2.21.1
146
+ pylibcugraph-cu12==24.10.0
147
+ google-cloud-datastore==2.20.1
148
+ jaxlib==0.4.33
149
+ rich==13.9.4
150
+ widgetsnbextension==3.6.10
151
+ vega-datasets==0.9.0
152
+ natsort==8.4.0
153
+ blinker==1.9.0
154
+ statsmodels==0.14.4
155
+ sniffio==1.3.1
156
+ cymem==2.0.10
157
+ threadpoolctl==3.5.0
158
+ proto-plus==1.25.0
159
+ httpimport==1.4.0
160
+ matplotlib-inline==0.1.7
161
+ ratelim==0.1.6
162
+ python-utils==3.9.1
163
+ prompt_toolkit==3.0.48
164
+ gspread==6.0.2
165
+ google-cloud-storage==2.8.0
166
+ PyDrive==1.3.1
167
+ tensorflow-probability==0.24.0
168
+ xarray==2024.10.0
169
+ eerepr==0.0.4
170
+ murmurhash==1.0.11
171
+ pylibcudf-cu12==24.10.1
172
+ music21==9.3.0
173
+ geographiclib==2.0
174
+ smart-open==7.0.5
175
+ pygit2==1.16.0
176
+ more-itertools==10.5.0
177
+ cycler==0.12.1
178
+ idna==3.10
179
+ h11==0.14.0
180
+ ipython-genutils==0.2.0
181
+ referencing==0.35.1
182
+ GitPython==3.1.43
183
+ sentencepiece==0.2.0
184
+ CacheControl==0.14.1
185
+ tf_keras==2.17.0
186
+ annotated-types==0.7.0
187
+ ecos==2.0.14
188
+ geopandas==1.0.1
189
+ jieba==0.42.1
190
+ shapely==2.0.6
191
+ ipyparallel==8.8.0
192
+ pyOpenSSL==24.2.1
193
+ pyarrow==17.0.0
194
+ opencv-python-headless==4.10.0.84
195
+ openai==1.54.5
196
+ sentry-sdk==2.19.0
197
+ pyarrow-hotfix==0.6
198
+ gitdb==4.0.11
199
+ prophet==1.1.6
200
+ ipython-sql==0.5.0
201
+ tf-slim==1.1.0
202
+ PySocks==1.7.1
203
+ imbalanced-learn==0.12.4
204
+ mlxtend==0.23.3
205
+ mkl==2025.0.1
206
+ itsdangerous==2.2.0
207
+ gast==0.6.0
208
+ torchsummary==1.5.1
209
+ urllib3==2.2.3
210
+ earthengine-api==1.2.0
211
+ jupyter-server==1.24.0
212
+ webencodings==0.5.1
213
+ seaborn==0.13.2
214
+ SQLAlchemy==2.0.36
215
+ pyerfa==2.0.1.5
216
+ xarray-einstats==0.8.0
217
+ jsonschema==4.23.0
218
+ parso==0.8.4
219
+ tensorflow-hub==0.16.1
220
+ ipyfilechooser==0.6.0
221
+ packaging==24.2
222
+ param==2.1.1
223
+ et_xmlfile==2.0.0
224
+ kagglehub==0.3.4
225
+ yfinance==0.2.50
226
+ spacy==3.7.5
227
+ imgaug==0.4.0
228
+ sqlparse==0.5.2
229
+ certifi==2024.8.30
230
+ plotly==5.24.1
231
+ pandas-datareader==0.10.0
232
+ charset-normalizer==3.4.0
233
+ cloudpathlib==0.20.0
234
+ decorator==4.4.2
235
+ prometheus_client==0.21.1
236
+ google-auth==2.27.0
237
+ pyzmq==24.0.1
238
+ tzdata==2024.2
239
+ scs==3.2.7
240
+ types-pytz==2024.2.0.20241003
241
+ db-dtypes==1.3.1
242
+ chex==0.1.87
243
+ spacy-loggers==1.0.5
244
+ peewee==3.17.8
245
+ nvidia-cuda-runtime-cu12==12.6.77
246
+ ibis-framework==9.2.0
247
+ wasabi==1.1.3
248
+ python-dateutil==2.8.2
249
+ google-generativeai==0.8.3
250
+ google-cloud-aiplatform==1.73.0
251
+ jupyterlab_pygments==0.3.0
252
+ google-cloud-translate==3.17.0
253
+ exceptiongroup==1.2.2
254
+ intel-openmp==2025.0.3
255
+ nvidia-cufft-cu12==11.3.0.4
256
+ timm==1.0.12
257
+ python-slugify==8.0.4
258
+ cryptography==43.0.3
259
+ multipledispatch==1.0.0
260
+ google-crc32c==1.6.0
261
+ rsa==4.9
262
+ cffi==1.17.1
263
+ holidays==0.62
264
+ tinycss2==1.4.0
265
+ google-cloud-iam==2.16.1
266
+ google-cloud-resource-manager==1.13.1
267
+ typeguard==4.4.1
268
+ xlrd==2.0.1
269
+ google==2.0.3
270
+ pandocfilters==1.5.1
271
+ promise==2.3
272
+ progressbar2==4.5.0
273
+ wheel==0.45.1
274
+ google-resumable-media==2.7.2
275
+ einops==0.8.0
276
+ defusedxml==0.7.1
277
+ llvmlite==0.43.0
278
+ attrs==24.2.0
279
+ preshed==3.0.9
280
+ geocoder==1.38.1
281
+ firebase-admin==6.5.0
282
+ ipython==7.34.0
283
+ cvxopt==1.3.2
284
+ greenlet==3.1.1
285
+ confection==0.1.5
286
+ keras==3.5.0
287
+ pandas==2.2.2
288
+ graphviz==0.20.3
289
+ tenacity==9.0.0
290
+ future==1.0.0
291
+ inflect==7.4.0
292
+ langsmith==0.1.147
293
+ babel==2.16.0
294
+ nvidia-cusparse-cu12==12.5.4.2
295
+ importlib_resources==6.4.5
296
+ jiter==0.8.0
297
+ sphinxcontrib-htmlhelp==2.1.0
298
+ moviepy==1.0.3
299
+ typing_extensions==4.12.2
300
+ tcmlib==1.2.0
301
+ snowballstemmer==2.2.0
302
+ scikit-learn==1.5.2
303
+ jsonpatch==1.33
304
+ notebook_shim==0.2.4
305
+ mizani==0.13.0
306
+ simple-parsing==0.1.6
307
+ humanize==4.11.0
308
+ hyperopt==0.2.7
309
+ frozenlist==1.5.0
310
+ uritemplate==4.1.1
311
+ pydata-google-auth==1.9.0
312
+ pyperclip==1.9.0
313
+ lazy_loader==0.4
314
+ bigframes==1.27.0
315
+ numba==0.60.0
316
+ pytensor==2.26.4
317
+ geemap==0.35.1
318
+ polars==1.9.0
319
+ orjson==3.10.12
320
+ fastrlock==0.8.2
321
+ mistune==3.0.2
322
+ sphinxcontrib-applehelp==2.0.0
323
+ mpmath==1.3.0
324
+ mdurl==0.1.2
325
+ async-timeout==4.0.3
326
+ jax==0.4.33
327
+ panel==1.5.4
328
+ patsy==1.0.1
329
+ oauthlib==3.2.2
330
+ librosa==0.10.2.post1
331
+ imutils==0.5.4
332
+ xyzservices==2024.9.0
333
+ Pygments==2.18.0
334
+ sphinxcontrib-jsmath==1.0.1
335
+ pyasn1_modules==0.4.1
336
+ pyproj==3.7.0
337
+ Pyomo==6.8.2
338
+ nvidia-cublas-cu12==12.6.4.1
339
+ numpy==1.26.4
340
+ google-cloud-bigquery-storage==2.27.0
341
+ soxr==0.5.0.post1
342
+ bleach==6.2.0
343
+ sympy==1.13.1
344
+ dask==2024.10.0
345
+ miniKanren==1.0.3
346
+ pathlib==1.0.1
347
+ torch==2.5.1+cu121
348
+ distro==1.9.0
349
+ gym==0.25.2
350
+ lxml==5.3.0
351
+ folium==0.18.0
352
+ altair==4.2.2
353
+ tifffile==2024.9.20
354
+ diffusers==0.31.0
355
+ requests-oauthlib==1.3.1
356
+ google-cloud-language==2.15.1
357
+ ptyprocess==0.7.0
358
+ tensorboard-data-server==0.7.2
359
+ locket==1.0.0
360
+ intel-cmplr-lib-ur==2025.0.3
361
+ lightgbm==4.5.0
362
+ flax==0.8.5
363
+ mdit-py-plugins==0.4.2
364
+ pygame==2.6.1
365
+ pytz==2024.2
366
+ pyparsing==3.2.0
367
+ alabaster==1.0.0
368
+ jupyter_core==5.7.2
369
+ cloudpickle==3.1.0
370
+ logical-unification==0.4.6
371
+ traittypes==0.2.1
372
+ terminado==0.18.1
373
+ beautifulsoup4==4.12.3
374
+ google-cloud-bigquery-connection==1.16.1
375
+ scooby==0.10.0
376
+ google-auth-oauthlib==1.2.1
377
+ protobuf==4.25.5
378
+ astropy==6.1.7
379
+ GDAL==3.6.4
380
+ nbformat==5.10.4
381
+ safetensors==0.4.5
382
+ tzlocal==5.2
383
+ chardet==5.2.0
384
+ psutil==5.9.5
385
+ jsonpointer==3.0.0
386
+ psycopg2==2.9.10
387
+ cachetools==5.5.0
388
+ pyspark==3.5.3
389
+ google-cloud-pubsub==2.27.1
390
+ pillow==11.0.0
391
+ pydantic==2.10.3
392
+ langcodes==3.5.0
393
+ jsonschema-specifications==2024.10.1
394
+ markdown-it-py==3.0.0
395
+ nltk==3.9.1
396
+ nbconvert==7.16.4
397
+ pooch==1.8.2
398
+ huggingface-hub==0.26.3
399
+ tbb==2022.0.0
400
+ ipywidgets==7.7.1
401
+ matplotlib-venn==1.1.1
402
+ rmm-cu12==24.10.0
403
+ sphinxcontrib-serializinghtml==2.0.0
404
+ branca==0.8.0
405
+ gin-config==0.5.0
406
+ bqplot==0.12.43
407
+ opentelemetry-sdk==1.28.2
408
+ community==1.0.0b1
409
+ soundfile==0.12.1
410
+ shap==0.46.0
411
+ joblib==1.4.2
412
+ entrypoints==0.4
413
+ docstring_parser==0.16
414
+ multidict==6.1.0
415
+ grpcio-status==1.62.3
416
+ wcwidth==0.2.13
417
+ geopy==2.4.1
418
+ prettytable==3.12.0
419
+ propcache==0.2.1
420
+ ml-dtypes==0.4.1
421
+ atpublic==4.1.0
422
+ stringzilla==3.11.0
423
+ libclang==18.1.1
424
+ arviz==0.20.0
425
+ requests==2.32.3
426
+ fastprogress==1.0.3
427
+ dlib==19.24.2
428
+ platformdirs==4.3.6
429
+ tabulate==0.9.0
430
+ cufflinks==0.17.3
431
+ spacy-legacy==3.0.12
432
+ grpcio==1.68.1
433
+ cvxpy==1.5.4
434
+ flatbuffers==24.3.25
435
+ pluggy==1.5.0
436
+ regex==2024.9.11
437
+ clarabel==0.9.0
438
+ httplib2==0.22.0
439
+ language_data==1.3.0
440
+ backcall==0.2.0
441
+ nbclient==0.10.1
442
+ ipyleaflet==0.19.2
443
+ PyJWT==2.10.1
444
+ toolz==0.12.1
445
+ opt_einsum==3.4.0
446
+ nvtx==0.2.10
447
+ nvidia-nccl-cu12==2.23.4
448
+ debugpy==1.8.0
449
+ sphinxcontrib-devhelp==2.0.0
450
+ jax-cuda12-pjrt==0.4.33
451
+ missingno==0.5.2
452
+ proglog==0.1.10
453
+ nest-asyncio==1.6.0
454
+ nvidia-cusolver-cu12==11.7.1.2
455
+ gspread-dataframe==3.3.1
456
+ bigquery-magics==0.4.0
457
+ jellyfish==1.1.2
458
+ text-unidecode==1.3
459
+ Deprecated==1.2.15
460
+ srsly==2.4.8
461
+ ipytree==0.2.2
462
+ gcsfs==2024.10.0
463
+ ipyevents==2.0.2
464
+ docker-pycreds==0.4.0
465
+ glob2==0.7
466
+ wordcloud==1.9.4
467
+ traitlets==5.7.1
468
+ nbclassic==1.1.0
469
+ nvidia-cudnn-cu12==9.6.0.74
470
+ easydict==1.13
471
+ httpx==0.28.0
472
+ Jinja2==3.1.4
473
+ argon2-cffi==23.1.0
474
+ linkify-it-py==2.0.3
475
+ optax==0.2.4
476
+ cons==0.4.6
477
+ requests-toolbelt==1.0.0
478
+ editdistance==0.8.1
479
+ jupyter-leaflet==0.19.2
480
+ webcolors==24.11.1
481
+ jax-cuda12-plugin==0.4.33
482
+ albumentations==1.4.20
483
+ PyOpenGL==3.1.7
484
+ tqdm==4.66.6
485
+ googledrivedownloader==0.4
486
+ peft==0.13.2
487
+ absl-py==1.4.0
488
+ eval_type_backport==0.2.0
489
+ opentelemetry-api==1.28.2
490
+ Send2Trash==1.8.3
491
+ gdown==5.2.0
492
+ google-cloud-bigtable==2.27.0
493
+ grpc-google-iam-v1==0.13.1
494
+ rpds-py==0.22.3
495
+ xgboost==2.1.3
496
+ pyviz_comms==3.0.3
497
+ ipykernel==5.5.6
498
+ gym-notices==0.0.8
499
+ opentelemetry-semantic-conventions==0.49b2
500
+ fastcore==1.7.22
501
+ torchvision==0.20.1+cu121
502
+ Sphinx==8.1.3
503
+ ply==3.11
504
+ tensorboard==2.17.1
505
+ namex==0.0.8
506
+ astropy-iers-data==0.2024.12.2.0.35.34
507
+ rpy2==3.4.2
508
+ nvidia-nvjitlink-cu12==12.6.85
509
+ array_record==0.5.1
510
+ PyYAML==6.0.2
511
+ tensorflow==2.17.1
512
+ yellowbrick==1.5
513
+ pylibraft-cu12==24.10.0
514
+ orbax-checkpoint==0.6.4
515
+ Werkzeug==3.1.3
516
+ jupyter-console==6.1.0
517
+ thinc==8.2.5
518
+ etils==1.11.0
519
+ langchain-core==0.3.21
520
+ cmdstanpy==1.2.4
521
+ h5netcdf==1.4.1
522
+ cudf-cu12==24.10.1
523
+ zipp==3.21.0
524
+ PyDrive2==1.21.3
525
+ kiwisolver==1.4.7
526
+ pickleshare==0.7.5
527
+ bokeh==3.6.2
528
+ shellingham==1.5.4
529
+ imageio-ffmpeg==0.5.1
530
+ pycocotools==2.0.8
531
+ sqlglot==25.1.0
532
+ tensorstore==0.1.69
533
+ argon2-cffi-bindings==21.2.0
534
+ sentence-transformers==3.2.1
535
+ docutils==0.21.2
536
+ python-apt==0.0.0
537
+ setuptools==75.1.0
538
+ pip==24.1.2
539
+ requirements-parser==0.9.0
540
+ types-setuptools==75.6.0.20241126
541
+ importlib-metadata==4.6.4
542
+ blinker==1.4
543
+ PyJWT==2.3.0
544
+ PyGObject==3.42.1
545
+ cryptography==3.4.8
546
+ pyparsing==2.4.7
547
+ wadllib==1.3.6
548
+ more-itertools==8.10.0
549
+ zipp==1.0.0
550
+ distro==1.7.0
551
+ six==1.16.0
552
+ python-apt==2.4.0+ubuntu4
553
+ jeepney==0.7.1
554
+ SecretStorage==3.3.1
555
+ oauthlib==3.2.0
556
+ lazr.restfulclient==0.14.4
557
+ httplib2==0.20.2
558
+ launchpadlib==1.10.16
559
+ keyring==23.5.0
560
+ lazr.uri==1.0.6
561
+ dbus-python==1.2.18
562
+ inflect==7.3.1
563
+ wheel==0.43.0
564
+ packaging==24.1
565
+ importlib_metadata==8.0.0
566
+ more-itertools==10.3.0
567
+ platformdirs==4.2.2
568
+ backports.tarfile==1.2.0
569
+ typing_extensions==4.12.2
570
+ jaraco.functools==4.0.1
571
+ jaraco.collections==5.1.0
572
+ tomli==2.0.1
573
+ importlib_resources==6.4.0
574
+ jaraco.text==3.12.1
575
+ jaraco.context==5.3.0
576
+ typeguard==4.3.0
577
+ autocommand==2.2.2
578
+ zipp==3.19.2
wandb/run-20241211_042633-wah4vux1/files/wandb-metadata.json ADDED
@@ -0,0 +1,38 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "os": "Linux-6.1.85+-x86_64-with-glibc2.35",
3
+ "python": "CPython 3.10.12",
4
+ "startedAt": "2024-12-11T04:26:33.646888Z",
5
+ "program": "m2-congretimbau-focal_loss-sempartido.ipynb",
6
+ "email": "[email protected]",
7
+ "root": "/content",
8
+ "host": "fd55a770be24",
9
+ "username": "root",
10
+ "executable": "/usr/bin/python3",
11
+ "colab": "https://colab.research.google.com/notebook#fileId=1a_b6x6zKkekdeqrCzP83ovWk7MhdV3DO",
12
+ "cpu_count": 6,
13
+ "cpu_count_logical": 12,
14
+ "gpu": "NVIDIA A100-SXM4-40GB",
15
+ "gpu_count": 1,
16
+ "disk": {
17
+ "/": {
18
+ "total": "120942624768",
19
+ "used": "36614164480"
20
+ }
21
+ },
22
+ "memory": {
23
+ "total": "89632796672"
24
+ },
25
+ "cpu": {
26
+ "count": 6,
27
+ "countLogical": 12
28
+ },
29
+ "gpu_nvidia": [
30
+ {
31
+ "name": "NVIDIA A100-SXM4-40GB",
32
+ "memoryTotal": "42949672960",
33
+ "cudaCores": 6912,
34
+ "architecture": "Ampere"
35
+ }
36
+ ],
37
+ "cudaVersion": "12.2"
38
+ }
wandb/run-20241211_042633-wah4vux1/files/wandb-summary.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"_wandb":{"runtime":8}}
wandb/run-20241211_042633-wah4vux1/logs/debug-core.log ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {"time":"2024-12-11T04:26:32.996515483Z","level":"INFO","msg":"started logging, with flags","port-filename":"/tmp/tmp1ht12chm/port-464.txt","pid":464,"debug":false,"disable-analytics":false}
2
+ {"time":"2024-12-11T04:26:32.99654412Z","level":"INFO","msg":"FeatureState","shutdownOnParentExitEnabled":false}
3
+ {"time":"2024-12-11T04:26:33.443154063Z","level":"INFO","msg":"Will exit if parent process dies.","ppid":464}
4
+ {"time":"2024-12-11T04:26:33.443156332Z","level":"INFO","msg":"server is running","addr":{"IP":"127.0.0.1","Port":40707,"Zone":""}}
5
+ {"time":"2024-12-11T04:26:33.585385719Z","level":"INFO","msg":"connection: ManageConnectionData: new connection created","id":"127.0.0.1:53892"}
6
+ {"time":"2024-12-11T04:26:33.64885795Z","level":"INFO","msg":"handleInformInit: received","streamId":"wah4vux1","id":"127.0.0.1:53892"}
7
+ {"time":"2024-12-11T04:26:33.751779758Z","level":"INFO","msg":"handleInformInit: stream started","streamId":"wah4vux1","id":"127.0.0.1:53892"}
8
+ {"time":"2024-12-11T04:28:38.430422355Z","level":"INFO","msg":"handleInformTeardown: server teardown initiated","id":"127.0.0.1:53892"}
9
+ {"time":"2024-12-11T04:28:38.430524985Z","level":"INFO","msg":"connection: Close: initiating connection closure","id":"127.0.0.1:53892"}
10
+ {"time":"2024-12-11T04:28:38.430642627Z","level":"INFO","msg":"connection: Close: connection successfully closed","id":"127.0.0.1:53892"}
11
+ {"time":"2024-12-11T04:28:38.430541905Z","level":"INFO","msg":"server is shutting down"}
12
+ {"time":"2024-12-11T04:28:38.669777272Z","level":"INFO","msg":"handleInformTeardown: server shutdown complete","id":"127.0.0.1:53892"}
13
+ {"time":"2024-12-11T04:28:38.669791299Z","level":"INFO","msg":"connection: ManageConnectionData: connection closed","id":"127.0.0.1:53892"}
14
+ {"time":"2024-12-11T04:28:38.669799437Z","level":"INFO","msg":"server is closed"}
wandb/run-20241211_042633-wah4vux1/logs/debug-internal.log ADDED
@@ -0,0 +1,29 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {"time":"2024-12-11T04:26:33.649024782Z","level":"INFO","msg":"using version","core version":"0.19.0"}
2
+ {"time":"2024-12-11T04:26:33.649049094Z","level":"INFO","msg":"created symlink","path":"/content/wandb/run-20241211_042633-wah4vux1/logs/debug-core.log"}
3
+ {"time":"2024-12-11T04:26:33.751742244Z","level":"INFO","msg":"created new stream","id":"wah4vux1"}
4
+ {"time":"2024-12-11T04:26:33.75177344Z","level":"INFO","msg":"stream: started","id":"wah4vux1"}
5
+ {"time":"2024-12-11T04:26:33.751819655Z","level":"INFO","msg":"writer: Do: started","stream_id":"wah4vux1"}
6
+ {"time":"2024-12-11T04:26:33.751846429Z","level":"INFO","msg":"sender: started","stream_id":"wah4vux1"}
7
+ {"time":"2024-12-11T04:26:33.751889938Z","level":"INFO","msg":"handler: started","stream_id":"wah4vux1"}
8
+ {"time":"2024-12-11T04:26:37.433996195Z","level":"INFO","msg":"Starting system monitor"}
9
+ {"time":"2024-12-11T04:26:37.918501173Z","level":"INFO","msg":"Pausing system monitor"}
10
+ {"time":"2024-12-11T04:26:37.918542387Z","level":"INFO","msg":"Resuming system monitor"}
11
+ {"time":"2024-12-11T04:26:37.918551346Z","level":"INFO","msg":"Pausing system monitor"}
12
+ {"time":"2024-12-11T04:26:37.918557135Z","level":"INFO","msg":"Resuming system monitor"}
13
+ {"time":"2024-12-11T04:26:38.209467967Z","level":"INFO","msg":"Pausing system monitor"}
14
+ {"time":"2024-12-11T04:28:13.514944092Z","level":"INFO","msg":"Resuming system monitor"}
15
+ {"time":"2024-12-11T04:28:13.517111071Z","level":"INFO","msg":"Pausing system monitor"}
16
+ {"time":"2024-12-11T04:28:19.599319Z","level":"INFO","msg":"Resuming system monitor"}
17
+ {"time":"2024-12-11T04:28:20.269551048Z","level":"INFO","msg":"Pausing system monitor"}
18
+ {"time":"2024-12-11T04:28:20.274021048Z","level":"INFO","msg":"Resuming system monitor"}
19
+ {"time":"2024-12-11T04:28:22.701050286Z","level":"INFO","msg":"Pausing system monitor"}
20
+ {"time":"2024-12-11T04:28:27.079815946Z","level":"INFO","msg":"Resuming system monitor"}
21
+ {"time":"2024-12-11T04:28:27.602762512Z","level":"INFO","msg":"Pausing system monitor"}
22
+ {"time":"2024-12-11T04:28:38.430506078Z","level":"INFO","msg":"stream: closing","id":"wah4vux1"}
23
+ {"time":"2024-12-11T04:28:38.430543738Z","level":"INFO","msg":"Stopping system monitor"}
24
+ {"time":"2024-12-11T04:28:38.431098624Z","level":"INFO","msg":"Stopped system monitor"}
25
+ {"time":"2024-12-11T04:28:38.624699235Z","level":"INFO","msg":"fileTransfer: Close: file transfer manager closed"}
26
+ {"time":"2024-12-11T04:28:38.669629724Z","level":"INFO","msg":"handler: closed","stream_id":"wah4vux1"}
27
+ {"time":"2024-12-11T04:28:38.669657538Z","level":"INFO","msg":"sender: closed","stream_id":"wah4vux1"}
28
+ {"time":"2024-12-11T04:28:38.669655507Z","level":"INFO","msg":"writer: Close: closed","stream_id":"wah4vux1"}
29
+ {"time":"2024-12-11T04:28:38.669727276Z","level":"INFO","msg":"stream: closed","id":"wah4vux1"}
wandb/run-20241211_042633-wah4vux1/logs/debug.log ADDED
@@ -0,0 +1,48 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2024-12-11 04:26:33,635 INFO MainThread:464 [wandb_setup.py:_flush():68] Current SDK version is 0.19.0
2
+ 2024-12-11 04:26:33,635 INFO MainThread:464 [wandb_setup.py:_flush():68] Configure stats pid to 464
3
+ 2024-12-11 04:26:33,635 INFO MainThread:464 [wandb_setup.py:_flush():68] Loading settings from /root/.config/wandb/settings
4
+ 2024-12-11 04:26:33,635 INFO MainThread:464 [wandb_setup.py:_flush():68] Loading settings from /content/wandb/settings
5
+ 2024-12-11 04:26:33,635 INFO MainThread:464 [wandb_setup.py:_flush():68] Loading settings from environment variables
6
+ 2024-12-11 04:26:33,635 INFO MainThread:464 [wandb_init.py:_log_setup():528] Logging user logs to /content/wandb/run-20241211_042633-wah4vux1/logs/debug.log
7
+ 2024-12-11 04:26:33,635 INFO MainThread:464 [wandb_init.py:_log_setup():529] Logging internal logs to /content/wandb/run-20241211_042633-wah4vux1/logs/debug-internal.log
8
+ 2024-12-11 04:26:33,636 INFO MainThread:464 [wandb_init.py:_jupyter_setup():474] configuring jupyter hooks <wandb.sdk.wandb_init._WandbInit object at 0x7bbc23b69150>
9
+ 2024-12-11 04:26:33,636 INFO MainThread:464 [wandb_init.py:init():639] calling init triggers
10
+ 2024-12-11 04:26:33,636 INFO MainThread:464 [wandb_init.py:init():645] wandb.init called with sweep_config: {}
11
+ config: {}
12
+ 2024-12-11 04:26:33,636 INFO MainThread:464 [wandb_init.py:init():688] starting backend
13
+ 2024-12-11 04:26:33,636 INFO MainThread:464 [wandb_init.py:init():692] sending inform_init request
14
+ 2024-12-11 04:26:33,646 INFO MainThread:464 [backend.py:_multiprocessing_setup():104] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
15
+ 2024-12-11 04:26:33,646 INFO MainThread:464 [wandb_init.py:init():705] backend started and connected
16
+ 2024-12-11 04:26:33,654 INFO MainThread:464 [wandb_run.py:_label_probe_notebook():1327] probe notebook
17
+ 2024-12-11 04:26:37,354 INFO MainThread:464 [wandb_init.py:init():798] updated telemetry
18
+ 2024-12-11 04:26:37,354 INFO MainThread:464 [wandb_init.py:init():830] communicating run to backend with 90.0 second timeout
19
+ 2024-12-11 04:26:37,429 INFO MainThread:464 [wandb_init.py:init():882] starting run threads in backend
20
+ 2024-12-11 04:26:37,879 INFO MainThread:464 [wandb_run.py:_console_start():2443] atexit reg
21
+ 2024-12-11 04:26:37,879 INFO MainThread:464 [wandb_run.py:_redirect():2293] redirect: wrap_raw
22
+ 2024-12-11 04:26:37,879 INFO MainThread:464 [wandb_run.py:_redirect():2358] Wrapping output streams.
23
+ 2024-12-11 04:26:37,879 INFO MainThread:464 [wandb_run.py:_redirect():2383] Redirects installed.
24
+ 2024-12-11 04:26:37,881 INFO MainThread:464 [wandb_init.py:init():925] run started, returning control to user process
25
+ 2024-12-11 04:26:37,884 INFO MainThread:464 [jupyter.py:save_ipynb():387] not saving jupyter notebook
26
+ 2024-12-11 04:26:37,884 INFO MainThread:464 [wandb_init.py:_pause_backend():439] pausing backend
27
+ 2024-12-11 04:26:37,888 INFO MainThread:464 [wandb_init.py:_resume_backend():444] resuming backend
28
+ 2024-12-11 04:26:37,889 INFO MainThread:464 [jupyter.py:save_ipynb():387] not saving jupyter notebook
29
+ 2024-12-11 04:26:37,889 INFO MainThread:464 [wandb_init.py:_pause_backend():439] pausing backend
30
+ 2024-12-11 04:26:37,896 INFO MainThread:464 [wandb_init.py:_resume_backend():444] resuming backend
31
+ 2024-12-11 04:26:38,208 INFO MainThread:464 [jupyter.py:save_ipynb():387] not saving jupyter notebook
32
+ 2024-12-11 04:26:38,209 INFO MainThread:464 [wandb_init.py:_pause_backend():439] pausing backend
33
+ 2024-12-11 04:28:13,514 INFO MainThread:464 [wandb_init.py:_resume_backend():444] resuming backend
34
+ 2024-12-11 04:28:13,516 INFO MainThread:464 [jupyter.py:save_ipynb():387] not saving jupyter notebook
35
+ 2024-12-11 04:28:13,516 INFO MainThread:464 [wandb_init.py:_pause_backend():439] pausing backend
36
+ 2024-12-11 04:28:19,598 INFO MainThread:464 [wandb_init.py:_resume_backend():444] resuming backend
37
+ 2024-12-11 04:28:20,269 INFO MainThread:464 [jupyter.py:save_ipynb():387] not saving jupyter notebook
38
+ 2024-12-11 04:28:20,269 INFO MainThread:464 [wandb_init.py:_pause_backend():439] pausing backend
39
+ 2024-12-11 04:28:20,273 INFO MainThread:464 [wandb_init.py:_resume_backend():444] resuming backend
40
+ 2024-12-11 04:28:20,618 INFO MainThread:464 [wandb_run.py:_config_callback():1375] config_cb None None {'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'float32', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': True, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['BertForMaskedLM'], 'finetuning_task': None, 'id2label': {0: '0', 1: '1'}, 'label2id': {'0': 0, '1': 1}, 'tokenizer_class': None, 'prefix': None, 'bos_token_id': None, 'pad_token_id': 0, 'eos_token_id': None, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'belisards/congretimbau', '_attn_implementation_autoset': True, 'transformers_version': '4.47.0', 'directionality': 'bidi', 'model_type': 'bert', 'output_past': True, 'pooler_fc_size': 768, 'pooler_num_attention_heads': 12, 'pooler_num_fc_layers': 3, 'pooler_size_per_head': 128, 'pooler_type': 'first_token_transform', 'vocab_size': 29794, 'hidden_size': 1024, 'num_hidden_layers': 24, 'num_attention_heads': 16, 'hidden_act': 'gelu', 'intermediate_size': 4096, 'hidden_dropout_prob': 0.05, 'attention_probs_dropout_prob': 0.05, 'max_position_embeddings': 512, 'type_vocab_size': 2, 'initializer_range': 0.02, 'layer_norm_eps': 1e-12, 'position_embedding_type': 'absolute', 'use_cache': True, 'classifier_dropout': None, 'output_dir': '.', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': True, 'do_predict': False, 'eval_strategy': 'epoch', 'prediction_loss_only': False, 'per_device_train_batch_size': 128, 'per_device_eval_batch_size': 128, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 1, 'eval_accumulation_steps': None, 'eval_delay': 0, 'torch_empty_cache_steps': None, 'learning_rate': 1e-05, 'weight_decay': 0.02, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 15, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 200, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': './runs/Dec11_04-28-19_fd55a770be24', 'logging_strategy': 'steps', 'logging_first_step': False, 'logging_steps': 10, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 48, 'save_total_limit': 1, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'restore_callback_states_from_checkpoint': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 5151, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': False, 'fp16': False, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': 24, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': 'm2-congretimbau_tunado', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': True, 'metric_for_best_model': 'f1', 'greater_is_better': True, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'adamw_torch', 'optim_args': None, 'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': None, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'include_for_metrics': [], 'eval_do_concat_batches': True, 'fp16_backend': 'auto', 'evaluation_strategy': None, 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': False, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None, 'optim_target_modules': None, 'batch_eval_metrics': False, 'eval_on_start': False, 'use_liger_kernel': False, 'eval_use_gather_object': False, 'average_tokens_across_devices': False}
41
+ 2024-12-11 04:28:20,622 INFO MainThread:464 [wandb_config.py:__setitem__():154] config set model/num_parameters = 334398466 - <bound method Run._config_callback of <wandb.sdk.wandb_run.Run object at 0x7bbc208d5ba0>>
42
+ 2024-12-11 04:28:20,622 INFO MainThread:464 [wandb_run.py:_config_callback():1375] config_cb model/num_parameters 334398466 None
43
+ 2024-12-11 04:28:22,699 INFO MainThread:464 [jupyter.py:save_ipynb():387] not saving jupyter notebook
44
+ 2024-12-11 04:28:22,700 INFO MainThread:464 [wandb_init.py:_pause_backend():439] pausing backend
45
+ 2024-12-11 04:28:27,079 INFO MainThread:464 [wandb_init.py:_resume_backend():444] resuming backend
46
+ 2024-12-11 04:28:27,602 INFO MainThread:464 [jupyter.py:save_ipynb():387] not saving jupyter notebook
47
+ 2024-12-11 04:28:27,602 INFO MainThread:464 [wandb_init.py:_pause_backend():439] pausing backend
48
+ 2024-12-11 04:28:38,430 WARNING MsgRouterThr:464 [router.py:message_loop():75] message_loop has been closed
wandb/run-20241211_042633-wah4vux1/run-wah4vux1.wandb ADDED
Binary file (9.64 kB). View file
 
wandb/run-20241211_042903-vt1hbdpt/files/config.yaml ADDED
@@ -0,0 +1,494 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ _attn_implementation_autoset:
2
+ value: true
3
+ _name_or_path:
4
+ value: belisards/congretimbau
5
+ _wandb:
6
+ value:
7
+ cli_version: 0.19.0
8
+ m:
9
+ - "1": train/global_step
10
+ "6":
11
+ - 3
12
+ "7": []
13
+ python_version: 3.10.12
14
+ t:
15
+ "1":
16
+ - 1
17
+ - 2
18
+ - 3
19
+ - 5
20
+ - 11
21
+ - 12
22
+ - 41
23
+ - 49
24
+ - 51
25
+ - 53
26
+ - 55
27
+ - 71
28
+ - 98
29
+ - 100
30
+ - 105
31
+ "2":
32
+ - 1
33
+ - 2
34
+ - 3
35
+ - 5
36
+ - 11
37
+ - 12
38
+ - 41
39
+ - 49
40
+ - 51
41
+ - 53
42
+ - 55
43
+ - 71
44
+ - 98
45
+ - 100
46
+ - 105
47
+ "3":
48
+ - 7
49
+ - 13
50
+ - 19
51
+ - 23
52
+ - 55
53
+ - 66
54
+ "4": 3.10.12
55
+ "5": 0.19.0
56
+ "6": 4.47.0
57
+ "8":
58
+ - 1
59
+ - 5
60
+ - 12
61
+ "9":
62
+ "1": transformers_trainer
63
+ "12": 0.19.0
64
+ "13": linux-x86_64
65
+ accelerator_config:
66
+ value:
67
+ dispatch_batches: null
68
+ even_batches: true
69
+ gradient_accumulation_kwargs: null
70
+ non_blocking: false
71
+ split_batches: false
72
+ use_seedable_sampler: true
73
+ adafactor:
74
+ value: false
75
+ adam_beta1:
76
+ value: 0.9
77
+ adam_beta2:
78
+ value: 0.999
79
+ adam_epsilon:
80
+ value: 1e-08
81
+ add_cross_attention:
82
+ value: false
83
+ architectures:
84
+ value:
85
+ - BertForMaskedLM
86
+ attention_probs_dropout_prob:
87
+ value: 0.05
88
+ auto_find_batch_size:
89
+ value: false
90
+ average_tokens_across_devices:
91
+ value: false
92
+ bad_words_ids:
93
+ value: null
94
+ batch_eval_metrics:
95
+ value: false
96
+ begin_suppress_tokens:
97
+ value: null
98
+ bf16:
99
+ value: false
100
+ bf16_full_eval:
101
+ value: false
102
+ bos_token_id:
103
+ value: null
104
+ chunk_size_feed_forward:
105
+ value: 0
106
+ classifier_dropout:
107
+ value: null
108
+ cross_attention_hidden_size:
109
+ value: null
110
+ data_seed:
111
+ value: null
112
+ dataloader_drop_last:
113
+ value: false
114
+ dataloader_num_workers:
115
+ value: 0
116
+ dataloader_persistent_workers:
117
+ value: false
118
+ dataloader_pin_memory:
119
+ value: true
120
+ dataloader_prefetch_factor:
121
+ value: null
122
+ ddp_backend:
123
+ value: null
124
+ ddp_broadcast_buffers:
125
+ value: null
126
+ ddp_bucket_cap_mb:
127
+ value: null
128
+ ddp_find_unused_parameters:
129
+ value: null
130
+ ddp_timeout:
131
+ value: 1800
132
+ debug:
133
+ value: []
134
+ decoder_start_token_id:
135
+ value: null
136
+ deepspeed:
137
+ value: null
138
+ directionality:
139
+ value: bidi
140
+ disable_tqdm:
141
+ value: false
142
+ dispatch_batches:
143
+ value: null
144
+ diversity_penalty:
145
+ value: 0
146
+ do_eval:
147
+ value: true
148
+ do_predict:
149
+ value: false
150
+ do_sample:
151
+ value: false
152
+ do_train:
153
+ value: false
154
+ early_stopping:
155
+ value: false
156
+ encoder_no_repeat_ngram_size:
157
+ value: 0
158
+ eos_token_id:
159
+ value: null
160
+ eval_accumulation_steps:
161
+ value: null
162
+ eval_delay:
163
+ value: 0
164
+ eval_do_concat_batches:
165
+ value: true
166
+ eval_on_start:
167
+ value: false
168
+ eval_steps:
169
+ value: 24
170
+ eval_strategy:
171
+ value: epoch
172
+ eval_use_gather_object:
173
+ value: false
174
+ evaluation_strategy:
175
+ value: null
176
+ exponential_decay_length_penalty:
177
+ value: null
178
+ finetuning_task:
179
+ value: null
180
+ forced_bos_token_id:
181
+ value: null
182
+ forced_eos_token_id:
183
+ value: null
184
+ fp16:
185
+ value: false
186
+ fp16_backend:
187
+ value: auto
188
+ fp16_full_eval:
189
+ value: false
190
+ fp16_opt_level:
191
+ value: O1
192
+ fsdp:
193
+ value: []
194
+ fsdp_config:
195
+ value:
196
+ min_num_params: 0
197
+ xla: false
198
+ xla_fsdp_grad_ckpt: false
199
+ xla_fsdp_v2: false
200
+ fsdp_min_num_params:
201
+ value: 0
202
+ fsdp_transformer_layer_cls_to_wrap:
203
+ value: null
204
+ full_determinism:
205
+ value: false
206
+ gradient_accumulation_steps:
207
+ value: 1
208
+ gradient_checkpointing:
209
+ value: false
210
+ gradient_checkpointing_kwargs:
211
+ value: null
212
+ greater_is_better:
213
+ value: true
214
+ group_by_length:
215
+ value: false
216
+ half_precision_backend:
217
+ value: auto
218
+ hidden_act:
219
+ value: gelu
220
+ hidden_dropout_prob:
221
+ value: 0.05
222
+ hidden_size:
223
+ value: 1024
224
+ hub_always_push:
225
+ value: false
226
+ hub_model_id:
227
+ value: null
228
+ hub_private_repo:
229
+ value: null
230
+ hub_strategy:
231
+ value: every_save
232
+ hub_token:
233
+ value: <HUB_TOKEN>
234
+ id2label:
235
+ value:
236
+ "0": "0"
237
+ "1": "1"
238
+ ignore_data_skip:
239
+ value: false
240
+ include_for_metrics:
241
+ value: []
242
+ include_inputs_for_metrics:
243
+ value: false
244
+ include_num_input_tokens_seen:
245
+ value: false
246
+ include_tokens_per_second:
247
+ value: false
248
+ initializer_range:
249
+ value: 0.02
250
+ intermediate_size:
251
+ value: 4096
252
+ is_decoder:
253
+ value: false
254
+ is_encoder_decoder:
255
+ value: false
256
+ jit_mode_eval:
257
+ value: false
258
+ label_names:
259
+ value: null
260
+ label_smoothing_factor:
261
+ value: 0
262
+ label2id:
263
+ value:
264
+ "0": 0
265
+ "1": 1
266
+ layer_norm_eps:
267
+ value: 1e-12
268
+ learning_rate:
269
+ value: 1e-05
270
+ length_column_name:
271
+ value: length
272
+ length_penalty:
273
+ value: 1
274
+ load_best_model_at_end:
275
+ value: true
276
+ local_rank:
277
+ value: 0
278
+ log_level:
279
+ value: passive
280
+ log_level_replica:
281
+ value: warning
282
+ log_on_each_node:
283
+ value: true
284
+ logging_dir:
285
+ value: ./runs/Dec11_04-29-04_fd55a770be24
286
+ logging_first_step:
287
+ value: false
288
+ logging_nan_inf_filter:
289
+ value: true
290
+ logging_steps:
291
+ value: 10
292
+ logging_strategy:
293
+ value: steps
294
+ lr_scheduler_type:
295
+ value: linear
296
+ max_grad_norm:
297
+ value: 1
298
+ max_length:
299
+ value: 20
300
+ max_position_embeddings:
301
+ value: 512
302
+ max_steps:
303
+ value: -1
304
+ metric_for_best_model:
305
+ value: f1
306
+ min_length:
307
+ value: 0
308
+ model/num_parameters:
309
+ value: 334398466
310
+ model_type:
311
+ value: bert
312
+ mp_parameters:
313
+ value: ""
314
+ neftune_noise_alpha:
315
+ value: null
316
+ no_cuda:
317
+ value: false
318
+ no_repeat_ngram_size:
319
+ value: 0
320
+ num_attention_heads:
321
+ value: 16
322
+ num_beam_groups:
323
+ value: 1
324
+ num_beams:
325
+ value: 1
326
+ num_hidden_layers:
327
+ value: 24
328
+ num_return_sequences:
329
+ value: 1
330
+ num_train_epochs:
331
+ value: 15
332
+ optim:
333
+ value: adamw_torch
334
+ optim_args:
335
+ value: null
336
+ optim_target_modules:
337
+ value: null
338
+ output_attentions:
339
+ value: false
340
+ output_dir:
341
+ value: .
342
+ output_hidden_states:
343
+ value: false
344
+ output_past:
345
+ value: true
346
+ output_scores:
347
+ value: false
348
+ overwrite_output_dir:
349
+ value: false
350
+ pad_token_id:
351
+ value: 0
352
+ past_index:
353
+ value: -1
354
+ per_device_eval_batch_size:
355
+ value: 128
356
+ per_device_train_batch_size:
357
+ value: 128
358
+ per_gpu_eval_batch_size:
359
+ value: null
360
+ per_gpu_train_batch_size:
361
+ value: null
362
+ pooler_fc_size:
363
+ value: 768
364
+ pooler_num_attention_heads:
365
+ value: 12
366
+ pooler_num_fc_layers:
367
+ value: 3
368
+ pooler_size_per_head:
369
+ value: 128
370
+ pooler_type:
371
+ value: first_token_transform
372
+ position_embedding_type:
373
+ value: absolute
374
+ prediction_loss_only:
375
+ value: false
376
+ prefix:
377
+ value: null
378
+ problem_type:
379
+ value: null
380
+ push_to_hub:
381
+ value: false
382
+ push_to_hub_model_id:
383
+ value: null
384
+ push_to_hub_organization:
385
+ value: null
386
+ push_to_hub_token:
387
+ value: <PUSH_TO_HUB_TOKEN>
388
+ ray_scope:
389
+ value: last
390
+ remove_invalid_values:
391
+ value: false
392
+ remove_unused_columns:
393
+ value: true
394
+ repetition_penalty:
395
+ value: 1
396
+ report_to:
397
+ value:
398
+ - wandb
399
+ restore_callback_states_from_checkpoint:
400
+ value: false
401
+ resume_from_checkpoint:
402
+ value: null
403
+ return_dict:
404
+ value: true
405
+ return_dict_in_generate:
406
+ value: false
407
+ run_name:
408
+ value: m2-congretimbau_tunado
409
+ save_on_each_node:
410
+ value: false
411
+ save_only_model:
412
+ value: false
413
+ save_safetensors:
414
+ value: true
415
+ save_steps:
416
+ value: 48
417
+ save_strategy:
418
+ value: epoch
419
+ save_total_limit:
420
+ value: 1
421
+ seed:
422
+ value: 5151
423
+ sep_token_id:
424
+ value: null
425
+ skip_memory_metrics:
426
+ value: true
427
+ split_batches:
428
+ value: null
429
+ suppress_tokens:
430
+ value: null
431
+ task_specific_params:
432
+ value: null
433
+ temperature:
434
+ value: 1
435
+ tf_legacy_loss:
436
+ value: false
437
+ tf32:
438
+ value: null
439
+ tie_encoder_decoder:
440
+ value: false
441
+ tie_word_embeddings:
442
+ value: true
443
+ tokenizer_class:
444
+ value: null
445
+ top_k:
446
+ value: 50
447
+ top_p:
448
+ value: 1
449
+ torch_compile:
450
+ value: false
451
+ torch_compile_backend:
452
+ value: null
453
+ torch_compile_mode:
454
+ value: null
455
+ torch_dtype:
456
+ value: float32
457
+ torch_empty_cache_steps:
458
+ value: null
459
+ torchdynamo:
460
+ value: null
461
+ torchscript:
462
+ value: false
463
+ tpu_metrics_debug:
464
+ value: false
465
+ tpu_num_cores:
466
+ value: null
467
+ transformers_version:
468
+ value: 4.47.0
469
+ type_vocab_size:
470
+ value: 2
471
+ typical_p:
472
+ value: 1
473
+ use_bfloat16:
474
+ value: false
475
+ use_cache:
476
+ value: true
477
+ use_cpu:
478
+ value: false
479
+ use_ipex:
480
+ value: false
481
+ use_legacy_prediction_loop:
482
+ value: false
483
+ use_liger_kernel:
484
+ value: false
485
+ use_mps_device:
486
+ value: false
487
+ vocab_size:
488
+ value: 29794
489
+ warmup_ratio:
490
+ value: 0
491
+ warmup_steps:
492
+ value: 200
493
+ weight_decay:
494
+ value: 0.02
wandb/run-20241211_042903-vt1hbdpt/files/output.log ADDED
File without changes
wandb/run-20241211_042903-vt1hbdpt/files/requirements.txt ADDED
@@ -0,0 +1,578 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ multiprocess==0.70.16
2
+ accelerate==1.2.0
3
+ fsspec==2024.9.0
4
+ wandb==0.19.0
5
+ transformers==4.47.0
6
+ evaluate==0.4.3
7
+ dill==0.3.8
8
+ tokenizers==0.21.0
9
+ datasets==3.2.0
10
+ xxhash==3.5.0
11
+ google-colab==1.0.0
12
+ termcolor==2.5.0
13
+ numexpr==2.10.2
14
+ duckdb==1.1.3
15
+ slicer==0.0.8
16
+ pandas-stubs==2.2.2.240909
17
+ yarl==1.18.3
18
+ jupyter-client==6.1.12
19
+ umf==0.9.1
20
+ aiohappyeyeballs==2.4.4
21
+ tensorflow-io-gcs-filesystem==0.37.1
22
+ nibabel==5.3.2
23
+ google-api-python-client==2.151.0
24
+ gensim==4.3.3
25
+ plotnine==0.14.3
26
+ stanio==0.5.1
27
+ pyogrio==0.10.0
28
+ nvidia-curand-cu12==10.3.7.77
29
+ Bottleneck==1.4.2
30
+ openpyxl==3.1.5
31
+ audioread==3.0.1
32
+ libcudf-cu12==24.10.1
33
+ networkx==3.4.2
34
+ cupy-cuda12x==12.2.0
35
+ nx-cugraph-cu12==24.10.0
36
+ aiosignal==1.3.1
37
+ StrEnum==0.4.15
38
+ jsonpickle==4.0.0
39
+ google-cloud-firestore==2.19.0
40
+ scipy==1.13.1
41
+ colour==0.1.5
42
+ pycparser==2.22
43
+ cmake==3.30.5
44
+ fastdownload==0.0.7
45
+ google-cloud-functions==1.18.1
46
+ imagesize==1.4.1
47
+ imageio==2.36.1
48
+ frozendict==2.4.6
49
+ ndindex==1.9.2
50
+ html5lib==1.1
51
+ python-louvain==0.16
52
+ Cython==3.0.11
53
+ fastai==2.7.18
54
+ cuda-python==12.2.1
55
+ parsy==2.1
56
+ smmap==5.0.1
57
+ weasel==0.4.1
58
+ pyasn1==0.6.1
59
+ matplotlib==3.8.0
60
+ immutabledict==4.2.1
61
+ dopamine_rl==4.0.9
62
+ uc-micro-py==1.0.3
63
+ websocket-client==1.8.0
64
+ anyio==3.7.1
65
+ msgpack==1.1.0
66
+ multitasking==0.0.11
67
+ albucore==0.0.19
68
+ scikit-image==0.24.0
69
+ setproctitle==1.3.4
70
+ marisa-trie==1.2.1
71
+ nvidia-cuda-nvcc-cu12==12.6.85
72
+ sphinxcontrib-qthelp==2.0.0
73
+ tables==3.10.1
74
+ kaggle==1.6.17
75
+ google-cloud-core==2.4.1
76
+ soupsieve==2.6
77
+ opencv-contrib-python==4.10.0.84
78
+ python-box==7.2.0
79
+ fonttools==4.55.1
80
+ Markdown==3.7
81
+ googleapis-common-protos==1.66.0
82
+ tensorflow-metadata==1.13.1
83
+ google-auth-httplib2==0.2.0
84
+ autograd==1.7.0
85
+ pydantic_core==2.27.1
86
+ pexpect==4.9.0
87
+ tweepy==4.14.0
88
+ pyshp==2.3.1
89
+ sklearn-pandas==2.2.0
90
+ toml==0.10.2
91
+ pydotplus==2.0.2
92
+ click==8.1.7
93
+ optree==0.13.1
94
+ catalogue==2.0.10
95
+ google-api-core==2.19.2
96
+ en-core-web-sm==3.7.1
97
+ tensorflow-datasets==4.9.7
98
+ pynvjitlink-cu12==0.4.0
99
+ pytest==8.3.4
100
+ portpicker==1.5.2
101
+ dm-tree==0.1.8
102
+ h5py==3.12.1
103
+ pandas-gbq==0.24.0
104
+ qdldl==0.1.7.post4
105
+ partd==1.4.2
106
+ wrapt==1.17.0
107
+ blis==0.7.11
108
+ textblob==0.17.1
109
+ jupyterlab_widgets==3.0.13
110
+ filelock==3.16.1
111
+ oauth2client==4.1.3
112
+ iniconfig==2.0.0
113
+ notebook==6.5.5
114
+ pydot==3.0.3
115
+ tomli==2.2.1
116
+ py4j==0.10.9.7
117
+ langchain-text-splitters==0.3.2
118
+ typer==0.15.0
119
+ contourpy==1.3.1
120
+ aiohttp==3.11.9
121
+ langchain==0.3.9
122
+ datascience==0.17.6
123
+ etuples==0.3.9
124
+ six==1.16.0
125
+ tornado==6.3.3
126
+ torchaudio==2.5.1+cu121
127
+ httpcore==1.0.7
128
+ importlib_metadata==8.5.0
129
+ holoviews==1.20.0
130
+ google-ai-generativelanguage==0.6.10
131
+ osqp==0.6.7.post3
132
+ pymc==5.18.2
133
+ google-pasta==0.2.0
134
+ py-cpuinfo==9.0.0
135
+ colorcet==3.1.0
136
+ blosc2==2.7.1
137
+ nvidia-cuda-cupti-cu12==12.6.80
138
+ colorlover==0.3.0
139
+ pymystem3==0.2.0
140
+ MarkupSafe==3.0.2
141
+ astunparse==1.6.3
142
+ google-cloud-bigquery==3.25.0
143
+ opencv-python==4.10.0.84
144
+ Flask==3.0.3
145
+ fastjsonschema==2.21.1
146
+ pylibcugraph-cu12==24.10.0
147
+ google-cloud-datastore==2.20.1
148
+ jaxlib==0.4.33
149
+ rich==13.9.4
150
+ widgetsnbextension==3.6.10
151
+ vega-datasets==0.9.0
152
+ natsort==8.4.0
153
+ blinker==1.9.0
154
+ statsmodels==0.14.4
155
+ sniffio==1.3.1
156
+ cymem==2.0.10
157
+ threadpoolctl==3.5.0
158
+ proto-plus==1.25.0
159
+ httpimport==1.4.0
160
+ matplotlib-inline==0.1.7
161
+ ratelim==0.1.6
162
+ python-utils==3.9.1
163
+ prompt_toolkit==3.0.48
164
+ gspread==6.0.2
165
+ google-cloud-storage==2.8.0
166
+ PyDrive==1.3.1
167
+ tensorflow-probability==0.24.0
168
+ xarray==2024.10.0
169
+ eerepr==0.0.4
170
+ murmurhash==1.0.11
171
+ pylibcudf-cu12==24.10.1
172
+ music21==9.3.0
173
+ geographiclib==2.0
174
+ smart-open==7.0.5
175
+ pygit2==1.16.0
176
+ more-itertools==10.5.0
177
+ cycler==0.12.1
178
+ idna==3.10
179
+ h11==0.14.0
180
+ ipython-genutils==0.2.0
181
+ referencing==0.35.1
182
+ GitPython==3.1.43
183
+ sentencepiece==0.2.0
184
+ CacheControl==0.14.1
185
+ tf_keras==2.17.0
186
+ annotated-types==0.7.0
187
+ ecos==2.0.14
188
+ geopandas==1.0.1
189
+ jieba==0.42.1
190
+ shapely==2.0.6
191
+ ipyparallel==8.8.0
192
+ pyOpenSSL==24.2.1
193
+ pyarrow==17.0.0
194
+ opencv-python-headless==4.10.0.84
195
+ openai==1.54.5
196
+ sentry-sdk==2.19.0
197
+ pyarrow-hotfix==0.6
198
+ gitdb==4.0.11
199
+ prophet==1.1.6
200
+ ipython-sql==0.5.0
201
+ tf-slim==1.1.0
202
+ PySocks==1.7.1
203
+ imbalanced-learn==0.12.4
204
+ mlxtend==0.23.3
205
+ mkl==2025.0.1
206
+ itsdangerous==2.2.0
207
+ gast==0.6.0
208
+ torchsummary==1.5.1
209
+ urllib3==2.2.3
210
+ earthengine-api==1.2.0
211
+ jupyter-server==1.24.0
212
+ webencodings==0.5.1
213
+ seaborn==0.13.2
214
+ SQLAlchemy==2.0.36
215
+ pyerfa==2.0.1.5
216
+ xarray-einstats==0.8.0
217
+ jsonschema==4.23.0
218
+ parso==0.8.4
219
+ tensorflow-hub==0.16.1
220
+ ipyfilechooser==0.6.0
221
+ packaging==24.2
222
+ param==2.1.1
223
+ et_xmlfile==2.0.0
224
+ kagglehub==0.3.4
225
+ yfinance==0.2.50
226
+ spacy==3.7.5
227
+ imgaug==0.4.0
228
+ sqlparse==0.5.2
229
+ certifi==2024.8.30
230
+ plotly==5.24.1
231
+ pandas-datareader==0.10.0
232
+ charset-normalizer==3.4.0
233
+ cloudpathlib==0.20.0
234
+ decorator==4.4.2
235
+ prometheus_client==0.21.1
236
+ google-auth==2.27.0
237
+ pyzmq==24.0.1
238
+ tzdata==2024.2
239
+ scs==3.2.7
240
+ types-pytz==2024.2.0.20241003
241
+ db-dtypes==1.3.1
242
+ chex==0.1.87
243
+ spacy-loggers==1.0.5
244
+ peewee==3.17.8
245
+ nvidia-cuda-runtime-cu12==12.6.77
246
+ ibis-framework==9.2.0
247
+ wasabi==1.1.3
248
+ python-dateutil==2.8.2
249
+ google-generativeai==0.8.3
250
+ google-cloud-aiplatform==1.73.0
251
+ jupyterlab_pygments==0.3.0
252
+ google-cloud-translate==3.17.0
253
+ exceptiongroup==1.2.2
254
+ intel-openmp==2025.0.3
255
+ nvidia-cufft-cu12==11.3.0.4
256
+ timm==1.0.12
257
+ python-slugify==8.0.4
258
+ cryptography==43.0.3
259
+ multipledispatch==1.0.0
260
+ google-crc32c==1.6.0
261
+ rsa==4.9
262
+ cffi==1.17.1
263
+ holidays==0.62
264
+ tinycss2==1.4.0
265
+ google-cloud-iam==2.16.1
266
+ google-cloud-resource-manager==1.13.1
267
+ typeguard==4.4.1
268
+ xlrd==2.0.1
269
+ google==2.0.3
270
+ pandocfilters==1.5.1
271
+ promise==2.3
272
+ progressbar2==4.5.0
273
+ wheel==0.45.1
274
+ google-resumable-media==2.7.2
275
+ einops==0.8.0
276
+ defusedxml==0.7.1
277
+ llvmlite==0.43.0
278
+ attrs==24.2.0
279
+ preshed==3.0.9
280
+ geocoder==1.38.1
281
+ firebase-admin==6.5.0
282
+ ipython==7.34.0
283
+ cvxopt==1.3.2
284
+ greenlet==3.1.1
285
+ confection==0.1.5
286
+ keras==3.5.0
287
+ pandas==2.2.2
288
+ graphviz==0.20.3
289
+ tenacity==9.0.0
290
+ future==1.0.0
291
+ inflect==7.4.0
292
+ langsmith==0.1.147
293
+ babel==2.16.0
294
+ nvidia-cusparse-cu12==12.5.4.2
295
+ importlib_resources==6.4.5
296
+ jiter==0.8.0
297
+ sphinxcontrib-htmlhelp==2.1.0
298
+ moviepy==1.0.3
299
+ typing_extensions==4.12.2
300
+ tcmlib==1.2.0
301
+ snowballstemmer==2.2.0
302
+ scikit-learn==1.5.2
303
+ jsonpatch==1.33
304
+ notebook_shim==0.2.4
305
+ mizani==0.13.0
306
+ simple-parsing==0.1.6
307
+ humanize==4.11.0
308
+ hyperopt==0.2.7
309
+ frozenlist==1.5.0
310
+ uritemplate==4.1.1
311
+ pydata-google-auth==1.9.0
312
+ pyperclip==1.9.0
313
+ lazy_loader==0.4
314
+ bigframes==1.27.0
315
+ numba==0.60.0
316
+ pytensor==2.26.4
317
+ geemap==0.35.1
318
+ polars==1.9.0
319
+ orjson==3.10.12
320
+ fastrlock==0.8.2
321
+ mistune==3.0.2
322
+ sphinxcontrib-applehelp==2.0.0
323
+ mpmath==1.3.0
324
+ mdurl==0.1.2
325
+ async-timeout==4.0.3
326
+ jax==0.4.33
327
+ panel==1.5.4
328
+ patsy==1.0.1
329
+ oauthlib==3.2.2
330
+ librosa==0.10.2.post1
331
+ imutils==0.5.4
332
+ xyzservices==2024.9.0
333
+ Pygments==2.18.0
334
+ sphinxcontrib-jsmath==1.0.1
335
+ pyasn1_modules==0.4.1
336
+ pyproj==3.7.0
337
+ Pyomo==6.8.2
338
+ nvidia-cublas-cu12==12.6.4.1
339
+ numpy==1.26.4
340
+ google-cloud-bigquery-storage==2.27.0
341
+ soxr==0.5.0.post1
342
+ bleach==6.2.0
343
+ sympy==1.13.1
344
+ dask==2024.10.0
345
+ miniKanren==1.0.3
346
+ pathlib==1.0.1
347
+ torch==2.5.1+cu121
348
+ distro==1.9.0
349
+ gym==0.25.2
350
+ lxml==5.3.0
351
+ folium==0.18.0
352
+ altair==4.2.2
353
+ tifffile==2024.9.20
354
+ diffusers==0.31.0
355
+ requests-oauthlib==1.3.1
356
+ google-cloud-language==2.15.1
357
+ ptyprocess==0.7.0
358
+ tensorboard-data-server==0.7.2
359
+ locket==1.0.0
360
+ intel-cmplr-lib-ur==2025.0.3
361
+ lightgbm==4.5.0
362
+ flax==0.8.5
363
+ mdit-py-plugins==0.4.2
364
+ pygame==2.6.1
365
+ pytz==2024.2
366
+ pyparsing==3.2.0
367
+ alabaster==1.0.0
368
+ jupyter_core==5.7.2
369
+ cloudpickle==3.1.0
370
+ logical-unification==0.4.6
371
+ traittypes==0.2.1
372
+ terminado==0.18.1
373
+ beautifulsoup4==4.12.3
374
+ google-cloud-bigquery-connection==1.16.1
375
+ scooby==0.10.0
376
+ google-auth-oauthlib==1.2.1
377
+ protobuf==4.25.5
378
+ astropy==6.1.7
379
+ GDAL==3.6.4
380
+ nbformat==5.10.4
381
+ safetensors==0.4.5
382
+ tzlocal==5.2
383
+ chardet==5.2.0
384
+ psutil==5.9.5
385
+ jsonpointer==3.0.0
386
+ psycopg2==2.9.10
387
+ cachetools==5.5.0
388
+ pyspark==3.5.3
389
+ google-cloud-pubsub==2.27.1
390
+ pillow==11.0.0
391
+ pydantic==2.10.3
392
+ langcodes==3.5.0
393
+ jsonschema-specifications==2024.10.1
394
+ markdown-it-py==3.0.0
395
+ nltk==3.9.1
396
+ nbconvert==7.16.4
397
+ pooch==1.8.2
398
+ huggingface-hub==0.26.3
399
+ tbb==2022.0.0
400
+ ipywidgets==7.7.1
401
+ matplotlib-venn==1.1.1
402
+ rmm-cu12==24.10.0
403
+ sphinxcontrib-serializinghtml==2.0.0
404
+ branca==0.8.0
405
+ gin-config==0.5.0
406
+ bqplot==0.12.43
407
+ opentelemetry-sdk==1.28.2
408
+ community==1.0.0b1
409
+ soundfile==0.12.1
410
+ shap==0.46.0
411
+ joblib==1.4.2
412
+ entrypoints==0.4
413
+ docstring_parser==0.16
414
+ multidict==6.1.0
415
+ grpcio-status==1.62.3
416
+ wcwidth==0.2.13
417
+ geopy==2.4.1
418
+ prettytable==3.12.0
419
+ propcache==0.2.1
420
+ ml-dtypes==0.4.1
421
+ atpublic==4.1.0
422
+ stringzilla==3.11.0
423
+ libclang==18.1.1
424
+ arviz==0.20.0
425
+ requests==2.32.3
426
+ fastprogress==1.0.3
427
+ dlib==19.24.2
428
+ platformdirs==4.3.6
429
+ tabulate==0.9.0
430
+ cufflinks==0.17.3
431
+ spacy-legacy==3.0.12
432
+ grpcio==1.68.1
433
+ cvxpy==1.5.4
434
+ flatbuffers==24.3.25
435
+ pluggy==1.5.0
436
+ regex==2024.9.11
437
+ clarabel==0.9.0
438
+ httplib2==0.22.0
439
+ language_data==1.3.0
440
+ backcall==0.2.0
441
+ nbclient==0.10.1
442
+ ipyleaflet==0.19.2
443
+ PyJWT==2.10.1
444
+ toolz==0.12.1
445
+ opt_einsum==3.4.0
446
+ nvtx==0.2.10
447
+ nvidia-nccl-cu12==2.23.4
448
+ debugpy==1.8.0
449
+ sphinxcontrib-devhelp==2.0.0
450
+ jax-cuda12-pjrt==0.4.33
451
+ missingno==0.5.2
452
+ proglog==0.1.10
453
+ nest-asyncio==1.6.0
454
+ nvidia-cusolver-cu12==11.7.1.2
455
+ gspread-dataframe==3.3.1
456
+ bigquery-magics==0.4.0
457
+ jellyfish==1.1.2
458
+ text-unidecode==1.3
459
+ Deprecated==1.2.15
460
+ srsly==2.4.8
461
+ ipytree==0.2.2
462
+ gcsfs==2024.10.0
463
+ ipyevents==2.0.2
464
+ docker-pycreds==0.4.0
465
+ glob2==0.7
466
+ wordcloud==1.9.4
467
+ traitlets==5.7.1
468
+ nbclassic==1.1.0
469
+ nvidia-cudnn-cu12==9.6.0.74
470
+ easydict==1.13
471
+ httpx==0.28.0
472
+ Jinja2==3.1.4
473
+ argon2-cffi==23.1.0
474
+ linkify-it-py==2.0.3
475
+ optax==0.2.4
476
+ cons==0.4.6
477
+ requests-toolbelt==1.0.0
478
+ editdistance==0.8.1
479
+ jupyter-leaflet==0.19.2
480
+ webcolors==24.11.1
481
+ jax-cuda12-plugin==0.4.33
482
+ albumentations==1.4.20
483
+ PyOpenGL==3.1.7
484
+ tqdm==4.66.6
485
+ googledrivedownloader==0.4
486
+ peft==0.13.2
487
+ absl-py==1.4.0
488
+ eval_type_backport==0.2.0
489
+ opentelemetry-api==1.28.2
490
+ Send2Trash==1.8.3
491
+ gdown==5.2.0
492
+ google-cloud-bigtable==2.27.0
493
+ grpc-google-iam-v1==0.13.1
494
+ rpds-py==0.22.3
495
+ xgboost==2.1.3
496
+ pyviz_comms==3.0.3
497
+ ipykernel==5.5.6
498
+ gym-notices==0.0.8
499
+ opentelemetry-semantic-conventions==0.49b2
500
+ fastcore==1.7.22
501
+ torchvision==0.20.1+cu121
502
+ Sphinx==8.1.3
503
+ ply==3.11
504
+ tensorboard==2.17.1
505
+ namex==0.0.8
506
+ astropy-iers-data==0.2024.12.2.0.35.34
507
+ rpy2==3.4.2
508
+ nvidia-nvjitlink-cu12==12.6.85
509
+ array_record==0.5.1
510
+ PyYAML==6.0.2
511
+ tensorflow==2.17.1
512
+ yellowbrick==1.5
513
+ pylibraft-cu12==24.10.0
514
+ orbax-checkpoint==0.6.4
515
+ Werkzeug==3.1.3
516
+ jupyter-console==6.1.0
517
+ thinc==8.2.5
518
+ etils==1.11.0
519
+ langchain-core==0.3.21
520
+ cmdstanpy==1.2.4
521
+ h5netcdf==1.4.1
522
+ cudf-cu12==24.10.1
523
+ zipp==3.21.0
524
+ PyDrive2==1.21.3
525
+ kiwisolver==1.4.7
526
+ pickleshare==0.7.5
527
+ bokeh==3.6.2
528
+ shellingham==1.5.4
529
+ imageio-ffmpeg==0.5.1
530
+ pycocotools==2.0.8
531
+ sqlglot==25.1.0
532
+ tensorstore==0.1.69
533
+ argon2-cffi-bindings==21.2.0
534
+ sentence-transformers==3.2.1
535
+ docutils==0.21.2
536
+ python-apt==0.0.0
537
+ setuptools==75.1.0
538
+ pip==24.1.2
539
+ requirements-parser==0.9.0
540
+ types-setuptools==75.6.0.20241126
541
+ importlib-metadata==4.6.4
542
+ blinker==1.4
543
+ PyJWT==2.3.0
544
+ PyGObject==3.42.1
545
+ cryptography==3.4.8
546
+ pyparsing==2.4.7
547
+ wadllib==1.3.6
548
+ more-itertools==8.10.0
549
+ zipp==1.0.0
550
+ distro==1.7.0
551
+ six==1.16.0
552
+ python-apt==2.4.0+ubuntu4
553
+ jeepney==0.7.1
554
+ SecretStorage==3.3.1
555
+ oauthlib==3.2.0
556
+ lazr.restfulclient==0.14.4
557
+ httplib2==0.20.2
558
+ launchpadlib==1.10.16
559
+ keyring==23.5.0
560
+ lazr.uri==1.0.6
561
+ dbus-python==1.2.18
562
+ inflect==7.3.1
563
+ wheel==0.43.0
564
+ packaging==24.1
565
+ importlib_metadata==8.0.0
566
+ more-itertools==10.3.0
567
+ platformdirs==4.2.2
568
+ backports.tarfile==1.2.0
569
+ typing_extensions==4.12.2
570
+ jaraco.functools==4.0.1
571
+ jaraco.collections==5.1.0
572
+ tomli==2.0.1
573
+ importlib_resources==6.4.0
574
+ jaraco.text==3.12.1
575
+ jaraco.context==5.3.0
576
+ typeguard==4.3.0
577
+ autocommand==2.2.2
578
+ zipp==3.19.2
wandb/run-20241211_042903-vt1hbdpt/files/wandb-metadata.json ADDED
@@ -0,0 +1,38 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "os": "Linux-6.1.85+-x86_64-with-glibc2.35",
3
+ "python": "CPython 3.10.12",
4
+ "startedAt": "2024-12-11T04:29:03.449972Z",
5
+ "program": "m2-congretimbau-focal_loss-sempartido.ipynb",
6
+ "email": "[email protected]",
7
+ "root": "/content",
8
+ "host": "fd55a770be24",
9
+ "username": "root",
10
+ "executable": "/usr/bin/python3",
11
+ "colab": "https://colab.research.google.com/notebook#fileId=1a_b6x6zKkekdeqrCzP83ovWk7MhdV3DO",
12
+ "cpu_count": 6,
13
+ "cpu_count_logical": 12,
14
+ "gpu": "NVIDIA A100-SXM4-40GB",
15
+ "gpu_count": 1,
16
+ "disk": {
17
+ "/": {
18
+ "total": "120942624768",
19
+ "used": "36614742016"
20
+ }
21
+ },
22
+ "memory": {
23
+ "total": "89632796672"
24
+ },
25
+ "cpu": {
26
+ "count": 6,
27
+ "countLogical": 12
28
+ },
29
+ "gpu_nvidia": [
30
+ {
31
+ "name": "NVIDIA A100-SXM4-40GB",
32
+ "memoryTotal": "42949672960",
33
+ "cudaCores": 6912,
34
+ "architecture": "Ampere"
35
+ }
36
+ ],
37
+ "cudaVersion": "12.2"
38
+ }
wandb/run-20241211_042903-vt1hbdpt/files/wandb-summary.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"_wandb":{"runtime":4}}
wandb/run-20241211_042903-vt1hbdpt/logs/debug-core.log ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {"time":"2024-12-11T04:29:03.186536795Z","level":"INFO","msg":"started logging, with flags","port-filename":"/tmp/tmphfycrb2n/port-1705.txt","pid":1705,"debug":false,"disable-analytics":false}
2
+ {"time":"2024-12-11T04:29:03.186572308Z","level":"INFO","msg":"FeatureState","shutdownOnParentExitEnabled":false}
3
+ {"time":"2024-12-11T04:29:03.191093789Z","level":"INFO","msg":"Will exit if parent process dies.","ppid":1705}
4
+ {"time":"2024-12-11T04:29:03.19108347Z","level":"INFO","msg":"server is running","addr":{"IP":"127.0.0.1","Port":33261,"Zone":""}}
5
+ {"time":"2024-12-11T04:29:03.375152489Z","level":"INFO","msg":"connection: ManageConnectionData: new connection created","id":"127.0.0.1:34992"}
6
+ {"time":"2024-12-11T04:29:03.451090957Z","level":"INFO","msg":"handleInformInit: received","streamId":"vt1hbdpt","id":"127.0.0.1:34992"}
7
+ {"time":"2024-12-11T04:29:03.553761261Z","level":"INFO","msg":"handleInformInit: stream started","streamId":"vt1hbdpt","id":"127.0.0.1:34992"}
8
+ {"time":"2024-12-11T04:30:04.523626016Z","level":"INFO","msg":"handleInformTeardown: server teardown initiated","id":"127.0.0.1:34992"}
9
+ {"time":"2024-12-11T04:30:04.523720479Z","level":"INFO","msg":"connection: Close: initiating connection closure","id":"127.0.0.1:34992"}
10
+ {"time":"2024-12-11T04:30:04.523896711Z","level":"INFO","msg":"connection: Close: connection successfully closed","id":"127.0.0.1:34992"}
11
+ {"time":"2024-12-11T04:30:04.523761243Z","level":"INFO","msg":"server is shutting down"}
12
+ {"time":"2024-12-11T04:30:04.758520303Z","level":"INFO","msg":"handleInformTeardown: server shutdown complete","id":"127.0.0.1:34992"}
13
+ {"time":"2024-12-11T04:30:04.758557239Z","level":"INFO","msg":"connection: ManageConnectionData: connection closed","id":"127.0.0.1:34992"}
14
+ {"time":"2024-12-11T04:30:04.758576834Z","level":"INFO","msg":"server is closed"}
wandb/run-20241211_042903-vt1hbdpt/logs/debug-internal.log ADDED
@@ -0,0 +1,25 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {"time":"2024-12-11T04:29:03.451234822Z","level":"INFO","msg":"using version","core version":"0.19.0"}
2
+ {"time":"2024-12-11T04:29:03.451254718Z","level":"INFO","msg":"created symlink","path":"/content/wandb/run-20241211_042903-vt1hbdpt/logs/debug-core.log"}
3
+ {"time":"2024-12-11T04:29:03.553721829Z","level":"INFO","msg":"created new stream","id":"vt1hbdpt"}
4
+ {"time":"2024-12-11T04:29:03.55375477Z","level":"INFO","msg":"stream: started","id":"vt1hbdpt"}
5
+ {"time":"2024-12-11T04:29:03.553784092Z","level":"INFO","msg":"writer: Do: started","stream_id":"vt1hbdpt"}
6
+ {"time":"2024-12-11T04:29:03.553816218Z","level":"INFO","msg":"sender: started","stream_id":"vt1hbdpt"}
7
+ {"time":"2024-12-11T04:29:03.553854616Z","level":"INFO","msg":"handler: started","stream_id":"vt1hbdpt"}
8
+ {"time":"2024-12-11T04:29:04.254776028Z","level":"INFO","msg":"Starting system monitor"}
9
+ {"time":"2024-12-11T04:29:04.735542801Z","level":"INFO","msg":"Pausing system monitor"}
10
+ {"time":"2024-12-11T04:29:04.735565431Z","level":"INFO","msg":"Resuming system monitor"}
11
+ {"time":"2024-12-11T04:29:04.735570019Z","level":"INFO","msg":"Pausing system monitor"}
12
+ {"time":"2024-12-11T04:29:04.73557386Z","level":"INFO","msg":"Resuming system monitor"}
13
+ {"time":"2024-12-11T04:29:04.735577453Z","level":"INFO","msg":"Pausing system monitor"}
14
+ {"time":"2024-12-11T04:29:04.73558085Z","level":"INFO","msg":"Resuming system monitor"}
15
+ {"time":"2024-12-11T04:29:05.371333804Z","level":"INFO","msg":"Pausing system monitor"}
16
+ {"time":"2024-12-11T04:29:05.412455486Z","level":"INFO","msg":"Resuming system monitor"}
17
+ {"time":"2024-12-11T04:29:07.712940187Z","level":"INFO","msg":"Pausing system monitor"}
18
+ {"time":"2024-12-11T04:30:04.523703541Z","level":"INFO","msg":"stream: closing","id":"vt1hbdpt"}
19
+ {"time":"2024-12-11T04:30:04.523822901Z","level":"INFO","msg":"Stopping system monitor"}
20
+ {"time":"2024-12-11T04:30:04.524610231Z","level":"INFO","msg":"Stopped system monitor"}
21
+ {"time":"2024-12-11T04:30:04.727360156Z","level":"INFO","msg":"fileTransfer: Close: file transfer manager closed"}
22
+ {"time":"2024-12-11T04:30:04.758327151Z","level":"INFO","msg":"handler: closed","stream_id":"vt1hbdpt"}
23
+ {"time":"2024-12-11T04:30:04.758371023Z","level":"INFO","msg":"writer: Close: closed","stream_id":"vt1hbdpt"}
24
+ {"time":"2024-12-11T04:30:04.758408383Z","level":"INFO","msg":"sender: closed","stream_id":"vt1hbdpt"}
25
+ {"time":"2024-12-11T04:30:04.758441234Z","level":"INFO","msg":"stream: closed","id":"vt1hbdpt"}
wandb/run-20241211_042903-vt1hbdpt/logs/debug.log ADDED
@@ -0,0 +1,42 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2024-12-11 04:29:03,445 INFO MainThread:1705 [wandb_setup.py:_flush():68] Current SDK version is 0.19.0
2
+ 2024-12-11 04:29:03,446 INFO MainThread:1705 [wandb_setup.py:_flush():68] Configure stats pid to 1705
3
+ 2024-12-11 04:29:03,446 INFO MainThread:1705 [wandb_setup.py:_flush():68] Loading settings from /root/.config/wandb/settings
4
+ 2024-12-11 04:29:03,446 INFO MainThread:1705 [wandb_setup.py:_flush():68] Loading settings from /content/wandb/settings
5
+ 2024-12-11 04:29:03,446 INFO MainThread:1705 [wandb_setup.py:_flush():68] Loading settings from environment variables
6
+ 2024-12-11 04:29:03,446 INFO MainThread:1705 [wandb_init.py:_log_setup():528] Logging user logs to /content/wandb/run-20241211_042903-vt1hbdpt/logs/debug.log
7
+ 2024-12-11 04:29:03,446 INFO MainThread:1705 [wandb_init.py:_log_setup():529] Logging internal logs to /content/wandb/run-20241211_042903-vt1hbdpt/logs/debug-internal.log
8
+ 2024-12-11 04:29:03,446 INFO MainThread:1705 [wandb_init.py:_jupyter_setup():474] configuring jupyter hooks <wandb.sdk.wandb_init._WandbInit object at 0x79799d31dc60>
9
+ 2024-12-11 04:29:03,446 INFO MainThread:1705 [wandb_init.py:init():639] calling init triggers
10
+ 2024-12-11 04:29:03,446 INFO MainThread:1705 [wandb_init.py:init():645] wandb.init called with sweep_config: {}
11
+ config: {}
12
+ 2024-12-11 04:29:03,447 INFO MainThread:1705 [wandb_init.py:init():688] starting backend
13
+ 2024-12-11 04:29:03,447 INFO MainThread:1705 [wandb_init.py:init():692] sending inform_init request
14
+ 2024-12-11 04:29:03,449 INFO MainThread:1705 [backend.py:_multiprocessing_setup():104] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
15
+ 2024-12-11 04:29:03,449 INFO MainThread:1705 [wandb_init.py:init():705] backend started and connected
16
+ 2024-12-11 04:29:03,457 INFO MainThread:1705 [wandb_run.py:_label_probe_notebook():1327] probe notebook
17
+ 2024-12-11 04:29:04,113 INFO MainThread:1705 [wandb_init.py:init():798] updated telemetry
18
+ 2024-12-11 04:29:04,114 INFO MainThread:1705 [wandb_init.py:init():830] communicating run to backend with 90.0 second timeout
19
+ 2024-12-11 04:29:04,250 INFO MainThread:1705 [wandb_init.py:init():882] starting run threads in backend
20
+ 2024-12-11 04:29:04,695 INFO MainThread:1705 [wandb_run.py:_console_start():2443] atexit reg
21
+ 2024-12-11 04:29:04,695 INFO MainThread:1705 [wandb_run.py:_redirect():2293] redirect: wrap_raw
22
+ 2024-12-11 04:29:04,695 INFO MainThread:1705 [wandb_run.py:_redirect():2358] Wrapping output streams.
23
+ 2024-12-11 04:29:04,695 INFO MainThread:1705 [wandb_run.py:_redirect():2383] Redirects installed.
24
+ 2024-12-11 04:29:04,698 INFO MainThread:1705 [wandb_init.py:init():925] run started, returning control to user process
25
+ 2024-12-11 04:29:04,701 INFO MainThread:1705 [jupyter.py:save_ipynb():387] not saving jupyter notebook
26
+ 2024-12-11 04:29:04,701 INFO MainThread:1705 [wandb_init.py:_pause_backend():439] pausing backend
27
+ 2024-12-11 04:29:04,705 INFO MainThread:1705 [wandb_init.py:_resume_backend():444] resuming backend
28
+ 2024-12-11 04:29:04,706 INFO MainThread:1705 [jupyter.py:save_ipynb():387] not saving jupyter notebook
29
+ 2024-12-11 04:29:04,706 INFO MainThread:1705 [wandb_init.py:_pause_backend():439] pausing backend
30
+ 2024-12-11 04:29:04,713 INFO MainThread:1705 [wandb_init.py:_resume_backend():444] resuming backend
31
+ 2024-12-11 04:29:04,714 INFO MainThread:1705 [jupyter.py:save_ipynb():387] not saving jupyter notebook
32
+ 2024-12-11 04:29:04,714 INFO MainThread:1705 [wandb_init.py:_pause_backend():439] pausing backend
33
+ 2024-12-11 04:29:04,719 INFO MainThread:1705 [wandb_init.py:_resume_backend():444] resuming backend
34
+ 2024-12-11 04:29:05,370 INFO MainThread:1705 [jupyter.py:save_ipynb():387] not saving jupyter notebook
35
+ 2024-12-11 04:29:05,371 INFO MainThread:1705 [wandb_init.py:_pause_backend():439] pausing backend
36
+ 2024-12-11 04:29:05,375 INFO MainThread:1705 [wandb_init.py:_resume_backend():444] resuming backend
37
+ 2024-12-11 04:29:05,773 INFO MainThread:1705 [wandb_run.py:_config_callback():1375] config_cb None None {'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'float32', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': True, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['BertForMaskedLM'], 'finetuning_task': None, 'id2label': {0: '0', 1: '1'}, 'label2id': {'0': 0, '1': 1}, 'tokenizer_class': None, 'prefix': None, 'bos_token_id': None, 'pad_token_id': 0, 'eos_token_id': None, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'belisards/congretimbau', '_attn_implementation_autoset': True, 'transformers_version': '4.47.0', 'directionality': 'bidi', 'model_type': 'bert', 'output_past': True, 'pooler_fc_size': 768, 'pooler_num_attention_heads': 12, 'pooler_num_fc_layers': 3, 'pooler_size_per_head': 128, 'pooler_type': 'first_token_transform', 'vocab_size': 29794, 'hidden_size': 1024, 'num_hidden_layers': 24, 'num_attention_heads': 16, 'hidden_act': 'gelu', 'intermediate_size': 4096, 'hidden_dropout_prob': 0.05, 'attention_probs_dropout_prob': 0.05, 'max_position_embeddings': 512, 'type_vocab_size': 2, 'initializer_range': 0.02, 'layer_norm_eps': 1e-12, 'position_embedding_type': 'absolute', 'use_cache': True, 'classifier_dropout': None, 'output_dir': '.', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': True, 'do_predict': False, 'eval_strategy': 'epoch', 'prediction_loss_only': False, 'per_device_train_batch_size': 128, 'per_device_eval_batch_size': 128, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 1, 'eval_accumulation_steps': None, 'eval_delay': 0, 'torch_empty_cache_steps': None, 'learning_rate': 1e-05, 'weight_decay': 0.02, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 15, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 200, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': './runs/Dec11_04-29-04_fd55a770be24', 'logging_strategy': 'steps', 'logging_first_step': False, 'logging_steps': 10, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 48, 'save_total_limit': 1, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'restore_callback_states_from_checkpoint': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 5151, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': False, 'fp16': False, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': 24, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': 'm2-congretimbau_tunado', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': True, 'metric_for_best_model': 'f1', 'greater_is_better': True, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'adamw_torch', 'optim_args': None, 'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': None, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'include_for_metrics': [], 'eval_do_concat_batches': True, 'fp16_backend': 'auto', 'evaluation_strategy': None, 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': False, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None, 'optim_target_modules': None, 'batch_eval_metrics': False, 'eval_on_start': False, 'use_liger_kernel': False, 'eval_use_gather_object': False, 'average_tokens_across_devices': False}
38
+ 2024-12-11 04:29:05,777 INFO MainThread:1705 [wandb_config.py:__setitem__():154] config set model/num_parameters = 334398466 - <bound method Run._config_callback of <wandb.sdk.wandb_run.Run object at 0x797955c8e920>>
39
+ 2024-12-11 04:29:05,777 INFO MainThread:1705 [wandb_run.py:_config_callback():1375] config_cb model/num_parameters 334398466 None
40
+ 2024-12-11 04:29:07,712 INFO MainThread:1705 [jupyter.py:save_ipynb():387] not saving jupyter notebook
41
+ 2024-12-11 04:29:07,712 INFO MainThread:1705 [wandb_init.py:_pause_backend():439] pausing backend
42
+ 2024-12-11 04:30:04,523 WARNING MsgRouterThr:1705 [router.py:message_loop():75] message_loop has been closed
wandb/run-20241211_042903-vt1hbdpt/run-vt1hbdpt.wandb ADDED
Binary file (7.85 kB). View file
 
wandb/run-20241211_043031-pfhjmc9v/files/config.yaml ADDED
@@ -0,0 +1,568 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ _attn_implementation_autoset:
2
+ value: true
3
+ _name_or_path:
4
+ value: belisards/congretimbau
5
+ _wandb:
6
+ value:
7
+ cli_version: 0.19.0
8
+ m:
9
+ - "1": train/global_step
10
+ "6":
11
+ - 3
12
+ "7": []
13
+ - "1": train/learning_rate
14
+ "5": 1
15
+ "6":
16
+ - 1
17
+ - 3
18
+ "7": []
19
+ - "1": eval/accuracy
20
+ "5": 1
21
+ "6":
22
+ - 1
23
+ - 3
24
+ "7": []
25
+ - "1": eval/steps_per_second
26
+ "5": 1
27
+ "6":
28
+ - 1
29
+ - 3
30
+ "7": []
31
+ - "1": eval/runtime
32
+ "5": 1
33
+ "6":
34
+ - 1
35
+ - 3
36
+ "7": []
37
+ - "1": eval/loss
38
+ "5": 1
39
+ "6":
40
+ - 1
41
+ - 3
42
+ "7": []
43
+ - "1": eval/f1
44
+ "5": 1
45
+ "6":
46
+ - 1
47
+ - 3
48
+ "7": []
49
+ - "1": train/loss
50
+ "5": 1
51
+ "6":
52
+ - 1
53
+ - 3
54
+ "7": []
55
+ - "1": train/grad_norm
56
+ "5": 1
57
+ "6":
58
+ - 1
59
+ - 3
60
+ "7": []
61
+ - "1": train/epoch
62
+ "5": 1
63
+ "6":
64
+ - 1
65
+ - 3
66
+ "7": []
67
+ - "1": eval/samples_per_second
68
+ "5": 1
69
+ "6":
70
+ - 1
71
+ - 3
72
+ "7": []
73
+ - "1": eval/precision
74
+ "5": 1
75
+ "6":
76
+ - 1
77
+ - 3
78
+ "7": []
79
+ - "1": eval/recall
80
+ "5": 1
81
+ "6":
82
+ - 1
83
+ - 3
84
+ "7": []
85
+ python_version: 3.10.12
86
+ t:
87
+ "1":
88
+ - 1
89
+ - 2
90
+ - 3
91
+ - 5
92
+ - 11
93
+ - 12
94
+ - 41
95
+ - 49
96
+ - 51
97
+ - 53
98
+ - 55
99
+ - 71
100
+ - 98
101
+ - 100
102
+ - 105
103
+ "2":
104
+ - 1
105
+ - 2
106
+ - 3
107
+ - 5
108
+ - 11
109
+ - 12
110
+ - 41
111
+ - 49
112
+ - 51
113
+ - 53
114
+ - 55
115
+ - 71
116
+ - 98
117
+ - 100
118
+ - 105
119
+ "3":
120
+ - 2
121
+ - 7
122
+ - 13
123
+ - 19
124
+ - 23
125
+ - 55
126
+ - 62
127
+ - 66
128
+ "4": 3.10.12
129
+ "5": 0.19.0
130
+ "6": 4.47.0
131
+ "8":
132
+ - 1
133
+ - 5
134
+ - 12
135
+ "9":
136
+ "1": transformers_trainer
137
+ "12": 0.19.0
138
+ "13": linux-x86_64
139
+ accelerator_config:
140
+ value:
141
+ dispatch_batches: null
142
+ even_batches: true
143
+ gradient_accumulation_kwargs: null
144
+ non_blocking: false
145
+ split_batches: false
146
+ use_seedable_sampler: true
147
+ adafactor:
148
+ value: false
149
+ adam_beta1:
150
+ value: 0.9
151
+ adam_beta2:
152
+ value: 0.999
153
+ adam_epsilon:
154
+ value: 1e-08
155
+ add_cross_attention:
156
+ value: false
157
+ architectures:
158
+ value:
159
+ - BertForMaskedLM
160
+ attention_probs_dropout_prob:
161
+ value: 0.05
162
+ auto_find_batch_size:
163
+ value: false
164
+ average_tokens_across_devices:
165
+ value: false
166
+ bad_words_ids:
167
+ value: null
168
+ batch_eval_metrics:
169
+ value: false
170
+ begin_suppress_tokens:
171
+ value: null
172
+ bf16:
173
+ value: false
174
+ bf16_full_eval:
175
+ value: false
176
+ bos_token_id:
177
+ value: null
178
+ chunk_size_feed_forward:
179
+ value: 0
180
+ classifier_dropout:
181
+ value: null
182
+ cross_attention_hidden_size:
183
+ value: null
184
+ data_seed:
185
+ value: null
186
+ dataloader_drop_last:
187
+ value: false
188
+ dataloader_num_workers:
189
+ value: 0
190
+ dataloader_persistent_workers:
191
+ value: false
192
+ dataloader_pin_memory:
193
+ value: true
194
+ dataloader_prefetch_factor:
195
+ value: null
196
+ ddp_backend:
197
+ value: null
198
+ ddp_broadcast_buffers:
199
+ value: null
200
+ ddp_bucket_cap_mb:
201
+ value: null
202
+ ddp_find_unused_parameters:
203
+ value: null
204
+ ddp_timeout:
205
+ value: 1800
206
+ debug:
207
+ value: []
208
+ decoder_start_token_id:
209
+ value: null
210
+ deepspeed:
211
+ value: null
212
+ directionality:
213
+ value: bidi
214
+ disable_tqdm:
215
+ value: false
216
+ dispatch_batches:
217
+ value: null
218
+ diversity_penalty:
219
+ value: 0
220
+ do_eval:
221
+ value: true
222
+ do_predict:
223
+ value: false
224
+ do_sample:
225
+ value: false
226
+ do_train:
227
+ value: false
228
+ early_stopping:
229
+ value: false
230
+ encoder_no_repeat_ngram_size:
231
+ value: 0
232
+ eos_token_id:
233
+ value: null
234
+ eval_accumulation_steps:
235
+ value: null
236
+ eval_delay:
237
+ value: 0
238
+ eval_do_concat_batches:
239
+ value: true
240
+ eval_on_start:
241
+ value: false
242
+ eval_steps:
243
+ value: 102
244
+ eval_strategy:
245
+ value: epoch
246
+ eval_use_gather_object:
247
+ value: false
248
+ evaluation_strategy:
249
+ value: null
250
+ exponential_decay_length_penalty:
251
+ value: null
252
+ finetuning_task:
253
+ value: null
254
+ forced_bos_token_id:
255
+ value: null
256
+ forced_eos_token_id:
257
+ value: null
258
+ fp16:
259
+ value: false
260
+ fp16_backend:
261
+ value: auto
262
+ fp16_full_eval:
263
+ value: false
264
+ fp16_opt_level:
265
+ value: O1
266
+ fsdp:
267
+ value: []
268
+ fsdp_config:
269
+ value:
270
+ min_num_params: 0
271
+ xla: false
272
+ xla_fsdp_grad_ckpt: false
273
+ xla_fsdp_v2: false
274
+ fsdp_min_num_params:
275
+ value: 0
276
+ fsdp_transformer_layer_cls_to_wrap:
277
+ value: null
278
+ full_determinism:
279
+ value: false
280
+ gradient_accumulation_steps:
281
+ value: 1
282
+ gradient_checkpointing:
283
+ value: false
284
+ gradient_checkpointing_kwargs:
285
+ value: null
286
+ greater_is_better:
287
+ value: true
288
+ group_by_length:
289
+ value: false
290
+ half_precision_backend:
291
+ value: auto
292
+ hidden_act:
293
+ value: gelu
294
+ hidden_dropout_prob:
295
+ value: 0.05
296
+ hidden_size:
297
+ value: 1024
298
+ hub_always_push:
299
+ value: false
300
+ hub_model_id:
301
+ value: null
302
+ hub_private_repo:
303
+ value: null
304
+ hub_strategy:
305
+ value: every_save
306
+ hub_token:
307
+ value: <HUB_TOKEN>
308
+ id2label:
309
+ value:
310
+ "0": "0"
311
+ "1": "1"
312
+ ignore_data_skip:
313
+ value: false
314
+ include_for_metrics:
315
+ value: []
316
+ include_inputs_for_metrics:
317
+ value: false
318
+ include_num_input_tokens_seen:
319
+ value: false
320
+ include_tokens_per_second:
321
+ value: false
322
+ initializer_range:
323
+ value: 0.02
324
+ intermediate_size:
325
+ value: 4096
326
+ is_decoder:
327
+ value: false
328
+ is_encoder_decoder:
329
+ value: false
330
+ jit_mode_eval:
331
+ value: false
332
+ label_names:
333
+ value: null
334
+ label_smoothing_factor:
335
+ value: 0
336
+ label2id:
337
+ value:
338
+ "0": 0
339
+ "1": 1
340
+ layer_norm_eps:
341
+ value: 1e-12
342
+ learning_rate:
343
+ value: 1e-05
344
+ length_column_name:
345
+ value: length
346
+ length_penalty:
347
+ value: 1
348
+ load_best_model_at_end:
349
+ value: true
350
+ local_rank:
351
+ value: 0
352
+ log_level:
353
+ value: passive
354
+ log_level_replica:
355
+ value: warning
356
+ log_on_each_node:
357
+ value: true
358
+ logging_dir:
359
+ value: ./runs/Dec11_04-30-35_fd55a770be24
360
+ logging_first_step:
361
+ value: false
362
+ logging_nan_inf_filter:
363
+ value: true
364
+ logging_steps:
365
+ value: 10
366
+ logging_strategy:
367
+ value: steps
368
+ lr_scheduler_type:
369
+ value: linear
370
+ max_grad_norm:
371
+ value: 1
372
+ max_length:
373
+ value: 20
374
+ max_position_embeddings:
375
+ value: 512
376
+ max_steps:
377
+ value: -1
378
+ metric_for_best_model:
379
+ value: f1
380
+ min_length:
381
+ value: 0
382
+ model/num_parameters:
383
+ value: 334398466
384
+ model_type:
385
+ value: bert
386
+ mp_parameters:
387
+ value: ""
388
+ neftune_noise_alpha:
389
+ value: null
390
+ no_cuda:
391
+ value: false
392
+ no_repeat_ngram_size:
393
+ value: 0
394
+ num_attention_heads:
395
+ value: 16
396
+ num_beam_groups:
397
+ value: 1
398
+ num_beams:
399
+ value: 1
400
+ num_hidden_layers:
401
+ value: 24
402
+ num_return_sequences:
403
+ value: 1
404
+ num_train_epochs:
405
+ value: 18
406
+ optim:
407
+ value: adamw_torch
408
+ optim_args:
409
+ value: null
410
+ optim_target_modules:
411
+ value: null
412
+ output_attentions:
413
+ value: false
414
+ output_dir:
415
+ value: .
416
+ output_hidden_states:
417
+ value: false
418
+ output_past:
419
+ value: true
420
+ output_scores:
421
+ value: false
422
+ overwrite_output_dir:
423
+ value: false
424
+ pad_token_id:
425
+ value: 0
426
+ past_index:
427
+ value: -1
428
+ per_device_eval_batch_size:
429
+ value: 32
430
+ per_device_train_batch_size:
431
+ value: 32
432
+ per_gpu_eval_batch_size:
433
+ value: null
434
+ per_gpu_train_batch_size:
435
+ value: null
436
+ pooler_fc_size:
437
+ value: 768
438
+ pooler_num_attention_heads:
439
+ value: 12
440
+ pooler_num_fc_layers:
441
+ value: 3
442
+ pooler_size_per_head:
443
+ value: 128
444
+ pooler_type:
445
+ value: first_token_transform
446
+ position_embedding_type:
447
+ value: absolute
448
+ prediction_loss_only:
449
+ value: false
450
+ prefix:
451
+ value: null
452
+ problem_type:
453
+ value: null
454
+ push_to_hub:
455
+ value: false
456
+ push_to_hub_model_id:
457
+ value: null
458
+ push_to_hub_organization:
459
+ value: null
460
+ push_to_hub_token:
461
+ value: <PUSH_TO_HUB_TOKEN>
462
+ ray_scope:
463
+ value: last
464
+ remove_invalid_values:
465
+ value: false
466
+ remove_unused_columns:
467
+ value: true
468
+ repetition_penalty:
469
+ value: 1
470
+ report_to:
471
+ value:
472
+ - wandb
473
+ restore_callback_states_from_checkpoint:
474
+ value: false
475
+ resume_from_checkpoint:
476
+ value: null
477
+ return_dict:
478
+ value: true
479
+ return_dict_in_generate:
480
+ value: false
481
+ run_name:
482
+ value: m2-congretimbau_tunado
483
+ save_on_each_node:
484
+ value: false
485
+ save_only_model:
486
+ value: false
487
+ save_safetensors:
488
+ value: true
489
+ save_steps:
490
+ value: 204
491
+ save_strategy:
492
+ value: epoch
493
+ save_total_limit:
494
+ value: 1
495
+ seed:
496
+ value: 5151
497
+ sep_token_id:
498
+ value: null
499
+ skip_memory_metrics:
500
+ value: true
501
+ split_batches:
502
+ value: null
503
+ suppress_tokens:
504
+ value: null
505
+ task_specific_params:
506
+ value: null
507
+ temperature:
508
+ value: 1
509
+ tf_legacy_loss:
510
+ value: false
511
+ tf32:
512
+ value: null
513
+ tie_encoder_decoder:
514
+ value: false
515
+ tie_word_embeddings:
516
+ value: true
517
+ tokenizer_class:
518
+ value: null
519
+ top_k:
520
+ value: 50
521
+ top_p:
522
+ value: 1
523
+ torch_compile:
524
+ value: false
525
+ torch_compile_backend:
526
+ value: null
527
+ torch_compile_mode:
528
+ value: null
529
+ torch_dtype:
530
+ value: float32
531
+ torch_empty_cache_steps:
532
+ value: null
533
+ torchdynamo:
534
+ value: null
535
+ torchscript:
536
+ value: false
537
+ tpu_metrics_debug:
538
+ value: false
539
+ tpu_num_cores:
540
+ value: null
541
+ transformers_version:
542
+ value: 4.47.0
543
+ type_vocab_size:
544
+ value: 2
545
+ typical_p:
546
+ value: 1
547
+ use_bfloat16:
548
+ value: false
549
+ use_cache:
550
+ value: true
551
+ use_cpu:
552
+ value: false
553
+ use_ipex:
554
+ value: false
555
+ use_legacy_prediction_loop:
556
+ value: false
557
+ use_liger_kernel:
558
+ value: false
559
+ use_mps_device:
560
+ value: false
561
+ vocab_size:
562
+ value: 29794
563
+ warmup_ratio:
564
+ value: 0
565
+ warmup_steps:
566
+ value: 200
567
+ weight_decay:
568
+ value: 0.02
wandb/run-20241211_043031-pfhjmc9v/files/output.log ADDED
@@ -0,0 +1,8 @@
 
 
 
 
 
 
 
 
 
1
+ precision recall f1-score support
2
+
3
+ 0 0.91 0.89 0.90 114
4
+ 1 0.66 0.70 0.68 33
5
+
6
+ accuracy 0.85 147
7
+ macro avg 0.78 0.80 0.79 147
8
+ weighted avg 0.85 0.85 0.85 147
wandb/run-20241211_043031-pfhjmc9v/files/requirements.txt ADDED
@@ -0,0 +1,578 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ multiprocess==0.70.16
2
+ accelerate==1.2.0
3
+ fsspec==2024.9.0
4
+ wandb==0.19.0
5
+ transformers==4.47.0
6
+ evaluate==0.4.3
7
+ dill==0.3.8
8
+ tokenizers==0.21.0
9
+ datasets==3.2.0
10
+ xxhash==3.5.0
11
+ google-colab==1.0.0
12
+ termcolor==2.5.0
13
+ numexpr==2.10.2
14
+ duckdb==1.1.3
15
+ slicer==0.0.8
16
+ pandas-stubs==2.2.2.240909
17
+ yarl==1.18.3
18
+ jupyter-client==6.1.12
19
+ umf==0.9.1
20
+ aiohappyeyeballs==2.4.4
21
+ tensorflow-io-gcs-filesystem==0.37.1
22
+ nibabel==5.3.2
23
+ google-api-python-client==2.151.0
24
+ gensim==4.3.3
25
+ plotnine==0.14.3
26
+ stanio==0.5.1
27
+ pyogrio==0.10.0
28
+ nvidia-curand-cu12==10.3.7.77
29
+ Bottleneck==1.4.2
30
+ openpyxl==3.1.5
31
+ audioread==3.0.1
32
+ libcudf-cu12==24.10.1
33
+ networkx==3.4.2
34
+ cupy-cuda12x==12.2.0
35
+ nx-cugraph-cu12==24.10.0
36
+ aiosignal==1.3.1
37
+ StrEnum==0.4.15
38
+ jsonpickle==4.0.0
39
+ google-cloud-firestore==2.19.0
40
+ scipy==1.13.1
41
+ colour==0.1.5
42
+ pycparser==2.22
43
+ cmake==3.30.5
44
+ fastdownload==0.0.7
45
+ google-cloud-functions==1.18.1
46
+ imagesize==1.4.1
47
+ imageio==2.36.1
48
+ frozendict==2.4.6
49
+ ndindex==1.9.2
50
+ html5lib==1.1
51
+ python-louvain==0.16
52
+ Cython==3.0.11
53
+ fastai==2.7.18
54
+ cuda-python==12.2.1
55
+ parsy==2.1
56
+ smmap==5.0.1
57
+ weasel==0.4.1
58
+ pyasn1==0.6.1
59
+ matplotlib==3.8.0
60
+ immutabledict==4.2.1
61
+ dopamine_rl==4.0.9
62
+ uc-micro-py==1.0.3
63
+ websocket-client==1.8.0
64
+ anyio==3.7.1
65
+ msgpack==1.1.0
66
+ multitasking==0.0.11
67
+ albucore==0.0.19
68
+ scikit-image==0.24.0
69
+ setproctitle==1.3.4
70
+ marisa-trie==1.2.1
71
+ nvidia-cuda-nvcc-cu12==12.6.85
72
+ sphinxcontrib-qthelp==2.0.0
73
+ tables==3.10.1
74
+ kaggle==1.6.17
75
+ google-cloud-core==2.4.1
76
+ soupsieve==2.6
77
+ opencv-contrib-python==4.10.0.84
78
+ python-box==7.2.0
79
+ fonttools==4.55.1
80
+ Markdown==3.7
81
+ googleapis-common-protos==1.66.0
82
+ tensorflow-metadata==1.13.1
83
+ google-auth-httplib2==0.2.0
84
+ autograd==1.7.0
85
+ pydantic_core==2.27.1
86
+ pexpect==4.9.0
87
+ tweepy==4.14.0
88
+ pyshp==2.3.1
89
+ sklearn-pandas==2.2.0
90
+ toml==0.10.2
91
+ pydotplus==2.0.2
92
+ click==8.1.7
93
+ optree==0.13.1
94
+ catalogue==2.0.10
95
+ google-api-core==2.19.2
96
+ en-core-web-sm==3.7.1
97
+ tensorflow-datasets==4.9.7
98
+ pynvjitlink-cu12==0.4.0
99
+ pytest==8.3.4
100
+ portpicker==1.5.2
101
+ dm-tree==0.1.8
102
+ h5py==3.12.1
103
+ pandas-gbq==0.24.0
104
+ qdldl==0.1.7.post4
105
+ partd==1.4.2
106
+ wrapt==1.17.0
107
+ blis==0.7.11
108
+ textblob==0.17.1
109
+ jupyterlab_widgets==3.0.13
110
+ filelock==3.16.1
111
+ oauth2client==4.1.3
112
+ iniconfig==2.0.0
113
+ notebook==6.5.5
114
+ pydot==3.0.3
115
+ tomli==2.2.1
116
+ py4j==0.10.9.7
117
+ langchain-text-splitters==0.3.2
118
+ typer==0.15.0
119
+ contourpy==1.3.1
120
+ aiohttp==3.11.9
121
+ langchain==0.3.9
122
+ datascience==0.17.6
123
+ etuples==0.3.9
124
+ six==1.16.0
125
+ tornado==6.3.3
126
+ torchaudio==2.5.1+cu121
127
+ httpcore==1.0.7
128
+ importlib_metadata==8.5.0
129
+ holoviews==1.20.0
130
+ google-ai-generativelanguage==0.6.10
131
+ osqp==0.6.7.post3
132
+ pymc==5.18.2
133
+ google-pasta==0.2.0
134
+ py-cpuinfo==9.0.0
135
+ colorcet==3.1.0
136
+ blosc2==2.7.1
137
+ nvidia-cuda-cupti-cu12==12.6.80
138
+ colorlover==0.3.0
139
+ pymystem3==0.2.0
140
+ MarkupSafe==3.0.2
141
+ astunparse==1.6.3
142
+ google-cloud-bigquery==3.25.0
143
+ opencv-python==4.10.0.84
144
+ Flask==3.0.3
145
+ fastjsonschema==2.21.1
146
+ pylibcugraph-cu12==24.10.0
147
+ google-cloud-datastore==2.20.1
148
+ jaxlib==0.4.33
149
+ rich==13.9.4
150
+ widgetsnbextension==3.6.10
151
+ vega-datasets==0.9.0
152
+ natsort==8.4.0
153
+ blinker==1.9.0
154
+ statsmodels==0.14.4
155
+ sniffio==1.3.1
156
+ cymem==2.0.10
157
+ threadpoolctl==3.5.0
158
+ proto-plus==1.25.0
159
+ httpimport==1.4.0
160
+ matplotlib-inline==0.1.7
161
+ ratelim==0.1.6
162
+ python-utils==3.9.1
163
+ prompt_toolkit==3.0.48
164
+ gspread==6.0.2
165
+ google-cloud-storage==2.8.0
166
+ PyDrive==1.3.1
167
+ tensorflow-probability==0.24.0
168
+ xarray==2024.10.0
169
+ eerepr==0.0.4
170
+ murmurhash==1.0.11
171
+ pylibcudf-cu12==24.10.1
172
+ music21==9.3.0
173
+ geographiclib==2.0
174
+ smart-open==7.0.5
175
+ pygit2==1.16.0
176
+ more-itertools==10.5.0
177
+ cycler==0.12.1
178
+ idna==3.10
179
+ h11==0.14.0
180
+ ipython-genutils==0.2.0
181
+ referencing==0.35.1
182
+ GitPython==3.1.43
183
+ sentencepiece==0.2.0
184
+ CacheControl==0.14.1
185
+ tf_keras==2.17.0
186
+ annotated-types==0.7.0
187
+ ecos==2.0.14
188
+ geopandas==1.0.1
189
+ jieba==0.42.1
190
+ shapely==2.0.6
191
+ ipyparallel==8.8.0
192
+ pyOpenSSL==24.2.1
193
+ pyarrow==17.0.0
194
+ opencv-python-headless==4.10.0.84
195
+ openai==1.54.5
196
+ sentry-sdk==2.19.0
197
+ pyarrow-hotfix==0.6
198
+ gitdb==4.0.11
199
+ prophet==1.1.6
200
+ ipython-sql==0.5.0
201
+ tf-slim==1.1.0
202
+ PySocks==1.7.1
203
+ imbalanced-learn==0.12.4
204
+ mlxtend==0.23.3
205
+ mkl==2025.0.1
206
+ itsdangerous==2.2.0
207
+ gast==0.6.0
208
+ torchsummary==1.5.1
209
+ urllib3==2.2.3
210
+ earthengine-api==1.2.0
211
+ jupyter-server==1.24.0
212
+ webencodings==0.5.1
213
+ seaborn==0.13.2
214
+ SQLAlchemy==2.0.36
215
+ pyerfa==2.0.1.5
216
+ xarray-einstats==0.8.0
217
+ jsonschema==4.23.0
218
+ parso==0.8.4
219
+ tensorflow-hub==0.16.1
220
+ ipyfilechooser==0.6.0
221
+ packaging==24.2
222
+ param==2.1.1
223
+ et_xmlfile==2.0.0
224
+ kagglehub==0.3.4
225
+ yfinance==0.2.50
226
+ spacy==3.7.5
227
+ imgaug==0.4.0
228
+ sqlparse==0.5.2
229
+ certifi==2024.8.30
230
+ plotly==5.24.1
231
+ pandas-datareader==0.10.0
232
+ charset-normalizer==3.4.0
233
+ cloudpathlib==0.20.0
234
+ decorator==4.4.2
235
+ prometheus_client==0.21.1
236
+ google-auth==2.27.0
237
+ pyzmq==24.0.1
238
+ tzdata==2024.2
239
+ scs==3.2.7
240
+ types-pytz==2024.2.0.20241003
241
+ db-dtypes==1.3.1
242
+ chex==0.1.87
243
+ spacy-loggers==1.0.5
244
+ peewee==3.17.8
245
+ nvidia-cuda-runtime-cu12==12.6.77
246
+ ibis-framework==9.2.0
247
+ wasabi==1.1.3
248
+ python-dateutil==2.8.2
249
+ google-generativeai==0.8.3
250
+ google-cloud-aiplatform==1.73.0
251
+ jupyterlab_pygments==0.3.0
252
+ google-cloud-translate==3.17.0
253
+ exceptiongroup==1.2.2
254
+ intel-openmp==2025.0.3
255
+ nvidia-cufft-cu12==11.3.0.4
256
+ timm==1.0.12
257
+ python-slugify==8.0.4
258
+ cryptography==43.0.3
259
+ multipledispatch==1.0.0
260
+ google-crc32c==1.6.0
261
+ rsa==4.9
262
+ cffi==1.17.1
263
+ holidays==0.62
264
+ tinycss2==1.4.0
265
+ google-cloud-iam==2.16.1
266
+ google-cloud-resource-manager==1.13.1
267
+ typeguard==4.4.1
268
+ xlrd==2.0.1
269
+ google==2.0.3
270
+ pandocfilters==1.5.1
271
+ promise==2.3
272
+ progressbar2==4.5.0
273
+ wheel==0.45.1
274
+ google-resumable-media==2.7.2
275
+ einops==0.8.0
276
+ defusedxml==0.7.1
277
+ llvmlite==0.43.0
278
+ attrs==24.2.0
279
+ preshed==3.0.9
280
+ geocoder==1.38.1
281
+ firebase-admin==6.5.0
282
+ ipython==7.34.0
283
+ cvxopt==1.3.2
284
+ greenlet==3.1.1
285
+ confection==0.1.5
286
+ keras==3.5.0
287
+ pandas==2.2.2
288
+ graphviz==0.20.3
289
+ tenacity==9.0.0
290
+ future==1.0.0
291
+ inflect==7.4.0
292
+ langsmith==0.1.147
293
+ babel==2.16.0
294
+ nvidia-cusparse-cu12==12.5.4.2
295
+ importlib_resources==6.4.5
296
+ jiter==0.8.0
297
+ sphinxcontrib-htmlhelp==2.1.0
298
+ moviepy==1.0.3
299
+ typing_extensions==4.12.2
300
+ tcmlib==1.2.0
301
+ snowballstemmer==2.2.0
302
+ scikit-learn==1.5.2
303
+ jsonpatch==1.33
304
+ notebook_shim==0.2.4
305
+ mizani==0.13.0
306
+ simple-parsing==0.1.6
307
+ humanize==4.11.0
308
+ hyperopt==0.2.7
309
+ frozenlist==1.5.0
310
+ uritemplate==4.1.1
311
+ pydata-google-auth==1.9.0
312
+ pyperclip==1.9.0
313
+ lazy_loader==0.4
314
+ bigframes==1.27.0
315
+ numba==0.60.0
316
+ pytensor==2.26.4
317
+ geemap==0.35.1
318
+ polars==1.9.0
319
+ orjson==3.10.12
320
+ fastrlock==0.8.2
321
+ mistune==3.0.2
322
+ sphinxcontrib-applehelp==2.0.0
323
+ mpmath==1.3.0
324
+ mdurl==0.1.2
325
+ async-timeout==4.0.3
326
+ jax==0.4.33
327
+ panel==1.5.4
328
+ patsy==1.0.1
329
+ oauthlib==3.2.2
330
+ librosa==0.10.2.post1
331
+ imutils==0.5.4
332
+ xyzservices==2024.9.0
333
+ Pygments==2.18.0
334
+ sphinxcontrib-jsmath==1.0.1
335
+ pyasn1_modules==0.4.1
336
+ pyproj==3.7.0
337
+ Pyomo==6.8.2
338
+ nvidia-cublas-cu12==12.6.4.1
339
+ numpy==1.26.4
340
+ google-cloud-bigquery-storage==2.27.0
341
+ soxr==0.5.0.post1
342
+ bleach==6.2.0
343
+ sympy==1.13.1
344
+ dask==2024.10.0
345
+ miniKanren==1.0.3
346
+ pathlib==1.0.1
347
+ torch==2.5.1+cu121
348
+ distro==1.9.0
349
+ gym==0.25.2
350
+ lxml==5.3.0
351
+ folium==0.18.0
352
+ altair==4.2.2
353
+ tifffile==2024.9.20
354
+ diffusers==0.31.0
355
+ requests-oauthlib==1.3.1
356
+ google-cloud-language==2.15.1
357
+ ptyprocess==0.7.0
358
+ tensorboard-data-server==0.7.2
359
+ locket==1.0.0
360
+ intel-cmplr-lib-ur==2025.0.3
361
+ lightgbm==4.5.0
362
+ flax==0.8.5
363
+ mdit-py-plugins==0.4.2
364
+ pygame==2.6.1
365
+ pytz==2024.2
366
+ pyparsing==3.2.0
367
+ alabaster==1.0.0
368
+ jupyter_core==5.7.2
369
+ cloudpickle==3.1.0
370
+ logical-unification==0.4.6
371
+ traittypes==0.2.1
372
+ terminado==0.18.1
373
+ beautifulsoup4==4.12.3
374
+ google-cloud-bigquery-connection==1.16.1
375
+ scooby==0.10.0
376
+ google-auth-oauthlib==1.2.1
377
+ protobuf==4.25.5
378
+ astropy==6.1.7
379
+ GDAL==3.6.4
380
+ nbformat==5.10.4
381
+ safetensors==0.4.5
382
+ tzlocal==5.2
383
+ chardet==5.2.0
384
+ psutil==5.9.5
385
+ jsonpointer==3.0.0
386
+ psycopg2==2.9.10
387
+ cachetools==5.5.0
388
+ pyspark==3.5.3
389
+ google-cloud-pubsub==2.27.1
390
+ pillow==11.0.0
391
+ pydantic==2.10.3
392
+ langcodes==3.5.0
393
+ jsonschema-specifications==2024.10.1
394
+ markdown-it-py==3.0.0
395
+ nltk==3.9.1
396
+ nbconvert==7.16.4
397
+ pooch==1.8.2
398
+ huggingface-hub==0.26.3
399
+ tbb==2022.0.0
400
+ ipywidgets==7.7.1
401
+ matplotlib-venn==1.1.1
402
+ rmm-cu12==24.10.0
403
+ sphinxcontrib-serializinghtml==2.0.0
404
+ branca==0.8.0
405
+ gin-config==0.5.0
406
+ bqplot==0.12.43
407
+ opentelemetry-sdk==1.28.2
408
+ community==1.0.0b1
409
+ soundfile==0.12.1
410
+ shap==0.46.0
411
+ joblib==1.4.2
412
+ entrypoints==0.4
413
+ docstring_parser==0.16
414
+ multidict==6.1.0
415
+ grpcio-status==1.62.3
416
+ wcwidth==0.2.13
417
+ geopy==2.4.1
418
+ prettytable==3.12.0
419
+ propcache==0.2.1
420
+ ml-dtypes==0.4.1
421
+ atpublic==4.1.0
422
+ stringzilla==3.11.0
423
+ libclang==18.1.1
424
+ arviz==0.20.0
425
+ requests==2.32.3
426
+ fastprogress==1.0.3
427
+ dlib==19.24.2
428
+ platformdirs==4.3.6
429
+ tabulate==0.9.0
430
+ cufflinks==0.17.3
431
+ spacy-legacy==3.0.12
432
+ grpcio==1.68.1
433
+ cvxpy==1.5.4
434
+ flatbuffers==24.3.25
435
+ pluggy==1.5.0
436
+ regex==2024.9.11
437
+ clarabel==0.9.0
438
+ httplib2==0.22.0
439
+ language_data==1.3.0
440
+ backcall==0.2.0
441
+ nbclient==0.10.1
442
+ ipyleaflet==0.19.2
443
+ PyJWT==2.10.1
444
+ toolz==0.12.1
445
+ opt_einsum==3.4.0
446
+ nvtx==0.2.10
447
+ nvidia-nccl-cu12==2.23.4
448
+ debugpy==1.8.0
449
+ sphinxcontrib-devhelp==2.0.0
450
+ jax-cuda12-pjrt==0.4.33
451
+ missingno==0.5.2
452
+ proglog==0.1.10
453
+ nest-asyncio==1.6.0
454
+ nvidia-cusolver-cu12==11.7.1.2
455
+ gspread-dataframe==3.3.1
456
+ bigquery-magics==0.4.0
457
+ jellyfish==1.1.2
458
+ text-unidecode==1.3
459
+ Deprecated==1.2.15
460
+ srsly==2.4.8
461
+ ipytree==0.2.2
462
+ gcsfs==2024.10.0
463
+ ipyevents==2.0.2
464
+ docker-pycreds==0.4.0
465
+ glob2==0.7
466
+ wordcloud==1.9.4
467
+ traitlets==5.7.1
468
+ nbclassic==1.1.0
469
+ nvidia-cudnn-cu12==9.6.0.74
470
+ easydict==1.13
471
+ httpx==0.28.0
472
+ Jinja2==3.1.4
473
+ argon2-cffi==23.1.0
474
+ linkify-it-py==2.0.3
475
+ optax==0.2.4
476
+ cons==0.4.6
477
+ requests-toolbelt==1.0.0
478
+ editdistance==0.8.1
479
+ jupyter-leaflet==0.19.2
480
+ webcolors==24.11.1
481
+ jax-cuda12-plugin==0.4.33
482
+ albumentations==1.4.20
483
+ PyOpenGL==3.1.7
484
+ tqdm==4.66.6
485
+ googledrivedownloader==0.4
486
+ peft==0.13.2
487
+ absl-py==1.4.0
488
+ eval_type_backport==0.2.0
489
+ opentelemetry-api==1.28.2
490
+ Send2Trash==1.8.3
491
+ gdown==5.2.0
492
+ google-cloud-bigtable==2.27.0
493
+ grpc-google-iam-v1==0.13.1
494
+ rpds-py==0.22.3
495
+ xgboost==2.1.3
496
+ pyviz_comms==3.0.3
497
+ ipykernel==5.5.6
498
+ gym-notices==0.0.8
499
+ opentelemetry-semantic-conventions==0.49b2
500
+ fastcore==1.7.22
501
+ torchvision==0.20.1+cu121
502
+ Sphinx==8.1.3
503
+ ply==3.11
504
+ tensorboard==2.17.1
505
+ namex==0.0.8
506
+ astropy-iers-data==0.2024.12.2.0.35.34
507
+ rpy2==3.4.2
508
+ nvidia-nvjitlink-cu12==12.6.85
509
+ array_record==0.5.1
510
+ PyYAML==6.0.2
511
+ tensorflow==2.17.1
512
+ yellowbrick==1.5
513
+ pylibraft-cu12==24.10.0
514
+ orbax-checkpoint==0.6.4
515
+ Werkzeug==3.1.3
516
+ jupyter-console==6.1.0
517
+ thinc==8.2.5
518
+ etils==1.11.0
519
+ langchain-core==0.3.21
520
+ cmdstanpy==1.2.4
521
+ h5netcdf==1.4.1
522
+ cudf-cu12==24.10.1
523
+ zipp==3.21.0
524
+ PyDrive2==1.21.3
525
+ kiwisolver==1.4.7
526
+ pickleshare==0.7.5
527
+ bokeh==3.6.2
528
+ shellingham==1.5.4
529
+ imageio-ffmpeg==0.5.1
530
+ pycocotools==2.0.8
531
+ sqlglot==25.1.0
532
+ tensorstore==0.1.69
533
+ argon2-cffi-bindings==21.2.0
534
+ sentence-transformers==3.2.1
535
+ docutils==0.21.2
536
+ python-apt==0.0.0
537
+ setuptools==75.1.0
538
+ pip==24.1.2
539
+ requirements-parser==0.9.0
540
+ types-setuptools==75.6.0.20241126
541
+ importlib-metadata==4.6.4
542
+ blinker==1.4
543
+ PyJWT==2.3.0
544
+ PyGObject==3.42.1
545
+ cryptography==3.4.8
546
+ pyparsing==2.4.7
547
+ wadllib==1.3.6
548
+ more-itertools==8.10.0
549
+ zipp==1.0.0
550
+ distro==1.7.0
551
+ six==1.16.0
552
+ python-apt==2.4.0+ubuntu4
553
+ jeepney==0.7.1
554
+ SecretStorage==3.3.1
555
+ oauthlib==3.2.0
556
+ lazr.restfulclient==0.14.4
557
+ httplib2==0.20.2
558
+ launchpadlib==1.10.16
559
+ keyring==23.5.0
560
+ lazr.uri==1.0.6
561
+ dbus-python==1.2.18
562
+ inflect==7.3.1
563
+ wheel==0.43.0
564
+ packaging==24.1
565
+ importlib_metadata==8.0.0
566
+ more-itertools==10.3.0
567
+ platformdirs==4.2.2
568
+ backports.tarfile==1.2.0
569
+ typing_extensions==4.12.2
570
+ jaraco.functools==4.0.1
571
+ jaraco.collections==5.1.0
572
+ tomli==2.0.1
573
+ importlib_resources==6.4.0
574
+ jaraco.text==3.12.1
575
+ jaraco.context==5.3.0
576
+ typeguard==4.3.0
577
+ autocommand==2.2.2
578
+ zipp==3.19.2
wandb/run-20241211_043031-pfhjmc9v/files/wandb-metadata.json ADDED
@@ -0,0 +1,38 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "os": "Linux-6.1.85+-x86_64-with-glibc2.35",
3
+ "python": "CPython 3.10.12",
4
+ "startedAt": "2024-12-11T04:30:31.644444Z",
5
+ "program": "m2-congretimbau-focal_loss-sempartido.ipynb",
6
+ "email": "[email protected]",
7
+ "root": "/content",
8
+ "host": "fd55a770be24",
9
+ "username": "root",
10
+ "executable": "/usr/bin/python3",
11
+ "colab": "https://colab.research.google.com/notebook#fileId=1a_b6x6zKkekdeqrCzP83ovWk7MhdV3DO",
12
+ "cpu_count": 6,
13
+ "cpu_count_logical": 12,
14
+ "gpu": "NVIDIA A100-SXM4-40GB",
15
+ "gpu_count": 1,
16
+ "disk": {
17
+ "/": {
18
+ "total": "120942624768",
19
+ "used": "36615233536"
20
+ }
21
+ },
22
+ "memory": {
23
+ "total": "89632796672"
24
+ },
25
+ "cpu": {
26
+ "count": 6,
27
+ "countLogical": 12
28
+ },
29
+ "gpu_nvidia": [
30
+ {
31
+ "name": "NVIDIA A100-SXM4-40GB",
32
+ "memoryTotal": "42949672960",
33
+ "cudaCores": 6912,
34
+ "architecture": "Ampere"
35
+ }
36
+ ],
37
+ "cudaVersion": "12.2"
38
+ }
wandb/run-20241211_043031-pfhjmc9v/files/wandb-summary.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"train_samples_per_second":52.456,"_step":41,"eval/f1":0.7895627277459656,"train/loss":0.0067,"train_loss":0.08594005133897539,"train/global_step":315,"train/epoch":9,"train/learning_rate":7.44186046511628e-06,"eval/recall":0.79585326953748,"train/grad_norm":0.3785024583339691,"eval/accuracy":0.8503401360544217,"_timestamp":1.7338918220029504e+09,"train_runtime":383.9768,"eval/samples_per_second":104.597,"eval/runtime":1.4054,"_wandb":{"runtime":393},"train_steps_per_second":1.641,"eval/loss":0.1075880378484726,"eval/precision":0.7839285714285714,"eval/steps_per_second":3.558,"total_flos":4.03282376409528e+15,"_runtime":390.313178795}
wandb/run-20241211_043031-pfhjmc9v/logs/debug-core.log ADDED
@@ -0,0 +1,9 @@
 
 
 
 
 
 
 
 
 
 
1
+ {"time":"2024-12-11T04:30:31.400373612Z","level":"INFO","msg":"started logging, with flags","port-filename":"/tmp/tmpvhcya0ch/port-2341.txt","pid":2341,"debug":false,"disable-analytics":false}
2
+ {"time":"2024-12-11T04:30:31.40040802Z","level":"INFO","msg":"FeatureState","shutdownOnParentExitEnabled":false}
3
+ {"time":"2024-12-11T04:30:31.406301911Z","level":"INFO","msg":"Will exit if parent process dies.","ppid":2341}
4
+ {"time":"2024-12-11T04:30:31.406260591Z","level":"INFO","msg":"server is running","addr":{"IP":"127.0.0.1","Port":40057,"Zone":""}}
5
+ {"time":"2024-12-11T04:30:31.589054926Z","level":"INFO","msg":"connection: ManageConnectionData: new connection created","id":"127.0.0.1:50024"}
6
+ {"time":"2024-12-11T04:30:31.645539858Z","level":"INFO","msg":"handleInformInit: received","streamId":"pfhjmc9v","id":"127.0.0.1:50024"}
7
+ {"time":"2024-12-11T04:30:31.748169115Z","level":"INFO","msg":"handleInformInit: stream started","streamId":"pfhjmc9v","id":"127.0.0.1:50024"}
8
+ {"time":"2024-12-11T04:37:06.688042547Z","level":"INFO","msg":"handleInformFinish: finish message received","streamId":"pfhjmc9v","id":"127.0.0.1:50024"}
9
+ {"time":"2024-12-11T04:37:06.688190655Z","level":"INFO","msg":"handleInformFinish: stream closed","streamId":"pfhjmc9v","id":"127.0.0.1:50024"}
wandb/run-20241211_043031-pfhjmc9v/logs/debug-internal.log ADDED
@@ -0,0 +1,41 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {"time":"2024-12-11T04:30:31.64567442Z","level":"INFO","msg":"using version","core version":"0.19.0"}
2
+ {"time":"2024-12-11T04:30:31.645694977Z","level":"INFO","msg":"created symlink","path":"/content/wandb/run-20241211_043031-pfhjmc9v/logs/debug-core.log"}
3
+ {"time":"2024-12-11T04:30:31.748128189Z","level":"INFO","msg":"created new stream","id":"pfhjmc9v"}
4
+ {"time":"2024-12-11T04:30:31.748162278Z","level":"INFO","msg":"stream: started","id":"pfhjmc9v"}
5
+ {"time":"2024-12-11T04:30:31.748191334Z","level":"INFO","msg":"writer: Do: started","stream_id":"pfhjmc9v"}
6
+ {"time":"2024-12-11T04:30:31.748217805Z","level":"INFO","msg":"handler: started","stream_id":"pfhjmc9v"}
7
+ {"time":"2024-12-11T04:30:31.748239576Z","level":"INFO","msg":"sender: started","stream_id":"pfhjmc9v"}
8
+ {"time":"2024-12-11T04:30:34.483159987Z","level":"INFO","msg":"Starting system monitor"}
9
+ {"time":"2024-12-11T04:30:35.046627667Z","level":"INFO","msg":"Pausing system monitor"}
10
+ {"time":"2024-12-11T04:30:35.046662301Z","level":"INFO","msg":"Resuming system monitor"}
11
+ {"time":"2024-12-11T04:30:35.046670624Z","level":"INFO","msg":"Pausing system monitor"}
12
+ {"time":"2024-12-11T04:30:35.046676728Z","level":"INFO","msg":"Resuming system monitor"}
13
+ {"time":"2024-12-11T04:30:35.046682641Z","level":"INFO","msg":"Pausing system monitor"}
14
+ {"time":"2024-12-11T04:30:35.046688398Z","level":"INFO","msg":"Resuming system monitor"}
15
+ {"time":"2024-12-11T04:30:35.626440922Z","level":"INFO","msg":"Pausing system monitor"}
16
+ {"time":"2024-12-11T04:30:35.667423399Z","level":"INFO","msg":"Resuming system monitor"}
17
+ {"time":"2024-12-11T04:37:00.589693882Z","level":"INFO","msg":"Pausing system monitor"}
18
+ {"time":"2024-12-11T04:37:00.594358479Z","level":"INFO","msg":"Resuming system monitor"}
19
+ {"time":"2024-12-11T04:37:02.005522092Z","level":"INFO","msg":"Pausing system monitor"}
20
+ {"time":"2024-12-11T04:37:02.010541772Z","level":"INFO","msg":"Resuming system monitor"}
21
+ {"time":"2024-12-11T04:37:02.011288477Z","level":"INFO","msg":"Pausing system monitor"}
22
+ {"time":"2024-12-11T04:37:02.017364808Z","level":"INFO","msg":"Resuming system monitor"}
23
+ {"time":"2024-12-11T04:37:02.171915316Z","level":"INFO","msg":"Pausing system monitor"}
24
+ {"time":"2024-12-11T04:37:02.177365784Z","level":"INFO","msg":"Resuming system monitor"}
25
+ {"time":"2024-12-11T04:37:02.180571775Z","level":"INFO","msg":"Pausing system monitor"}
26
+ {"time":"2024-12-11T04:37:02.185372366Z","level":"INFO","msg":"Resuming system monitor"}
27
+ {"time":"2024-12-11T04:37:05.365253262Z","level":"INFO","msg":"Pausing system monitor"}
28
+ {"time":"2024-12-11T04:37:05.372555953Z","level":"INFO","msg":"Resuming system monitor"}
29
+ {"time":"2024-12-11T04:37:05.373998802Z","level":"INFO","msg":"Pausing system monitor"}
30
+ {"time":"2024-12-11T04:37:05.379408244Z","level":"INFO","msg":"Resuming system monitor"}
31
+ {"time":"2024-12-11T04:37:05.3922832Z","level":"INFO","msg":"Pausing system monitor"}
32
+ {"time":"2024-12-11T04:37:05.396859255Z","level":"INFO","msg":"Resuming system monitor"}
33
+ {"time":"2024-12-11T04:37:05.398980138Z","level":"INFO","msg":"Stopping system monitor"}
34
+ {"time":"2024-12-11T04:37:05.399571058Z","level":"INFO","msg":"Stopped system monitor"}
35
+ {"time":"2024-12-11T04:37:05.608796436Z","level":"INFO","msg":"fileTransfer: Close: file transfer manager closed"}
36
+ {"time":"2024-12-11T04:37:05.673605445Z","level":"INFO","msg":"handler: operation stats","stats":{}}
37
+ {"time":"2024-12-11T04:37:06.688087405Z","level":"INFO","msg":"stream: closing","id":"pfhjmc9v"}
38
+ {"time":"2024-12-11T04:37:06.68810837Z","level":"INFO","msg":"handler: closed","stream_id":"pfhjmc9v"}
39
+ {"time":"2024-12-11T04:37:06.688117223Z","level":"INFO","msg":"writer: Close: closed","stream_id":"pfhjmc9v"}
40
+ {"time":"2024-12-11T04:37:06.688143843Z","level":"INFO","msg":"sender: closed","stream_id":"pfhjmc9v"}
41
+ {"time":"2024-12-11T04:37:06.688182944Z","level":"INFO","msg":"stream: closed","id":"pfhjmc9v"}
wandb/run-20241211_043031-pfhjmc9v/logs/debug.log ADDED
@@ -0,0 +1,73 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_setup.py:_flush():68] Current SDK version is 0.19.0
2
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_setup.py:_flush():68] Configure stats pid to 2341
3
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_setup.py:_flush():68] Loading settings from /root/.config/wandb/settings
4
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_setup.py:_flush():68] Loading settings from /content/wandb/settings
5
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_setup.py:_flush():68] Loading settings from environment variables
6
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_init.py:_log_setup():528] Logging user logs to /content/wandb/run-20241211_043031-pfhjmc9v/logs/debug.log
7
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_init.py:_log_setup():529] Logging internal logs to /content/wandb/run-20241211_043031-pfhjmc9v/logs/debug-internal.log
8
+ 2024-12-11 04:30:31,640 INFO MainThread:2341 [wandb_init.py:_jupyter_setup():474] configuring jupyter hooks <wandb.sdk.wandb_init._WandbInit object at 0x7de625ba39d0>
9
+ 2024-12-11 04:30:31,641 INFO MainThread:2341 [wandb_init.py:init():639] calling init triggers
10
+ 2024-12-11 04:30:31,641 INFO MainThread:2341 [wandb_init.py:init():645] wandb.init called with sweep_config: {}
11
+ config: {}
12
+ 2024-12-11 04:30:31,641 INFO MainThread:2341 [wandb_init.py:init():688] starting backend
13
+ 2024-12-11 04:30:31,641 INFO MainThread:2341 [wandb_init.py:init():692] sending inform_init request
14
+ 2024-12-11 04:30:31,643 INFO MainThread:2341 [backend.py:_multiprocessing_setup():104] multiprocessing start_methods=fork,spawn,forkserver, using: spawn
15
+ 2024-12-11 04:30:31,644 INFO MainThread:2341 [wandb_init.py:init():705] backend started and connected
16
+ 2024-12-11 04:30:31,652 INFO MainThread:2341 [wandb_run.py:_label_probe_notebook():1327] probe notebook
17
+ 2024-12-11 04:30:34,399 INFO MainThread:2341 [wandb_init.py:init():798] updated telemetry
18
+ 2024-12-11 04:30:34,399 INFO MainThread:2341 [wandb_init.py:init():830] communicating run to backend with 90.0 second timeout
19
+ 2024-12-11 04:30:34,477 INFO MainThread:2341 [wandb_init.py:init():882] starting run threads in backend
20
+ 2024-12-11 04:30:35,007 INFO MainThread:2341 [wandb_run.py:_console_start():2443] atexit reg
21
+ 2024-12-11 04:30:35,007 INFO MainThread:2341 [wandb_run.py:_redirect():2293] redirect: wrap_raw
22
+ 2024-12-11 04:30:35,007 INFO MainThread:2341 [wandb_run.py:_redirect():2358] Wrapping output streams.
23
+ 2024-12-11 04:30:35,007 INFO MainThread:2341 [wandb_run.py:_redirect():2383] Redirects installed.
24
+ 2024-12-11 04:30:35,009 INFO MainThread:2341 [wandb_init.py:init():925] run started, returning control to user process
25
+ 2024-12-11 04:30:35,013 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
26
+ 2024-12-11 04:30:35,013 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
27
+ 2024-12-11 04:30:35,019 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
28
+ 2024-12-11 04:30:35,020 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
29
+ 2024-12-11 04:30:35,020 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
30
+ 2024-12-11 04:30:35,028 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
31
+ 2024-12-11 04:30:35,029 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
32
+ 2024-12-11 04:30:35,030 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
33
+ 2024-12-11 04:30:35,037 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
34
+ 2024-12-11 04:30:35,626 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
35
+ 2024-12-11 04:30:35,626 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
36
+ 2024-12-11 04:30:35,630 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
37
+ 2024-12-11 04:30:36,007 INFO MainThread:2341 [wandb_run.py:_config_callback():1375] config_cb None None {'return_dict': True, 'output_hidden_states': False, 'output_attentions': False, 'torchscript': False, 'torch_dtype': 'float32', 'use_bfloat16': False, 'tf_legacy_loss': False, 'pruned_heads': {}, 'tie_word_embeddings': True, 'chunk_size_feed_forward': 0, 'is_encoder_decoder': False, 'is_decoder': False, 'cross_attention_hidden_size': None, 'add_cross_attention': False, 'tie_encoder_decoder': False, 'max_length': 20, 'min_length': 0, 'do_sample': False, 'early_stopping': False, 'num_beams': 1, 'num_beam_groups': 1, 'diversity_penalty': 0.0, 'temperature': 1.0, 'top_k': 50, 'top_p': 1.0, 'typical_p': 1.0, 'repetition_penalty': 1.0, 'length_penalty': 1.0, 'no_repeat_ngram_size': 0, 'encoder_no_repeat_ngram_size': 0, 'bad_words_ids': None, 'num_return_sequences': 1, 'output_scores': False, 'return_dict_in_generate': False, 'forced_bos_token_id': None, 'forced_eos_token_id': None, 'remove_invalid_values': False, 'exponential_decay_length_penalty': None, 'suppress_tokens': None, 'begin_suppress_tokens': None, 'architectures': ['BertForMaskedLM'], 'finetuning_task': None, 'id2label': {0: '0', 1: '1'}, 'label2id': {'0': 0, '1': 1}, 'tokenizer_class': None, 'prefix': None, 'bos_token_id': None, 'pad_token_id': 0, 'eos_token_id': None, 'sep_token_id': None, 'decoder_start_token_id': None, 'task_specific_params': None, 'problem_type': None, '_name_or_path': 'belisards/congretimbau', '_attn_implementation_autoset': True, 'transformers_version': '4.47.0', 'directionality': 'bidi', 'model_type': 'bert', 'output_past': True, 'pooler_fc_size': 768, 'pooler_num_attention_heads': 12, 'pooler_num_fc_layers': 3, 'pooler_size_per_head': 128, 'pooler_type': 'first_token_transform', 'vocab_size': 29794, 'hidden_size': 1024, 'num_hidden_layers': 24, 'num_attention_heads': 16, 'hidden_act': 'gelu', 'intermediate_size': 4096, 'hidden_dropout_prob': 0.05, 'attention_probs_dropout_prob': 0.05, 'max_position_embeddings': 512, 'type_vocab_size': 2, 'initializer_range': 0.02, 'layer_norm_eps': 1e-12, 'position_embedding_type': 'absolute', 'use_cache': True, 'classifier_dropout': None, 'output_dir': '.', 'overwrite_output_dir': False, 'do_train': False, 'do_eval': True, 'do_predict': False, 'eval_strategy': 'epoch', 'prediction_loss_only': False, 'per_device_train_batch_size': 32, 'per_device_eval_batch_size': 32, 'per_gpu_train_batch_size': None, 'per_gpu_eval_batch_size': None, 'gradient_accumulation_steps': 1, 'eval_accumulation_steps': None, 'eval_delay': 0, 'torch_empty_cache_steps': None, 'learning_rate': 1e-05, 'weight_decay': 0.02, 'adam_beta1': 0.9, 'adam_beta2': 0.999, 'adam_epsilon': 1e-08, 'max_grad_norm': 1.0, 'num_train_epochs': 18, 'max_steps': -1, 'lr_scheduler_type': 'linear', 'lr_scheduler_kwargs': {}, 'warmup_ratio': 0.0, 'warmup_steps': 200, 'log_level': 'passive', 'log_level_replica': 'warning', 'log_on_each_node': True, 'logging_dir': './runs/Dec11_04-30-35_fd55a770be24', 'logging_strategy': 'steps', 'logging_first_step': False, 'logging_steps': 10, 'logging_nan_inf_filter': True, 'save_strategy': 'epoch', 'save_steps': 204, 'save_total_limit': 1, 'save_safetensors': True, 'save_on_each_node': False, 'save_only_model': False, 'restore_callback_states_from_checkpoint': False, 'no_cuda': False, 'use_cpu': False, 'use_mps_device': False, 'seed': 5151, 'data_seed': None, 'jit_mode_eval': False, 'use_ipex': False, 'bf16': False, 'fp16': False, 'fp16_opt_level': 'O1', 'half_precision_backend': 'auto', 'bf16_full_eval': False, 'fp16_full_eval': False, 'tf32': None, 'local_rank': 0, 'ddp_backend': None, 'tpu_num_cores': None, 'tpu_metrics_debug': False, 'debug': [], 'dataloader_drop_last': False, 'eval_steps': 102, 'dataloader_num_workers': 0, 'dataloader_prefetch_factor': None, 'past_index': -1, 'run_name': 'm2-congretimbau_tunado', 'disable_tqdm': False, 'remove_unused_columns': True, 'label_names': None, 'load_best_model_at_end': True, 'metric_for_best_model': 'f1', 'greater_is_better': True, 'ignore_data_skip': False, 'fsdp': [], 'fsdp_min_num_params': 0, 'fsdp_config': {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}, 'fsdp_transformer_layer_cls_to_wrap': None, 'accelerator_config': {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}, 'deepspeed': None, 'label_smoothing_factor': 0.0, 'optim': 'adamw_torch', 'optim_args': None, 'adafactor': False, 'group_by_length': False, 'length_column_name': 'length', 'report_to': ['wandb'], 'ddp_find_unused_parameters': None, 'ddp_bucket_cap_mb': None, 'ddp_broadcast_buffers': None, 'dataloader_pin_memory': True, 'dataloader_persistent_workers': False, 'skip_memory_metrics': True, 'use_legacy_prediction_loop': False, 'push_to_hub': False, 'resume_from_checkpoint': None, 'hub_model_id': None, 'hub_strategy': 'every_save', 'hub_token': '<HUB_TOKEN>', 'hub_private_repo': None, 'hub_always_push': False, 'gradient_checkpointing': False, 'gradient_checkpointing_kwargs': None, 'include_inputs_for_metrics': False, 'include_for_metrics': [], 'eval_do_concat_batches': True, 'fp16_backend': 'auto', 'evaluation_strategy': None, 'push_to_hub_model_id': None, 'push_to_hub_organization': None, 'push_to_hub_token': '<PUSH_TO_HUB_TOKEN>', 'mp_parameters': '', 'auto_find_batch_size': False, 'full_determinism': False, 'torchdynamo': None, 'ray_scope': 'last', 'ddp_timeout': 1800, 'torch_compile': False, 'torch_compile_backend': None, 'torch_compile_mode': None, 'dispatch_batches': None, 'split_batches': None, 'include_tokens_per_second': False, 'include_num_input_tokens_seen': False, 'neftune_noise_alpha': None, 'optim_target_modules': None, 'batch_eval_metrics': False, 'eval_on_start': False, 'use_liger_kernel': False, 'eval_use_gather_object': False, 'average_tokens_across_devices': False}
38
+ 2024-12-11 04:30:36,011 INFO MainThread:2341 [wandb_config.py:__setitem__():154] config set model/num_parameters = 334398466 - <bound method Run._config_callback of <wandb.sdk.wandb_run.Run object at 0x7de6143df1c0>>
39
+ 2024-12-11 04:30:36,011 INFO MainThread:2341 [wandb_run.py:_config_callback():1375] config_cb model/num_parameters 334398466 None
40
+ 2024-12-11 04:37:00,589 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
41
+ 2024-12-11 04:37:00,589 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
42
+ 2024-12-11 04:37:00,594 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
43
+ 2024-12-11 04:37:02,005 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
44
+ 2024-12-11 04:37:02,005 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
45
+ 2024-12-11 04:37:02,010 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
46
+ 2024-12-11 04:37:02,010 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
47
+ 2024-12-11 04:37:02,011 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
48
+ 2024-12-11 04:37:02,017 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
49
+ 2024-12-11 04:37:02,171 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
50
+ 2024-12-11 04:37:02,171 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
51
+ 2024-12-11 04:37:02,177 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
52
+ 2024-12-11 04:37:02,180 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
53
+ 2024-12-11 04:37:02,180 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
54
+ 2024-12-11 04:37:02,184 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
55
+ 2024-12-11 04:37:05,364 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
56
+ 2024-12-11 04:37:05,364 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
57
+ 2024-12-11 04:37:05,372 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
58
+ 2024-12-11 04:37:05,373 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
59
+ 2024-12-11 04:37:05,373 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
60
+ 2024-12-11 04:37:05,379 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
61
+ 2024-12-11 04:37:05,392 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
62
+ 2024-12-11 04:37:05,392 INFO MainThread:2341 [wandb_init.py:_pause_backend():439] pausing backend
63
+ 2024-12-11 04:37:05,396 INFO MainThread:2341 [wandb_init.py:_resume_backend():444] resuming backend
64
+ 2024-12-11 04:37:05,397 INFO MainThread:2341 [wandb_run.py:_finish():2169] finishing run adrianobf/AzMina-M2/pfhjmc9v
65
+ 2024-12-11 04:37:05,397 INFO MainThread:2341 [jupyter.py:save_history():459] not saving jupyter history
66
+ 2024-12-11 04:37:05,397 INFO MainThread:2341 [jupyter.py:save_ipynb():387] not saving jupyter notebook
67
+ 2024-12-11 04:37:05,397 INFO MainThread:2341 [wandb_init.py:_jupyter_teardown():456] cleaning up jupyter logic
68
+ 2024-12-11 04:37:05,397 INFO MainThread:2341 [wandb_run.py:_atexit_cleanup():2408] got exitcode: 0
69
+ 2024-12-11 04:37:05,398 INFO MainThread:2341 [wandb_run.py:_restore():2390] restore
70
+ 2024-12-11 04:37:05,398 INFO MainThread:2341 [wandb_run.py:_restore():2396] restore done
71
+ 2024-12-11 04:37:06,676 INFO MainThread:2341 [wandb_run.py:_footer_history_summary_info():3951] rendering history
72
+ 2024-12-11 04:37:06,677 INFO MainThread:2341 [wandb_run.py:_footer_history_summary_info():3983] rendering summary
73
+ 2024-12-11 04:37:06,685 INFO MainThread:2341 [wandb_run.py:_footer_sync_info():3912] logging synced files
wandb/run-20241211_043031-pfhjmc9v/run-pfhjmc9v.wandb ADDED
Binary file (106 kB). View file