ArabicNewSplits7_usingWellWrittenEssays_FineTuningAraBERT_run2_AugV5_k4_task1_organization

This model is a fine-tuned version of aubmindlab/bert-base-arabertv02 on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 1.3775
  • Qwk: 0.4651
  • Mse: 1.3775
  • Rmse: 1.1737

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 100

Training results

Training Loss Epoch Step Validation Loss Qwk Mse Rmse
No log 0.1111 2 7.2302 -0.0162 7.2302 2.6889
No log 0.2222 4 4.4115 0.0667 4.4115 2.1003
No log 0.3333 6 3.1448 0.1053 3.1448 1.7733
No log 0.4444 8 2.6809 0.1216 2.6809 1.6373
No log 0.5556 10 1.9468 0.1429 1.9468 1.3953
No log 0.6667 12 1.8467 0.2261 1.8467 1.3589
No log 0.7778 14 2.6127 0.0667 2.6127 1.6164
No log 0.8889 16 3.1084 0.0714 3.1084 1.7631
No log 1.0 18 2.6965 0.0 2.6965 1.6421
No log 1.1111 20 2.1406 0.0484 2.1406 1.4631
No log 1.2222 22 1.9503 0.1284 1.9503 1.3965
No log 1.3333 24 1.8638 0.1284 1.8638 1.3652
No log 1.4444 26 1.7989 0.1754 1.7989 1.3412
No log 1.5556 28 1.6646 0.2353 1.6646 1.2902
No log 1.6667 30 1.4500 0.2364 1.4500 1.2042
No log 1.7778 32 1.4416 0.2617 1.4416 1.2007
No log 1.8889 34 1.3675 0.3423 1.3675 1.1694
No log 2.0 36 1.3329 0.3604 1.3329 1.1545
No log 2.1111 38 1.2758 0.4370 1.2758 1.1295
No log 2.2222 40 1.4265 0.4426 1.4265 1.1944
No log 2.3333 42 1.6307 0.3040 1.6307 1.2770
No log 2.4444 44 1.5520 0.3651 1.5520 1.2458
No log 2.5556 46 1.3728 0.4426 1.3728 1.1717
No log 2.6667 48 1.3897 0.4355 1.3897 1.1788
No log 2.7778 50 1.3147 0.4444 1.3147 1.1466
No log 2.8889 52 1.2343 0.4715 1.2343 1.1110
No log 3.0 54 1.2403 0.4426 1.2403 1.1137
No log 3.1111 56 1.3924 0.3448 1.3924 1.1800
No log 3.2222 58 1.5343 0.2759 1.5343 1.2387
No log 3.3333 60 1.3523 0.4167 1.3523 1.1629
No log 3.4444 62 1.2288 0.4921 1.2288 1.1085
No log 3.5556 64 1.2093 0.496 1.2093 1.0997
No log 3.6667 66 1.2407 0.4882 1.2407 1.1139
No log 3.7778 68 1.3146 0.4127 1.3146 1.1466
No log 3.8889 70 1.3242 0.4444 1.3242 1.1507
No log 4.0 72 1.3302 0.4444 1.3302 1.1534
No log 4.1111 74 1.2998 0.5303 1.2998 1.1401
No log 4.2222 76 1.3509 0.4848 1.3509 1.1623
No log 4.3333 78 1.4385 0.4776 1.4385 1.1994
No log 4.4444 80 1.5528 0.3857 1.5528 1.2461
No log 4.5556 82 1.6398 0.2920 1.6398 1.2805
No log 4.6667 84 1.7369 0.2647 1.7369 1.3179
No log 4.7778 86 1.5624 0.3433 1.5624 1.2500
No log 4.8889 88 1.2484 0.5 1.2484 1.1173
No log 5.0 90 1.2137 0.4463 1.2137 1.1017
No log 5.1111 92 1.2098 0.4202 1.2098 1.0999
No log 5.2222 94 1.3076 0.4677 1.3076 1.1435
No log 5.3333 96 1.3324 0.4355 1.3324 1.1543
No log 5.4444 98 1.3185 0.4274 1.3185 1.1482
No log 5.5556 100 1.2887 0.3423 1.2887 1.1352
No log 5.6667 102 1.2516 0.3826 1.2516 1.1188
No log 5.7778 104 1.1652 0.5714 1.1652 1.0794
No log 5.8889 106 1.2631 0.5816 1.2631 1.1239
No log 6.0 108 1.2507 0.5441 1.2507 1.1183
No log 6.1111 110 1.3513 0.4539 1.3513 1.1625
No log 6.2222 112 1.4058 0.4113 1.4058 1.1857
No log 6.3333 114 1.4401 0.3788 1.4401 1.2000
No log 6.4444 116 1.3406 0.4094 1.3406 1.1578
No log 6.5556 118 1.4056 0.3622 1.4056 1.1856
No log 6.6667 120 1.4455 0.3622 1.4455 1.2023
No log 6.7778 122 1.4329 0.4031 1.4329 1.1970
No log 6.8889 124 1.5019 0.4444 1.5019 1.2255
No log 7.0 126 1.5050 0.3731 1.5050 1.2268
No log 7.1111 128 1.2916 0.4480 1.2916 1.1365
No log 7.2222 130 1.1935 0.5238 1.1935 1.0925
No log 7.3333 132 1.1971 0.4918 1.1971 1.0941
No log 7.4444 134 1.2331 0.5333 1.2331 1.1105
No log 7.5556 136 1.4692 0.3101 1.4692 1.2121
No log 7.6667 138 1.5731 0.2923 1.5731 1.2542
No log 7.7778 140 1.4763 0.3871 1.4763 1.2151
No log 7.8889 142 1.3090 0.4068 1.3090 1.1441
No log 8.0 144 1.1858 0.5333 1.1858 1.0889
No log 8.1111 146 1.1625 0.5289 1.1625 1.0782
No log 8.2222 148 1.2691 0.5038 1.2691 1.1265
No log 8.3333 150 1.4789 0.3824 1.4789 1.2161
No log 8.4444 152 1.8595 0.2571 1.8595 1.3636
No log 8.5556 154 1.8596 0.2571 1.8596 1.3637
No log 8.6667 156 1.5688 0.3235 1.5688 1.2525
No log 8.7778 158 1.3606 0.4409 1.3606 1.1665
No log 8.8889 160 1.3174 0.5082 1.3174 1.1478
No log 9.0 162 1.3682 0.4833 1.3682 1.1697
No log 9.1111 164 1.5108 0.4032 1.5108 1.2291
No log 9.2222 166 1.7706 0.2362 1.7706 1.3306
No log 9.3333 168 1.8467 0.1642 1.8467 1.3589
No log 9.4444 170 1.6200 0.3134 1.6200 1.2728
No log 9.5556 172 1.2654 0.4677 1.2654 1.1249
No log 9.6667 174 1.1442 0.5289 1.1442 1.0697
No log 9.7778 176 1.1996 0.4839 1.1996 1.0952
No log 9.8889 178 1.4476 0.3768 1.4476 1.2032
No log 10.0 180 1.7188 0.2857 1.7188 1.3110
No log 10.1111 182 1.6671 0.3165 1.6671 1.2912
No log 10.2222 184 1.4401 0.4 1.4401 1.2000
No log 10.3333 186 1.2900 0.4407 1.2900 1.1358
No log 10.4444 188 1.2735 0.4348 1.2735 1.1285
No log 10.5556 190 1.2690 0.4274 1.2690 1.1265
No log 10.6667 192 1.3048 0.4590 1.3048 1.1423
No log 10.7778 194 1.4541 0.4030 1.4541 1.2059
No log 10.8889 196 1.6732 0.3022 1.6732 1.2935
No log 11.0 198 1.7449 0.3000 1.7449 1.3210
No log 11.1111 200 1.5780 0.3333 1.5780 1.2562
No log 11.2222 202 1.2762 0.4427 1.2762 1.1297
No log 11.3333 204 1.1859 0.5246 1.1859 1.0890
No log 11.4444 206 1.2334 0.5246 1.2334 1.1106
No log 11.5556 208 1.3362 0.5079 1.3362 1.1559
No log 11.6667 210 1.3765 0.4480 1.3765 1.1732
No log 11.7778 212 1.4023 0.5116 1.4023 1.1842
No log 11.8889 214 1.4883 0.3704 1.4883 1.2199
No log 12.0 216 1.6310 0.3284 1.6310 1.2771
No log 12.1111 218 1.6209 0.3158 1.6209 1.2731
No log 12.2222 220 1.3665 0.3939 1.3665 1.1690
No log 12.3333 222 1.1409 0.6 1.1409 1.0681
No log 12.4444 224 1.1304 0.56 1.1304 1.0632
No log 12.5556 226 1.1946 0.5 1.1946 1.0930
No log 12.6667 228 1.3441 0.4531 1.3441 1.1593
No log 12.7778 230 1.3941 0.4341 1.3941 1.1807
No log 12.8889 232 1.3476 0.5041 1.3476 1.1608
No log 13.0 234 1.2825 0.4615 1.2825 1.1325
No log 13.1111 236 1.2276 0.4483 1.2276 1.1080
No log 13.2222 238 1.1684 0.5042 1.1684 1.0809
No log 13.3333 240 1.2128 0.544 1.2128 1.1013
No log 13.4444 242 1.3093 0.4923 1.3093 1.1443
No log 13.5556 244 1.4582 0.4088 1.4582 1.2076
No log 13.6667 246 1.4419 0.4088 1.4419 1.2008
No log 13.7778 248 1.3565 0.4662 1.3565 1.1647
No log 13.8889 250 1.1208 0.6142 1.1208 1.0587
No log 14.0 252 1.0436 0.6349 1.0436 1.0216
No log 14.1111 254 1.0638 0.5806 1.0638 1.0314
No log 14.2222 256 1.1157 0.5528 1.1157 1.0563
No log 14.3333 258 1.1479 0.528 1.1479 1.0714
No log 14.4444 260 1.0918 0.5806 1.0918 1.0449
No log 14.5556 262 1.0783 0.5806 1.0783 1.0384
No log 14.6667 264 1.0979 0.5873 1.0979 1.0478
No log 14.7778 266 1.2126 0.5496 1.2126 1.1012
No log 14.8889 268 1.4109 0.4412 1.4109 1.1878
No log 15.0 270 1.5261 0.3609 1.5261 1.2354
No log 15.1111 272 1.4898 0.4160 1.4898 1.2206
No log 15.2222 274 1.3226 0.4918 1.3226 1.1500
No log 15.3333 276 1.1620 0.5410 1.1620 1.0780
No log 15.4444 278 1.1353 0.5691 1.1353 1.0655
No log 15.5556 280 1.1841 0.5556 1.1841 1.0881
No log 15.6667 282 1.2256 0.5 1.2256 1.1070
No log 15.7778 284 1.1625 0.544 1.1625 1.0782
No log 15.8889 286 1.1070 0.5920 1.1070 1.0522
No log 16.0 288 1.0856 0.5806 1.0856 1.0419
No log 16.1111 290 1.1143 0.6299 1.1143 1.0556
No log 16.2222 292 1.2301 0.5669 1.2301 1.1091
No log 16.3333 294 1.3891 0.4211 1.3891 1.1786
No log 16.4444 296 1.4909 0.3582 1.4909 1.2210
No log 16.5556 298 1.4894 0.4122 1.4894 1.2204
No log 16.6667 300 1.4245 0.4839 1.4245 1.1935
No log 16.7778 302 1.3503 0.4667 1.3503 1.1620
No log 16.8889 304 1.3226 0.5082 1.3226 1.1501
No log 17.0 306 1.3171 0.496 1.3171 1.1477
No log 17.1111 308 1.3992 0.4688 1.3992 1.1829
No log 17.2222 310 1.3530 0.4341 1.3530 1.1632
No log 17.3333 312 1.4121 0.4091 1.4121 1.1883
No log 17.4444 314 1.5174 0.3731 1.5174 1.2318
No log 17.5556 316 1.5420 0.3731 1.5420 1.2418
No log 17.6667 318 1.3997 0.4122 1.3997 1.1831
No log 17.7778 320 1.2696 0.4603 1.2696 1.1268
No log 17.8889 322 1.1606 0.5323 1.1606 1.0773
No log 18.0 324 1.1657 0.528 1.1657 1.0797
No log 18.1111 326 1.2725 0.4567 1.2725 1.1280
No log 18.2222 328 1.3827 0.4091 1.3827 1.1759
No log 18.3333 330 1.4296 0.4 1.4296 1.1957
No log 18.4444 332 1.4292 0.4 1.4292 1.1955
No log 18.5556 334 1.3584 0.4697 1.3584 1.1655
No log 18.6667 336 1.2253 0.5079 1.2253 1.1069
No log 18.7778 338 1.2030 0.5323 1.2030 1.0968
No log 18.8889 340 1.2899 0.4688 1.2899 1.1357
No log 19.0 342 1.3934 0.4615 1.3934 1.1804
No log 19.1111 344 1.4310 0.4242 1.4310 1.1962
No log 19.2222 346 1.3474 0.4651 1.3474 1.1608
No log 19.3333 348 1.2123 0.5197 1.2123 1.1011
No log 19.4444 350 1.0835 0.6190 1.0835 1.0409
No log 19.5556 352 1.0483 0.6299 1.0483 1.0239
No log 19.6667 354 1.0919 0.6190 1.0919 1.0450
No log 19.7778 356 1.1786 0.5625 1.1786 1.0856
No log 19.8889 358 1.3220 0.5191 1.3220 1.1498
No log 20.0 360 1.5076 0.3504 1.5076 1.2278
No log 20.1111 362 1.5391 0.3504 1.5391 1.2406
No log 20.2222 364 1.3497 0.4394 1.3497 1.1618
No log 20.3333 366 1.2291 0.5079 1.2291 1.1086
No log 20.4444 368 1.2576 0.5079 1.2576 1.1214
No log 20.5556 370 1.2908 0.5079 1.2908 1.1361
No log 20.6667 372 1.3176 0.4615 1.3176 1.1478
No log 20.7778 374 1.3333 0.4361 1.3333 1.1547
No log 20.8889 376 1.4097 0.4361 1.4097 1.1873
No log 21.0 378 1.4595 0.4 1.4595 1.2081
No log 21.1111 380 1.4463 0.4511 1.4463 1.2026
No log 21.2222 382 1.4038 0.4961 1.4038 1.1848
No log 21.3333 384 1.3199 0.5197 1.3199 1.1489
No log 21.4444 386 1.3220 0.5197 1.3220 1.1498
No log 21.5556 388 1.3725 0.5197 1.3725 1.1716
No log 21.6667 390 1.4606 0.4545 1.4606 1.2086
No log 21.7778 392 1.5000 0.4328 1.5000 1.2247
No log 21.8889 394 1.4173 0.4394 1.4173 1.1905
No log 22.0 396 1.3477 0.4394 1.3477 1.1609
No log 22.1111 398 1.3326 0.4962 1.3326 1.1544
No log 22.2222 400 1.2398 0.5469 1.2398 1.1135
No log 22.3333 402 1.1913 0.5736 1.1913 1.0915
No log 22.4444 404 1.1450 0.5846 1.1450 1.0701
No log 22.5556 406 1.1891 0.5469 1.1891 1.0904
No log 22.6667 408 1.1605 0.5469 1.1605 1.0773
No log 22.7778 410 1.0954 0.5714 1.0954 1.0466
No log 22.8889 412 1.1177 0.544 1.1177 1.0572
No log 23.0 414 1.1727 0.512 1.1727 1.0829
No log 23.1111 416 1.2065 0.4839 1.2065 1.0984
No log 23.2222 418 1.2118 0.48 1.2118 1.1008
No log 23.3333 420 1.2227 0.5079 1.2227 1.1057
No log 23.4444 422 1.2427 0.5197 1.2427 1.1148
No log 23.5556 424 1.2537 0.5156 1.2537 1.1197
No log 23.6667 426 1.2628 0.5156 1.2628 1.1237
No log 23.7778 428 1.2376 0.5156 1.2376 1.1125
No log 23.8889 430 1.1614 0.5354 1.1614 1.0777
No log 24.0 432 1.1310 0.5354 1.1310 1.0635
No log 24.1111 434 1.1382 0.5354 1.1382 1.0668
No log 24.2222 436 1.2160 0.5385 1.2160 1.1027
No log 24.3333 438 1.3783 0.3881 1.3783 1.1740
No log 24.4444 440 1.5406 0.3504 1.5406 1.2412
No log 24.5556 442 1.5181 0.3235 1.5181 1.2321
No log 24.6667 444 1.4056 0.4923 1.4056 1.1856
No log 24.7778 446 1.2887 0.5197 1.2887 1.1352
No log 24.8889 448 1.2434 0.5366 1.2434 1.1151
No log 25.0 450 1.2660 0.5366 1.2660 1.1252
No log 25.1111 452 1.2796 0.5161 1.2796 1.1312
No log 25.2222 454 1.2685 0.512 1.2685 1.1263
No log 25.3333 456 1.2138 0.5161 1.2138 1.1017
No log 25.4444 458 1.1906 0.512 1.1906 1.0911
No log 25.5556 460 1.1783 0.5312 1.1783 1.0855
No log 25.6667 462 1.1486 0.5312 1.1486 1.0717
No log 25.7778 464 1.1852 0.5426 1.1852 1.0887
No log 25.8889 466 1.2981 0.4923 1.2981 1.1393
No log 26.0 468 1.4278 0.3852 1.4278 1.1949
No log 26.1111 470 1.4644 0.3704 1.4644 1.2101
No log 26.2222 472 1.3641 0.5116 1.3641 1.1679
No log 26.3333 474 1.2515 0.4762 1.2515 1.1187
No log 26.4444 476 1.2331 0.4839 1.2331 1.1104
No log 26.5556 478 1.2538 0.4839 1.2538 1.1197
No log 26.6667 480 1.2491 0.4839 1.2491 1.1176
No log 26.7778 482 1.2210 0.4878 1.2210 1.1050
No log 26.8889 484 1.2202 0.4878 1.2202 1.1046
No log 27.0 486 1.2710 0.5079 1.2710 1.1274
No log 27.1111 488 1.3325 0.5116 1.3325 1.1543
No log 27.2222 490 1.3226 0.5116 1.3226 1.1500
No log 27.3333 492 1.3569 0.4844 1.3569 1.1648
No log 27.4444 494 1.3738 0.4844 1.3738 1.1721
No log 27.5556 496 1.3503 0.496 1.3503 1.1620
No log 27.6667 498 1.3301 0.496 1.3301 1.1533
0.3281 27.7778 500 1.3659 0.496 1.3659 1.1687
0.3281 27.8889 502 1.3950 0.4921 1.3950 1.1811
0.3281 28.0 504 1.3784 0.496 1.3784 1.1741
0.3281 28.1111 506 1.3233 0.5512 1.3233 1.1503
0.3281 28.2222 508 1.2705 0.5556 1.2705 1.1272
0.3281 28.3333 510 1.2605 0.5669 1.2605 1.1227
0.3281 28.4444 512 1.2942 0.5512 1.2942 1.1376
0.3281 28.5556 514 1.4035 0.5116 1.4035 1.1847
0.3281 28.6667 516 1.5015 0.3759 1.5015 1.2253
0.3281 28.7778 518 1.5056 0.3759 1.5056 1.2270
0.3281 28.8889 520 1.4851 0.3969 1.4851 1.2186
0.3281 29.0 522 1.4167 0.4724 1.4167 1.1902
0.3281 29.1111 524 1.3418 0.512 1.3418 1.1583
0.3281 29.2222 526 1.2763 0.5161 1.2763 1.1297
0.3281 29.3333 528 1.2682 0.5161 1.2682 1.1261
0.3281 29.4444 530 1.2820 0.512 1.2820 1.1323
0.3281 29.5556 532 1.2954 0.512 1.2954 1.1381
0.3281 29.6667 534 1.2552 0.512 1.2552 1.1204
0.3281 29.7778 536 1.2767 0.4839 1.2767 1.1299
0.3281 29.8889 538 1.3347 0.4882 1.3347 1.1553
0.3281 30.0 540 1.4123 0.4651 1.4123 1.1884
0.3281 30.1111 542 1.4404 0.4651 1.4404 1.2002
0.3281 30.2222 544 1.3775 0.4651 1.3775 1.1737

Framework versions

  • Transformers 4.44.2
  • Pytorch 2.4.0+cu118
  • Datasets 2.21.0
  • Tokenizers 0.19.1
Downloads last month
5
Safetensors
Model size
135M params
Tensor type
F32
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for MayBashendy/ArabicNewSplits7_usingWellWrittenEssays_FineTuningAraBERT_run2_AugV5_k4_task1_organization

Finetuned
(4222)
this model