Model save

Browse files

Files changed (5) hide show

README.md +80 -0
intent_report_test.txt +75 -0
model.safetensors +1 -1
model_predict_test.csv +0 -0
slot_report_test.txt +60 -0

README.md ADDED Viewed

	@@ -0,0 +1,80 @@

+---
+library_name: transformers
+license: apache-2.0
+base_model: uitnlp/CafeBERT
+tags:
+- generated_from_trainer
+model-index:
+- name: CafeBERT_massive_crf_v2
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# CafeBERT_massive_crf_v2
+This model is a fine-tuned version of [uitnlp/CafeBERT](https://huggingface.co/uitnlp/CafeBERT) on an unknown dataset.
+It achieves the following results on the evaluation set:
+- Loss: 5.5963
+- Slot P: 0.0077
+- Slot R: 0.0082
+- Slot F1: 0.0079
+- Slot Exact Match: 0.3246
+- Intent Acc: 0.8751
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 5e-05
+- train_batch_size: 128
+- eval_batch_size: 128
+- seed: 42
+- gradient_accumulation_steps: 2
+- total_train_batch_size: 256
+- optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
+- lr_scheduler_type: cosine
+- lr_scheduler_warmup_ratio: 0.06
+- num_epochs: 30
+- mixed_precision_training: Native AMP
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Slot P | Slot R | Slot F1 | Slot Exact Match | Intent Acc |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:-------:|:----------------:|:----------:|
+| No log        | 1.0   | 45   | 15.8863         | 0.0    | 0.0    | 0.0     | 0.4088           | 0.0733     |
+| 69.6998       | 2.0   | 90   | 7.8039          | 0.0080 | 0.0076 | 0.0078  | 0.3438           | 0.5032     |
+| 22.3589       | 3.0   | 135  | 4.5345          | 0.0091 | 0.0100 | 0.0095  | 0.3227           | 0.7846     |
+| 10.4218       | 4.0   | 180  | 4.0667          | 0.0110 | 0.0111 | 0.0111  | 0.3384           | 0.8406     |
+| 6.8199        | 5.0   | 225  | 3.8871          | 0.0092 | 0.0100 | 0.0096  | 0.3261           | 0.8623     |
+| 5.4068        | 6.0   | 270  | 3.9234          | 0.0106 | 0.0117 | 0.0111  | 0.3212           | 0.8633     |
+| 4.2552        | 7.0   | 315  | 4.0332          | 0.0115 | 0.0129 | 0.0122  | 0.3168           | 0.8657     |
+| 3.5197        | 8.0   | 360  | 4.2753          | 0.0080 | 0.0088 | 0.0084  | 0.3222           | 0.8647     |
+| 2.8374        | 9.0   | 405  | 4.6031          | 0.0099 | 0.0106 | 0.0102  | 0.3256           | 0.8701     |
+| 2.2784        | 10.0  | 450  | 4.7992          | 0.0118 | 0.0129 | 0.0123  | 0.3237           | 0.8652     |
+| 2.2784        | 11.0  | 495  | 5.0575          | 0.0118 | 0.0129 | 0.0123  | 0.3222           | 0.8652     |
+| 1.8204        | 12.0  | 540  | 5.1371          | 0.0088 | 0.0094 | 0.0091  | 0.3266           | 0.8731     |
+| 1.5073        | 13.0  | 585  | 5.4768          | 0.0109 | 0.0123 | 0.0116  | 0.3133           | 0.8677     |
+| 1.275         | 14.0  | 630  | 5.5963          | 0.0077 | 0.0082 | 0.0079  | 0.3246           | 0.8751     |
+### Framework versions
+- Transformers 4.55.0
+- Pytorch 2.7.0+cu126
+- Datasets 3.6.0
+- Tokenizers 0.21.4

intent_report_test.txt ADDED Viewed

	@@ -0,0 +1,75 @@

+              precision    recall  f1-score   support
+           0       0.90      0.97      0.93        88
+           1       0.85      0.94      0.89        36
+           2       1.00      0.94      0.97        35
+           3       0.91      0.86      0.88        35
+           4       0.77      0.92      0.84        26
+           5       0.00      0.00      0.00         1
+           6       0.77      0.79      0.78        43
+           7       1.00      0.50      0.67         4
+           8       1.00      0.83      0.91        18
+           9       0.96      0.92      0.94        72
+          10       0.97      0.97      0.97        39
+          11       0.83      1.00      0.91        15
+          12       0.71      0.55      0.62       169
+          13       0.95      0.96      0.95       156
+          14       0.79      0.85      0.81        13
+          15       0.71      0.83      0.77        12
+          16       0.83      0.86      0.84        22
+          17       0.65      0.85      0.73        26
+          18       0.89      0.89      0.89        27
+          19       0.78      1.00      0.87        31
+          20       0.88      0.88      0.88        41
+          21       0.86      0.92      0.89        39
+          22       0.80      0.86      0.83       124
+          23       1.00      0.88      0.94        34
+          24       1.00      0.90      0.95        10
+          25       0.95      1.00      0.97        19
+          26       0.94      0.86      0.90        57
+          27       0.87      0.80      0.83        25
+          28       0.33      0.33      0.33         6
+          29       1.00      0.50      0.67         6
+          30       0.91      0.94      0.93        67
+          31       0.89      0.76      0.82        21
+          32       0.73      0.82      0.77       126
+          33       0.95      0.93      0.94       114
+          34       0.96      0.88      0.92        26
+          35       0.91      0.91      0.91        11
+          36       0.78      0.96      0.86        72
+          37       0.00      0.00      0.00         0
+          38       0.79      0.73      0.76        15
+          39       0.88      0.92      0.90        25
+          40       0.95      0.98      0.97        43
+          41       0.67      0.67      0.67         3
+          42       0.84      0.90      0.87        51
+          43       0.84      0.89      0.86        36
+          44       0.96      0.93      0.94       119
+          45       0.91      0.90      0.91       176
+          46       0.88      0.94      0.91        32
+          47       0.99      0.91      0.95        81
+          48       0.95      0.98      0.96        41
+          49       0.77      0.81      0.79       141
+          50       0.95      0.91      0.93       209
+          51       0.94      0.94      0.94        35
+          52       0.95      1.00      0.98        21
+          53       0.92      0.92      0.92        52
+          54       0.92      1.00      0.96        23
+          55       0.80      0.80      0.80        20
+          56       1.00      0.97      0.99        36
+          57       0.89      0.89      0.89        35
+          58       0.92      0.73      0.81        63
+          59       0.91      0.78      0.84        51
+    accuracy                           0.88      2974
+   macro avg       0.84      0.83      0.83      2974
+weighted avg       0.88      0.88      0.87      2974
+Confusion matrix:
+[[85  0  0 ...  0  0  0]
+ [ 0 34  0 ...  0  0  0]
+ [ 0  0 33 ...  0  0  0]
+ ...
+ [ 0  0  0 ... 31  0  0]
+ [ 0  0  0 ...  0 46  0]
+ [ 0  0  0 ...  0  1 40]]

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0162639ab844dd9e7b92b6a36d6d09373d72f898d2b4fa651359f7ea4b0e3550
 size 2240362200

 version https://git-lfs.github.com/spec/v1
+oid sha256:d0a4df39d307944d6a8f628272d6dc58413c6723db25dac6df6aac1f686020d0
 size 2240362200

model_predict_test.csv ADDED Viewed

The diff for this file is too large to render. See raw diff

slot_report_test.txt ADDED Viewed

	@@ -0,0 +1,60 @@

+                      precision    recall  f1-score   support
+          alarm_type       0.00      0.00      0.00         2
+            app_name       0.00      0.00      0.00         5
+         artist_name       0.02      0.02      0.02        52
+    audiobook_author       0.00      0.00      0.00         5
+      audiobook_name       0.00      0.00      0.00        22
+       business_name       0.00      0.00      0.00        89
+       business_type       0.00      0.00      0.00        27
+       change_amount       0.00      0.00      0.00         7
+         coffee_type       0.00      0.00      0.00         3
+          color_type       0.00      0.00      0.00         9
+        cooking_type       0.00      0.00      0.00         8
+       currency_name       0.00      0.00      0.00        50
+                date       0.01      0.01      0.01       365
+     definition_word       0.00      0.00      0.00        51
+         device_type       0.00      0.00      0.00        28
+          drink_type       0.00      0.00      0.00         1
+       email_address       0.00      0.00      0.00         9
+        email_folder       0.00      0.00      0.00         5
+          event_name       0.00      0.00      0.00       237
+           food_type       0.03      0.03      0.03        63
+           game_name       0.00      0.00      0.00        23
+   general_frequency       0.05      0.06      0.05        17
+         house_place       0.00      0.00      0.00        14
+          ingredient       0.00      0.00      0.00         5
+           joke_type       0.00      0.00      0.00        11
+           list_name       0.00      0.00      0.00        61
+           meal_type       0.00      0.00      0.00        13
+          media_type       0.00      0.00      0.00       119
+          movie_name       0.00      0.00      0.00         2
+          movie_type       0.00      0.00      0.00         3
+         music_album       0.00      0.00      0.00         1
+    music_descriptor       0.00      0.00      0.00         4
+         music_genre       0.03      0.02      0.02        44
+          news_topic       0.00      0.00      0.00        43
+          order_type       0.00      0.00      0.00         9
+              person       0.01      0.01      0.01       212
+       personal_info       0.00      0.00      0.00        13
+          place_name       0.00      0.00      0.00       257
+      player_setting       0.00      0.00      0.00        35
+       playlist_name       0.00      0.00      0.00        12
+  podcast_descriptor       0.00      0.00      0.00        21
+        podcast_name       0.00      0.00      0.00        16
+          radio_name       0.05      0.07      0.06        29
+            relation       0.00      0.00      0.00        53
+           song_name       0.00      0.00      0.00        34
+          sport_type       0.00      0.00      0.00         0
+                time       0.02      0.02      0.02       161
+           time_zone       0.00      0.00      0.00        13
+           timeofday       0.00      0.00      0.00        48
+    transport_agency       0.00      0.00      0.00         9
+transport_descriptor       0.00      0.00      0.00         2
+      transport_name       0.00      0.00      0.00         4
+      transport_type       0.00      0.00      0.00        63
+  weather_descriptor       0.02      0.02      0.02        48
+           micro avg       0.01      0.01      0.01      2437
+           macro avg       0.00      0.00      0.00      2437
+        weighted avg       0.01      0.01      0.01      2437