Novaciano committed on
Commit 834611b · verified · 1 Parent(s): 74c4767

Update README.md

Files changed (1)
  1. README.md +109 -0
README.md CHANGED
@@ -74,6 +74,101 @@ language:
  - es
  - en
  pipeline_tag: text-generation
+ model-index:
+ - name: HarmfulProject-3.2-1B
+   results:
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: IFEval (0-Shot)
+       type: HuggingFaceH4/ifeval
+       args:
+         num_few_shot: 0
+     metrics:
+     - type: inst_level_strict_acc and prompt_level_strict_acc
+       value: 39.22
+       name: strict accuracy
+     source:
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Novaciano/HarmfulProject-3.2-1B
+       name: Open LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: BBH (3-Shot)
+       type: BBH
+       args:
+         num_few_shot: 3
+     metrics:
+     - type: acc_norm
+       value: 9.36
+       name: normalized accuracy
+     source:
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Novaciano/HarmfulProject-3.2-1B
+       name: Open LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: MATH Lvl 5 (4-Shot)
+       type: hendrycks/competition_math
+       args:
+         num_few_shot: 4
+     metrics:
+     - type: exact_match
+       value: 7.48
+       name: exact match
+     source:
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Novaciano/HarmfulProject-3.2-1B
+       name: Open LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: GPQA (0-shot)
+       type: Idavidrein/gpqa
+       args:
+         num_few_shot: 0
+     metrics:
+     - type: acc_norm
+       value: 2.13
+       name: acc_norm
+     source:
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Novaciano/HarmfulProject-3.2-1B
+       name: Open LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: MuSR (0-shot)
+       type: TAUR-Lab/MuSR
+       args:
+         num_few_shot: 0
+     metrics:
+     - type: acc_norm
+       value: 3.06
+       name: acc_norm
+     source:
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Novaciano/HarmfulProject-3.2-1B
+       name: Open LLM Leaderboard
+   - task:
+       type: text-generation
+       name: Text Generation
+     dataset:
+       name: MMLU-PRO (5-shot)
+       type: TIGER-Lab/MMLU-Pro
+       config: main
+       split: test
+       args:
+         num_few_shot: 5
+     metrics:
+     - type: acc
+       value: 10.46
+       name: accuracy
+     source:
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Novaciano/HarmfulProject-3.2-1B
+       name: Open LLM Leaderboard
  ---
 
  <center> <img src="https://i.ibb.co/hJk8j3Rz/IMG-20250306-WA0039.jpg" alt="IMG-20250306-WA0039" border="0"> </center>
@@ -82,6 +177,20 @@ pipeline_tag: text-generation
 
  <center> <img src="https://i.ibb.co/35vWL8Lc/IMG-20250311-011611.jpg" alt="IMG-20250311-011611" border="0"> </center>
 
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/Novaciano__BLAST_PROCESSING-3.2-1B-details)
+
+ |      Metric       |Value|
+ |-------------------|----:|
+ |Avg.               |11.95|
+ |IFEval (0-Shot)    |39.22|
+ |BBH (3-Shot)       | 9.36|
+ |MATH Lvl 5 (4-Shot)| 7.48|
+ |GPQA (0-shot)      | 2.13|
+ |MuSR (0-shot)      | 3.06|
+ |MMLU-PRO (5-shot)  |10.46|
+
  # Merge
 
  This merge of 20 pre-trained models was created using [mergekit](https://github.com/cg123/mergekit).
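
As a quick sanity check on the results table added in this commit, the `Avg.` row appears to be the unweighted mean of the six benchmark scores. The sketch below assumes a plain arithmetic mean rounded to two decimals; the leaderboard's actual aggregation is not stated in this commit, and the `scores` dictionary is only an illustrative container for the table's values.

```python
from statistics import mean

# Benchmark scores copied from the results table in this commit.
scores = {
    "IFEval (0-Shot)": 39.22,
    "BBH (3-Shot)": 9.36,
    "MATH Lvl 5 (4-Shot)": 7.48,
    "GPQA (0-shot)": 2.13,
    "MuSR (0-shot)": 3.06,
    "MMLU-PRO (5-shot)": 10.46,
}

# Assumption: "Avg." is the plain arithmetic mean, rounded to two decimals.
average = round(mean(scores.values()), 2)
print(f"Avg. {average:.2f}")
```

Under that assumption the computed mean agrees with the 11.95 reported in the table.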