l3utterfly
/

open-llama-3b-v2-layla

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

open-llama-3b-v2-layla / README.md

leaderboard-pr-bot's picture

leaderboard-pr-bot

Adding Evaluation Results

f9aa4f1 over 1 year ago

|

692 Bytes

metadata

license: apache-2.0

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric	Value
Avg.	35.63
ARC (25-shot)	38.23
HellaSwag (10-shot)	66.43
MMLU (5-shot)	28.56
TruthfulQA (0-shot)	44.4
Winogrande (5-shot)	62.83
GSM8K (5-shot)	1.06
DROP (3-shot)	7.88