LSX-UniWue
/

LLaMmlein_120M

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Julia287 commited on Oct 1, 2024

Commit

c831f1f

·

verified ·

1 Parent(s): 8a1e684

README.md

Files changed (1) hide show

README.md +29 -0

README.md CHANGED Viewed

@@ -6,3 +6,32 @@ language:
 ---
 # German Tinyllama-120M

 ---
 # German Tinyllama-120M
+This is a German Tinyllama 120M language model trained from scratch using the
+the [Tinyllama](https://github.com/jzhang38/TinyLlama) codebase on the German portion of [RedPajama V2](https://huggingface.co/datasets/togethercomputer/RedPajama-Data-V2).
+### Usage
+```python
+from transformers import AutoModelForCausalLM, AutoTokenizer
+model = AutoModelForCausalLM.from_pretrained("LSX-UniWue/german_tinyllama_120M")
+tokenizer = AutoTokenizer.from_pretrained("LSX-UniWue/german_tinyllama_120M")
+```
+### Performance
+We evaluated our results on the [SuperGLEBer](https://lsx-uniwue.github.io/SuperGLEBer-site/) benchmark.
+| Task Type           | Task Name    | Metric   | Score |
+|---------------------|--------------|----------|-------|
+| Classification      | NLI          | Accuracy | 0.629 |
+| Classification      | DB Aspect    | micro F1 | 0.517 |
+| Sequence Tagging    | NER Europarl | micro F1 | 0.538 |
+| Sentence Similarity | Pawsx        | Pearson  | 0.489 |
+| Question Answering  | MLQA         | F1       | 0.846 |