JanPf committed
Commit dfe7dac · verified · 1 Parent(s): 804e61f

Update README.md

Files changed (1)
  1. README.md +5 -14
README.md CHANGED
@@ -9,28 +9,19 @@ library_name: transformers
 
 # LLäMmlein 120M
 
- This is a German Tinyllama 120M language model trained from scratch using the
- the [Tinyllama](https://github.com/jzhang38/TinyLlama) codebase on the German portion of [RedPajama V2](https://huggingface.co/datasets/togethercomputer/RedPajama-Data-V2).
-
+ This is a German Tinyllama 120M language model trained from scratch using the [Tinyllama](https://github.com/jzhang38/TinyLlama) codebase on the German portion of [RedPajama V2](https://huggingface.co/datasets/togethercomputer/RedPajama-Data-V2).
+ Find more details on our [page](https://www.informatik.uni-wuerzburg.de/datascience/projects/nlp/llammlein/)!
 
 ### Usage
 
 ```python
 from transformers import AutoModelForCausalLM, AutoTokenizer
 
- model = AutoModelForCausalLM.from_pretrained("LSX-UniWue/LLaMmlein_120m")
+ model = AutoModelForCausalLM.from_pretrained("LSX-UniWue/LLaMmlein_120M")
 
- tokenizer = AutoTokenizer.from_pretrained("LSX-UniWue/LLaMmlein_120m")
+ tokenizer = AutoTokenizer.from_pretrained("LSX-UniWue/LLaMmlein_120M")
 ```
 
 
 ### Performance
- We evaluated our model on the [SuperGLEBer](https://lsx-uniwue.github.io/SuperGLEBer-site/) benchmark.
-
- | Task Type           | Task Name    | Metric   | Score |
- |---------------------|--------------|----------|-------|
- | Classification      | NLI          | Accuracy | 0.629 |
- | Classification      | DB Aspect    | micro F1 | 0.517 |
- | Sequence Tagging    | NER Europarl | micro F1 | 0.538 |
- | Sentence Similarity | Pawsx        | Pearson  | 0.489 |
- | Question Answering  | MLQA         | F1       | 0.846 |
+ We evaluated our model on the [SuperGLEBer](https://lsx-uniwue.github.io/SuperGLEBer-site/) benchmark.
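
For reference, here is a minimal sketch of how the loading snippet in the updated Usage section might be used for text generation; the prompt string and sampling parameters below are illustrative assumptions, not settings from the model card.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Load the model and tokenizer from the Hub (repo id as in the updated README).
model = AutoModelForCausalLM.from_pretrained("LSX-UniWue/LLaMmlein_120M")
tokenizer = AutoTokenizer.from_pretrained("LSX-UniWue/LLaMmlein_120M")

# Illustrative German prompt; as a base LM, the model simply continues the text.
prompt = "Die Würzburger Residenz ist"
inputs = tokenizer(prompt, return_tensors="pt")

# Placeholder sampling settings, not recommendations from the model card.
outputs = model.generate(**inputs, max_new_tokens=40, do_sample=True, top_p=0.9)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```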