LTS-VVE commited on
Commit
9dcae7a
·
verified ·
1 Parent(s): 59f023d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +31 -1
README.md CHANGED
@@ -35,4 +35,34 @@ tags:
35
  - biology
36
  - climate
37
  - not-for-all-audiences
38
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
35
  - biology
36
  - climate
37
  - not-for-all-audiences
38
+ ---
39
+
40
+ <p align="center">
41
+ <span style="color:yellow">This model is not suitable for all audiences and may contain inappropriate or explicit content.</span>
42
+ </p>
43
+
44
+ <p align="center">
45
+ <img src="https://cdn-uploads.huggingface.co/production/uploads/67b7476deb48853c39ca000b/CzUTg97aTxK283qwD6kEm.png" alt="Teuta Logo" />
46
+ </p>
47
+
48
+ # Teuta
49
+
50
+ Teuta is a bilingual instruction-tuned language model designed for question answering in both Albanian (sq) and English (en). It is fine-tuned on a diverse mix of datasets covering subjects such as mathematics, philosophy, chemistry, biology, code (especially Rust), psychology, and climate science.
51
+
52
+ ## Model
53
+
54
+ - **Base model**: meta-llama/Llama-3.2-3B
55
+ - **Languages**: Albanian, English
56
+ - **Primary task**: Instruction-following and question answering
57
+
58
+ ## Description
59
+
60
+ Teuta is built to handle a variety of instructional prompts, from academic and scientific queries to more open-ended reasoning tasks. It is particularly suited for multilingual applications and under-resourced language support, with a strong focus on Albanian.
61
+
62
+ The model leverages both synthetic and real datasets to improve generalization across technical and non-technical domains.
63
+
64
+ ## Considerations
65
+
66
+ - Some datasets include sensitive content (e.g., mental health, therapy, and philosophical questions).
67
+ - Outputs are not guaranteed to be factual or safe; use in sensitive contexts should be done with care.
68
+ - Best suited for research, educational tools, and domain-specific applications.