Text Generation
PyTorch
GGUF
English
Albanian
llama
al
math
philosophy
chemistry
code
biology
climate
Not-For-All-Audiences
conversational
Update README.md
Browse files
README.md
CHANGED
|
@@ -35,4 +35,34 @@ tags:
|
|
| 35 |
- biology
|
| 36 |
- climate
|
| 37 |
- not-for-all-audiences
|
| 38 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 35 |
- biology
|
| 36 |
- climate
|
| 37 |
- not-for-all-audiences
|
| 38 |
+
---
|
| 39 |
+
|
| 40 |
+
<p align="center">
|
| 41 |
+
<span style="color:yellow">This model is not suitable for all audiences and may contain inappropriate or explicit content.</span>
|
| 42 |
+
</p>
|
| 43 |
+
|
| 44 |
+
<p align="center">
|
| 45 |
+
<img src="https://cdn-uploads.huggingface.co/production/uploads/67b7476deb48853c39ca000b/CzUTg97aTxK283qwD6kEm.png" alt="Teuta Logo" />
|
| 46 |
+
</p>
|
| 47 |
+
|
| 48 |
+
# Teuta
|
| 49 |
+
|
| 50 |
+
Teuta is a bilingual instruction-tuned language model designed for question answering in both Albanian (sq) and English (en). It is fine-tuned on a diverse mix of datasets covering subjects such as mathematics, philosophy, chemistry, biology, code (especially Rust), psychology, and climate science.
|
| 51 |
+
|
| 52 |
+
## Model
|
| 53 |
+
|
| 54 |
+
- **Base model**: meta-llama/Llama-3.2-3B
|
| 55 |
+
- **Languages**: Albanian, English
|
| 56 |
+
- **Primary task**: Instruction-following and question answering
|
| 57 |
+
|
| 58 |
+
## Description
|
| 59 |
+
|
| 60 |
+
Teuta is built to handle a variety of instructional prompts, from academic and scientific queries to more open-ended reasoning tasks. It is particularly suited for multilingual applications and under-resourced language support, with a strong focus on Albanian.
|
| 61 |
+
|
| 62 |
+
The model leverages both synthetic and real datasets to improve generalization across technical and non-technical domains.
|
| 63 |
+
|
| 64 |
+
## Considerations
|
| 65 |
+
|
| 66 |
+
- Some datasets include sensitive content (e.g., mental health, therapy, and philosophical questions).
|
| 67 |
+
- Outputs are not guaranteed to be factual or safe; use in sensitive contexts should be done with care.
|
| 68 |
+
- Best suited for research, educational tools, and domain-specific applications.
|