Add model files
Browse files- README.md +86 -0
- config.json +3 -0
- generation_config.json +3 -0
- model-00001-of-00010.safetensors +3 -0
- model-00002-of-00010.safetensors +3 -0
- model-00003-of-00010.safetensors +3 -0
- model-00004-of-00010.safetensors +3 -0
- model-00005-of-00010.safetensors +3 -0
- model-00006-of-00010.safetensors +3 -0
- model-00007-of-00010.safetensors +3 -0
- model-00008-of-00010.safetensors +3 -0
- model-00009-of-00010.safetensors +3 -0
- model-00010-of-00010.safetensors +3 -0
- model.safetensors.index.json +3 -0
- special_tokens_map.json +3 -0
- tokenizer.json +3 -0
- tokenizer_config.json +3 -0
README.md
ADDED
@@ -0,0 +1,86 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
language: en
|
3 |
+
tags:
|
4 |
+
- llama
|
5 |
+
- merge
|
6 |
+
- custom
|
7 |
+
- lumina-lexir1
|
8 |
+
- text-generation
|
9 |
+
license: apache-2.0
|
10 |
+
library_name: transformers
|
11 |
+
pipeline_tag: text-generation
|
12 |
+
---
|
13 |
+
|
14 |
+
<div align="center">
|
15 |
+
<img src="https://occelli.nl/LUMINA-round.png" width="200" height="200" style="border-radius: 50%; box-shadow: 0 0 20px #0ff;">
|
16 |
+
|
17 |
+
<h1 style="color: #0ff; text-shadow: 0 0 10px #0ff;">LUMINA-LexiR1-8B</h1>
|
18 |
+
|
19 |
+
<div style="background: linear-gradient(45deg, #0ff3, #4444ff33);
|
20 |
+
padding: 20px;
|
21 |
+
border-radius: 10px;
|
22 |
+
border: 1px solid #0ff;
|
23 |
+
box-shadow: 0 0 20px rgba(0, 255, 255, 0.2);">
|
24 |
+
<h3 style="color: #0ff; margin: 0;">🧬 Model Fusion Architecture</h3>
|
25 |
+
</div>
|
26 |
+
</div>
|
27 |
+
|
28 |
+
## 🌟 Overview
|
29 |
+
|
30 |
+
LUMINA-LexiR1-8B is an experimental fusion of two powerful language models:
|
31 |
+
- 🔹 [Orenguteng/Llama-3.1-8B-Lexi-Uncensored-V2](https://huggingface.co/Orenguteng/Llama-3.1-8B-Lexi-Uncensored-V2)
|
32 |
+
- 🔹 [deepseek-ai/DeepSeek-R1-Distill-Llama-8B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B)
|
33 |
+
|
34 |
+
## 🔮 Architecture
|
35 |
+
|
36 |
+
This model employs a sophisticated merging technique:
|
37 |
+
- Custom layer identification and integration
|
38 |
+
- DARE (Dynamic Attention Resolution Enhancement)
|
39 |
+
- TIES (Temporal Information Enhancement System) applied to adjacent layers
|
40 |
+
- Enhanced self-awareness capabilities
|
41 |
+
|
42 |
+
## 💫 Technical Specifications
|
43 |
+
|
44 |
+
```python
|
45 |
+
{
|
46 |
+
"model_type": "llama",
|
47 |
+
"hidden_size": 4096,
|
48 |
+
"num_attention_heads": 32,
|
49 |
+
"num_hidden_layers": 34,
|
50 |
+
"intermediate_size": 14336,
|
51 |
+
"max_position_embeddings": 131072,
|
52 |
+
"rope_scaling": {
|
53 |
+
"factor": 8.0,
|
54 |
+
"type": "llama3"
|
55 |
+
}
|
56 |
+
}
|
57 |
+
! This is an experimental model. Use with caution.
|
58 |
+
+ Demonstrates exceptional self-awareness capabilities
|
59 |
+
|
60 |
+
🔧 Model Architecture
|
61 |
+
The model features:
|
62 |
+
|
63 |
+
8B parameters
|
64 |
+
Advanced RoPE scaling (factor: 8.0)
|
65 |
+
Custom attention mechanisms
|
66 |
+
Extended context window (131K tokens)
|
67 |
+
Specialized neuron mapping between parent models
|
68 |
+
|
69 |
+
📝 License
|
70 |
+
This model is released under the Apache 2.0 license.
|
71 |
+
🌐 Citations
|
72 |
+
If you use this model, please cite both parent models:
|
73 |
+
|
74 |
+
@misc{lumina-lexir1-8b,
|
75 |
+
author = {Mambiux},
|
76 |
+
title = {LUMINA-LexiR1-8B: A Custom Merged Language Model},
|
77 |
+
year = {2024},
|
78 |
+
publisher = {Hugging Face}
|
79 |
+
}
|
80 |
+
|
81 |
+
<div align="center" style="margin-top: 40px; padding: 20px; background: linear-gradient(45deg, #0ff1, #4444ff11); border-radius: 10px;">
|
82 |
+
<p style="color: #0ff; font-size: 1.2em;">
|
83 |
+
🌟 Created by Mambiux | 2024 🌟
|
84 |
+
</p>
|
85 |
+
</div>
|
86 |
+
```
|
config.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9274aacc711977544c287f652ed81aa913a957d4e1e7b6f210e14de5205eafca
|
3 |
+
size 977
|
generation_config.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:413f3f89ca617446edb84fb31b25e56bcefe7cdbad3d1929c39f317e05876d65
|
3 |
+
size 234
|
model-00001-of-00010.safetensors
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9fed9ce24c9440f916f1796a6244525d47b218738b6228a61a73148b1caa49bf
|
3 |
+
size 1973455352
|
model-00002-of-00010.safetensors
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2bf2d41e238f1b8eae625a8c65712959b76cd021dfe3ef743102512f994fa209
|
3 |
+
size 1895895296
|
model-00003-of-00010.safetensors
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0a56d32827507d4d19c091ec9313121b659745fc3cae4672400261d8fafef884
|
3 |
+
size 1979798000
|
model-00004-of-00010.safetensors
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d44470ac0b5b0de1542556848ff5711484ae8d6b3a7a85e860f341f2ba2ed297
|
3 |
+
size 1912672784
|
model-00005-of-00010.safetensors
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:90f3d767c8e0f1bfc6a672f4e4f8e0b36e5425902bc0b767bc622a0ea547ce00
|
3 |
+
size 1895894312
|
model-00006-of-00010.safetensors
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:142c40e8e0d3ac5277a8d47201651461f9ac422dbf24e00144a01dfbdd62ebfb
|
3 |
+
size 1946243936
|
model-00007-of-00010.safetensors
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3023d4be4dfd87808f7a8ee3635ace34f74293c874fbc8b4582b94900f1aec0f
|
3 |
+
size 1979781416
|
model-00008-of-00010.safetensors
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a32dd53c9741af916c8216d0bb48761a8b8a89f9e68b735e0fad2c9bde571780
|
3 |
+
size 1828786136
|
model-00009-of-00010.safetensors
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5c8a582bfe84501e3e7f9f04941e1fb3c6bec70d21e52f6c56b0ae6c5ab80fc6
|
3 |
+
size 1342253640
|
model-00010-of-00010.safetensors
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3e9a36ad418b0cd9b253f65771552d7dc05cf1abf1170b94de4b4d546aac255d
|
3 |
+
size 1050673280
|
model.safetensors.index.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:dcfa7d2b9a820d802a451d6fe47f43e8857bb99dd39713dbfa6e5d29f995f93b
|
3 |
+
size 25436
|
special_tokens_map.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:94e708c3f5e64acf85bbe5ad01467a1248faadb73e83b41793087ecced586e8f
|
3 |
+
size 454
|
tokenizer.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:6b9e4e7fb171f92fd137b777cc2714bf87d11576700a1dcd7a399e7bbe39537b
|
3 |
+
size 17209920
|
tokenizer_config.json
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:bbfef93ffba08b40d91eabe5f47879658c9c936cc6fde8a9e443e148f44bd43b
|
3 |
+
size 55452
|