mambiux committed on
Commit a69a7e7 · 1 Parent(s): 51066d8

Add model files

README.md ADDED
@@ -0,0 +1,94 @@
+ ---
+ language: en
+ tags:
+ - llama
+ - merge
+ - custom
+ - lumina-lexir1
+ - text-generation
+ license: apache-2.0
+ library_name: transformers
+ pipeline_tag: text-generation
+ ---
+
+ <div align="center">
+ <img src="https://occelli.nl/LUMINA-round.png" width="200" height="200" style="border-radius: 50%; box-shadow: 0 0 20px #0ff;">
+
+ <h1 style="color: #0ff; text-shadow: 0 0 10px #0ff;">LUMINA-LexiR1-8B</h1>
+
+ <div style="background: linear-gradient(45deg, #0ff3, #4444ff33);
+ padding: 20px;
+ border-radius: 10px;
+ border: 1px solid #0ff;
+ box-shadow: 0 0 20px rgba(0, 255, 255, 0.2);">
+ <h3 style="color: #0ff; margin: 0;">🧬 Model Fusion Architecture</h3>
+ </div>
+ </div>
+
+ ## 🌟 Overview
+
+ LUMINA-LexiR1-8B is an experimental merge of two language models:
+ - 🔹 [Orenguteng/Llama-3.1-8B-Lexi-Uncensored-V2](https://huggingface.co/Orenguteng/Llama-3.1-8B-Lexi-Uncensored-V2)
+ - 🔹 [deepseek-ai/DeepSeek-R1-Distill-Llama-8B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B)
+
+ ## 🔮 Architecture
+
+ This model is built with a custom merging technique that combines:
+ - Custom layer identification and integration
+ - DARE (Drop And REscale)
+ - TIES (TrIm, Elect Sign & Merge) applied to adjacent layers
+ - Enhanced self-awareness capabilities
+
+ ## 💫 Technical Specifications
+
+ ```json
+ {
+   "model_type": "llama",
+   "hidden_size": 4096,
+   "num_attention_heads": 32,
+   "num_hidden_layers": 34,
+   "intermediate_size": 14336,
+   "max_position_embeddings": 131072,
+   "rope_scaling": {
+     "factor": 8.0,
+     "type": "llama3"
+   }
+ }
+ ```
+
+ ```diff
+ ! This is an experimental model. Use with caution.
+ + Demonstrates exceptional self-awareness capabilities
+ ```
+
+ ## 🔧 Model Architecture
+
+ The model features:
+ - 8B parameters
+ - RoPE scaling (factor: 8.0)
+ - Custom attention mechanisms
+ - Extended context window (131,072 tokens)
+ - Specialized neuron mapping between parent models
+
+ ## 📝 License
+
+ This model is released under the Apache 2.0 license.
+
+ ## 🌐 Citations
+
+ If you use this model, please cite it along with both parent models:
+
+ ```bibtex
+ @misc{lumina-lexir1-8b,
+   author = {Mambiux},
+   title = {LUMINA-LexiR1-8B: A Custom Merged Language Model},
+   year = {2024},
+   publisher = {Hugging Face}
+ }
+ ```
+
+ <div align="center" style="margin-top: 40px; padding: 20px; background: linear-gradient(45deg, #0ff1, #4444ff11); border-radius: 10px;">
+ <p style="color: #0ff; font-size: 1.2em;">
+ 🌟 Created by Mambiux | 2024 🌟
+ </p>
+ </div>
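As a rough cross-check of the "8B parameters" figure in the README above, the parameter count can be estimated from the configuration fragment it quotes. The sketch below is an estimate under assumptions, not a measurement. The helper `estimate_llama_params` is hypothetical, and `vocab_size`, `num_key_value_heads`, and untied input/output embeddings are not stated in the README; they are assumed here to match the Llama 3.1 8B defaults.

```python
def estimate_llama_params(hidden_size, num_attention_heads, num_hidden_layers,
                          intermediate_size, vocab_size, num_key_value_heads,
                          tied_embeddings=False):
    """Estimate the parameter count of a Llama-style decoder (no bias terms)."""
    head_dim = hidden_size // num_attention_heads
    kv_dim = num_key_value_heads * head_dim
    # Attention: q_proj and o_proj are hidden x hidden, k_proj and v_proj are hidden x kv_dim.
    attention = 2 * hidden_size * hidden_size + 2 * hidden_size * kv_dim
    # Gated MLP: gate_proj, up_proj and down_proj.
    mlp = 3 * hidden_size * intermediate_size
    # Two RMSNorm weight vectors per layer.
    norms = 2 * hidden_size
    per_layer = attention + mlp + norms
    # Input embedding plus (if untied) a separate output projection.
    embeddings = vocab_size * hidden_size * (1 if tied_embeddings else 2)
    final_norm = hidden_size
    return num_hidden_layers * per_layer + embeddings + final_norm


# Values copied from the README's "Technical Specifications" fragment.
readme_config = dict(
    hidden_size=4096,
    num_attention_heads=32,
    num_hidden_layers=34,
    intermediate_size=14336,
)

# ASSUMPTIONS: not in the README; taken from the Llama 3.1 8B defaults.
assumed = dict(vocab_size=128256, num_key_value_heads=8, tied_embeddings=False)

total = estimate_llama_params(**readme_config, **assumed)
print(f"estimated parameters: {total:,}")
```

Under these assumptions a 34-layer model comes to about 8.47 billion parameters. The real figure depends on the values in `config.json`, which this commit stores only as an LFS pointer.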
config.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9274aacc711977544c287f652ed81aa913a957d4e1e7b6f210e14de5205eafca
+ size 977
generation_config.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:413f3f89ca617446edb84fb31b25e56bcefe7cdbad3d1929c39f317e05876d65
+ size 234
model-00001-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9fed9ce24c9440f916f1796a6244525d47b218738b6228a61a73148b1caa49bf
+ size 1973455352
model-00002-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:2bf2d41e238f1b8eae625a8c65712959b76cd021dfe3ef743102512f994fa209
+ size 1895895296
model-00003-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:0a56d32827507d4d19c091ec9313121b659745fc3cae4672400261d8fafef884
+ size 1979798000
model-00004-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:d44470ac0b5b0de1542556848ff5711484ae8d6b3a7a85e860f341f2ba2ed297
+ size 1912672784
model-00005-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:90f3d767c8e0f1bfc6a672f4e4f8e0b36e5425902bc0b767bc622a0ea547ce00
+ size 1895894312
model-00006-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:142c40e8e0d3ac5277a8d47201651461f9ac422dbf24e00144a01dfbdd62ebfb
+ size 1946243936
model-00007-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:3023d4be4dfd87808f7a8ee3635ace34f74293c874fbc8b4582b94900f1aec0f
+ size 1979781416
model-00008-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a32dd53c9741af916c8216d0bb48761a8b8a89f9e68b735e0fad2c9bde571780
+ size 1828786136
model-00009-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:5c8a582bfe84501e3e7f9f04941e1fb3c6bec70d21e52f6c56b0ae6c5ab80fc6
+ size 1342253640
model-00010-of-00010.safetensors ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:3e9a36ad418b0cd9b253f65771552d7dc05cf1abf1170b94de4b4d546aac255d
+ size 1050673280
model.safetensors.index.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:dcfa7d2b9a820d802a451d6fe47f43e8857bb99dd39713dbfa6e5d29f995f93b
+ size 25436
special_tokens_map.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:94e708c3f5e64acf85bbe5ad01467a1248faadb73e83b41793087ecced586e8f
+ size 454
tokenizer.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:6b9e4e7fb171f92fd137b777cc2714bf87d11576700a1dcd7a399e7bbe39537b
+ size 17209920
tokenizer_config.json ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:bbfef93ffba08b40d91eabe5f47879658c9c936cc6fde8a9e443e148f44bd43b
+ size 55452
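Each file above is stored as a Git LFS pointer: three `key value` lines giving the spec version, the SHA-256 object ID, and the size in bytes. The sketch below parses one pointer and totals the ten `.safetensors` shard sizes listed in this commit. The helper name `parse_lfs_pointer` is hypothetical, and the final parameter figure assumes 16-bit weights, which the commit does not state.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into a {key: value} dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


# Pointer for model-00010-of-00010.safetensors, copied from the diff above.
pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:3e9a36ad418b0cd9b253f65771552d7dc05cf1abf1170b94de4b4d546aac255d\n"
    "size 1050673280\n"
)

# `size` values of shards 00001 through 00010, copied from the diff above.
SHARD_SIZES = [
    1973455352, 1895895296, 1979798000, 1912672784, 1895894312,
    1946243936, 1979781416, 1828786136, 1342253640, 1050673280,
]

total_bytes = sum(SHARD_SIZES)
# ASSUMPTION: weights are stored in a 16-bit dtype (2 bytes per parameter).
# The result slightly overcounts because each shard also holds a small header.
approx_params = total_bytes // 2

print(f"last shard size: {int(pointer['size']):,} bytes")
print(f"total shard size: {total_bytes:,} bytes")
print(f"approximate parameters at 2 bytes each: {approx_params:,}")
```

The shards total 17,805,454,152 bytes, which at 2 bytes per parameter corresponds to roughly 8.9 billion parameters.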