Update README.md
README.md
CHANGED
@@ -18,8 +18,8 @@ HRWKV7-hxa079-Qwen3-14B is an experimental hybrid architecture model that combin
 
 - **Architecture:** RWKV v7 based "hxa079" Architecture + Group Query Attention Hybrid
 - **Total Layers:** 40 layers (L40D5120)
-  - 34 RWKV layers
-  - 6 GQA layers
+  - 34 RWKV layers (with Rope)
+  - 6 GQA layers (No Rope, No Position Embeddings)
 - **Hidden Dimension:** 5120
 - **Training Context Window:** 4096 tokens
 