fnlp
/

Text Generation
Safetensors
llama
File size: 303 Bytes
41f4185
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
---
license: apache-2.0
datasets:
- HuggingFaceTB/smollm-corpus
base_model:
- HuggingFaceTB/SmolLM-135M
pipeline_tag: text-generation
---

**Research Paper** ["Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs"](https://arxiv.org/abs/2502.14837)