fnlp
/

Text Generation
Safetensors
llama
TaoJi's picture
Update README.md
a42f300 verified
metadata
license: apache-2.0
datasets:
  - HuggingFaceTB/smollm-corpus
base_model:
  - HuggingFaceTB/SmolLM-1.7B
pipeline_tag: text-generation

Research Paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs"