Text Generation
Transformers
Safetensors
liger_gla
Inference Endpoints
Liger-GLA-8B / README.md
landisen's picture
Upload folder using huggingface_hub
e67d872 verified
metadata
license: apache-2.0
library_name: transformers
pipeline_tag: text-generation

Liger-GLA-8B

[πŸ“‚ GitHub] [πŸ“œ Liger] [πŸ“‘ Paper]

We introduce Liger-GLA-8B, a gated linear recurrent model linearized from Transformer-based LLM.

Our Liger framework is compatible with various linear recurrent models with gating structures:

Model Name Base Model Linear Structure HF Link
Liger-GLA-8B Llama-3-8B GLA πŸ€— link
Liger-GSA-8B Llama-3-8B GSA πŸ€— link

Citation

If you find this repo useful, please cite and star our work:

@article{lan2025liger,
  title={Liger: Linearizing Large Language Models to Gated Recurrent Structures},
  author={Lan, Disen and Sun, Weigao and Hu, Jiaxi and Du, Jusen and Cheng, Yu},
  journal={arXiv preprint arXiv:2503.01496},
  year={2025}
}