
Gausson Tschen

Gausson
https://www.xiaohongshu.com/user/profile/615bd9080000000002018213
  • GaussonTschen

AI & ML interests

LLM Architecture, Pre-training, Deep Neural Network Optimization, Sparsity

Recent Activity

new activity about 1 month ago
nvidia/kvpress-leaderboard: Upload the results of the training-free version of the method [SepLLM - ICML 2025 Paper](https://arxiv.org/abs/2412.12094) based on "meta-llama/Meta-Llama-3.1-8B-Instruct"
updated a model about 1 month ago
Gausson/sep_cache
updated a model about 1 month ago
transformers-community/sep_cache

Organizations

  • Data Intelligence Lab@HKU
  • Transformers Community

Gausson's models (9)

Gausson/sep_cache

8B • Updated Aug 4 • 1.13k • 1

Gausson/pythia-160m-deduped-n64-SepLLM

0.2B • Updated Jul 2 • 2

Gausson/pythia-160m-deduped-n64h-SepLLM

0.2B • Updated Jul 2 • 3

Gausson/pythia-160m-deduped-n64-StreamingLLM

0.2B • Updated Jul 2 • 9

Gausson/pythia-160m-deduped-n64-RoBiPE-SepLLM

0.2B • Updated Jul 2 • 2

Gausson/pythia-160m-deduped-n128-SepLLM

0.2B • Updated Jul 2 • 3

Gausson/pythia-160m-deduped-SepLLM

0.2B • Updated Jul 2 • 14

Gausson/pythia-160m-deduped-n64ht-SepLLM

0.2B • Updated Jul 2 • 3

Gausson/gpt-neox-125m-deduped-SA

0.2B • Updated Jul 2 • 3