Gausson Tschen
Gausson
2 followers · 3 following
https://www.xiaohongshu.com/user/profile/615bd9080000000002018213
GaussonTschen
AI & ML interests
LLM Architecture, Pre-training, Deep Neural Network Optimization, Sparsity
Recent Activity
New activity, about 1 month ago, in nvidia/kvpress-leaderboard: Upload the results of the training-free version of the method [SepLLM - ICML 2025 Paper](https://arxiv.org/abs/2412.12094) based on "meta-llama/Meta-Llama-3.1-8B-Instruct"
Updated a model, about 1 month ago: Gausson/sep_cache
Updated a model, about 1 month ago: transformers-community/sep_cache
Gausson's models (9)
Gausson/sep_cache • 8B • Updated Aug 4 • 1.13k • 1
Gausson/pythia-160m-deduped-n64-SepLLM • 0.2B • Updated Jul 2 • 2
Gausson/pythia-160m-deduped-n64h-SepLLM • 0.2B • Updated Jul 2 • 3
Gausson/pythia-160m-deduped-n64-StreamingLLM • 0.2B • Updated Jul 2 • 9
Gausson/pythia-160m-deduped-n64-RoBiPE-SepLLM • 0.2B • Updated Jul 2 • 2
Gausson/pythia-160m-deduped-n128-SepLLM • 0.2B • Updated Jul 2 • 3
Gausson/pythia-160m-deduped-SepLLM • 0.2B • Updated Jul 2 • 14
Gausson/pythia-160m-deduped-n64ht-SepLLM • 0.2B • Updated Jul 2 • 3
Gausson/gpt-neox-125m-deduped-SA • 0.2B • Updated Jul 2 • 3