Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Fireworks
Cerebras
SambaNova
Together AI
Replicate
Novita
fal
Hyperbolic
Nebius AI Studio
HF Inference API
Misc
Reset Misc
arxiv:
2306.11695
AutoTrain Compatible
text-generation-inference
Inference Endpoints
custom_code
4-bit precision
8-bit precision
Misc with no match
Eval Results
Merge
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
50
Full-text search
Edit filters
Sort: Trending
Active filters:
2306.11695
Clear all
wang7776/Llama-2-7b-chat-hf-30-attention-sparsity
Text Generation
•
Updated
Feb 5, 2024
•
12
wang7776/vicuna-7b-v1.3-attention-sparsity-10
Text Generation
•
Updated
Feb 5, 2024
•
5
wang7776/vicuna-7b-v1.3-attention-sparsity-30
Text Generation
•
Updated
Feb 5, 2024
•
8
wang7776/Mistral-7B-Instruct-v0.2-attention-sparsity-10
Text Generation
•
Updated
Feb 5, 2024
•
6
wang7776/Mistral-7B-Instruct-v0.2-attention-sparsity-30
Text Generation
•
Updated
Feb 5, 2024
•
7
kettleguts/zephyr-7b-beta_sparse05
Text Generation
•
Updated
Mar 27, 2024
•
20
RichardErkhov/wang7776_-_vicuna-7b-v1.3-sparsity-10-gguf
Updated
Jul 25, 2024
•
27
RichardErkhov/wang7776_-_Mistral-7B-Instruct-v0.2-sparsity-30-v0.1-gguf
Updated
Jul 25, 2024
•
24
RichardErkhov/wang7776_-_Llama-2-7b-chat-hf-10-sparsity-gguf
Updated
Jul 31, 2024
•
1
RichardErkhov/wang7776_-_Llama-2-7b-chat-hf-30-sparsity-gguf
Updated
Jul 31, 2024
•
1
RichardErkhov/wang7776_-_Llama-2-7b-chat-hf-20-attention-sparsity-gguf
Updated
Aug 1, 2024
RichardErkhov/wang7776_-_vicuna-7b-v1.3-attention-sparsity-20-gguf
Updated
Aug 1, 2024
•
62
RichardErkhov/wang7776_-_vicuna-7b-v1.3-sparsity-20-gguf
Updated
Aug 18, 2024
•
43
RichardErkhov/wang7776_-_Llama-2-7b-chat-hf-10-sparsity-4bits
Updated
Sep 2, 2024
•
4
RichardErkhov/wang7776_-_Llama-2-7b-chat-hf-10-sparsity-8bits
Updated
Sep 2, 2024
•
4
RichardErkhov/wang7776_-_vicuna-7b-v1.3-attention-sparsity-30-gguf
Updated
Sep 3, 2024
•
25
RichardErkhov/kettleguts_-_zephyr-7b-beta_sparse05-gguf
Updated
Sep 21, 2024
•
90
wang7776/Llama-2-7b-chat-hf-40-sparsity
Text Generation
•
Updated
Sep 25, 2024
•
10
gbai24/SparseLLM
Updated
Sep 29, 2024
RichardErkhov/IntelLabs_-_sqft-phi-3-mini-4k-50-base-gguf
Updated
Oct 23, 2024
•
25
Previous
1
2
Next