Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Nebius AI Studio
fal
SambaNova
Fireworks
Hyperbolic
Replicate
Novita
Cerebras
Together AI
HF Inference API
Misc
Reset Misc
arxiv:
2408.15237
AutoTrain Compatible
Inference Endpoints
text-generation-inference
Misc with no match
Eval Results
Merge
4-bit precision
8-bit precision
custom_code
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
21
Full-text search
Edit filters
Sort: Trending
Active filters:
2408.15237
Clear all
JunxiongWang/mamba_0_5_dpo_ep1
Text Generation
•
Updated
Sep 2, 2024
•
14
JunxiongWang/mamba_0_5_dpo_ep3
Text Generation
•
Updated
Sep 2, 2024
•
68
JunxiongWang/mamba_0_875_dpo_ep3
Text Generation
•
Updated
Sep 2, 2024
•
26
•
1
JunxiongWang/mamba_0_875_dpo_ep1
Text Generation
•
Updated
Sep 2, 2024
•
8
JunxiongWang/mamba_0_75_dpo_ep3
Text Generation
•
Updated
Sep 2, 2024
•
31
JunxiongWang/mamba_0_75_dpo_ep1
Text Generation
•
Updated
Sep 2, 2024
•
11
JunxiongWang/MambaInLlama_0_50
Updated
Sep 2, 2024
•
34
JunxiongWang/Mamba2InLlama_0_50
Updated
Sep 2, 2024
•
55
JunxiongWang/MambaInLlama_0_75
Updated
Sep 2, 2024
•
23
JunxiongWang/Mamba2InLlama_0_75
Updated
Sep 2, 2024
•
12
JunxiongWang/Mamba2InLlama_0_875
Updated
Sep 2, 2024
•
22
JunxiongWang/MambaInLlama_0_875
Updated
Sep 2, 2024
•
21
JunxiongWang/Mamba2InLlama_1
Updated
Sep 2, 2024
•
69
•
1
JunxiongWang/Llama3.2-Mamba2-3B-distill
Updated
Nov 17, 2024
•
205
JunxiongWang/Llama3.2-Mamba2-3B-dpo
Updated
Nov 17, 2024
•
26
JunxiongWang/Llama3.1-Mamba2-8B-distill
Updated
Nov 17, 2024
•
16
JunxiongWang/Llama3.2-Mamba-3B-distill
Updated
Nov 17, 2024
•
38
JunxiongWang/Llama3.1-Mamba-8B-distill
Updated
Nov 17, 2024
•
16
JunxiongWang/Llama3.1-Mamba2-8B-dpo
Updated
Nov 17, 2024
•
13
JunxiongWang/Llama3.1-Mamba-8B-dpo
Updated
Nov 17, 2024
•
20
JunxiongWang/Llama3.2-Mamba-3B-dpo
Updated
Nov 17, 2024
•
24