FuseChat 3.0
Preference Optimization for Implicit Model Fusion
- Paper • 2412.03187 • Published • 9
FuseAI/FuseChat-Llama-3.1-8B-Instruct
Updated • 207 • 9Note Final DPO version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.1-8B-Instruct.
FuseAI/FuseChat-Llama-3.2-3B-Instruct
Updated • 39 • 4Note Final DPO version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.2-3B-Instruct.
FuseAI/FuseChat-Llama-3.2-1B-Instruct
Updated • 11 • 4Note Final DPO version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.2-1B-Instruct.
FuseAI/FuseChat-Qwen-2.5-7B-Instruct
Updated • 179 • 9Note Final DPO version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Qwen-2.5-7B-Instruct.
FuseAI/FuseChat-Gemma-2-9B-Instruct
Updated • 122 • 5Note Final DPO version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Gemma-2-9B-Instruct.
FuseAI/FuseChat-Llama-3.1-8B-SFT
Updated • 111Note SFT version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.1-8B-Instruct.
FuseAI/FuseChat-Llama-3.2-3B-SFT
Updated • 10Note SFT version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.2-3B-Instruct.
FuseAI/FuseChat-Llama-3.2-1B-SFT
Updated • 14Note SFT version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Llama-3.2-1B-Instruct.
FuseAI/FuseChat-Qwen-2.5-7B-SFT
Updated • 9Note SFT version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Qwen-2.5-7B-Instruct.
FuseAI/FuseChat-Gemma-2-9B-SFT
Updated • 8 • 1Note SFT version of FuseChat-3.0. Source LLMs: Gemma-2-27B-It, Mistral-Large-Instruct-2407, Qwen-2.5-72B-Instruct, and Llama-3.1-70B-Instruct. Target LLM: Gemma-2-9B-Instruct.
FuseAI/FuseChat-3.0-SFT-Data
Viewer • Updated • 94.5kNote SFT dataset for FuseChat-3.0.
FuseAI/FuseChat-3.0-DPO-Data
Viewer • Updated • 64.1kNote DPO dataset for FuseChat-3.0.