nexa-collaboration/output_llama3-1_8b_distillation_from_sparse Text Generation • Updated 18 days ago • 9
nexa-collaboration/gptqmodel-5000-g32-noreorder-tulu3sft-Llama3.2-1B-instruct-4bit-2 Updated 21 days ago • 31
nexa-collaboration/gptqmodel-5000-g32-noreorder-tulu3sft-Llama3.2-1B-instruct-4bit-2 Updated 21 days ago • 31
Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models Paper • 2408.15518 • Published Aug 28, 2024 • 43
Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models Paper • 2408.15518 • Published Aug 28, 2024 • 43
Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models Paper • 2408.15518 • Published Aug 28, 2024 • 43
Dolphin: Long Context as a New Modality for Energy-Efficient On-Device Language Models Paper • 2408.15518 • Published Aug 28, 2024 • 43
Octo-planner: On-device Language Model for Planner-Action Agents Paper • 2406.18082 • Published Jun 26, 2024 • 48
Octo-planner: On-device Language Model for Planner-Action Agents Paper • 2406.18082 • Published Jun 26, 2024 • 48