MatAnyone: Stable Video Matting with Consistent Memory Propagation Paper • 2501.14677 • Published 14 days ago • 26
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Paper • 2501.18512 • Published 8 days ago • 25
Tulu 3 Models Collection All models released with Tulu 3 -- state-of-the-art open post-training recipes. • 10 items • Updated 8 days ago • 88
POTION Collection These are the flagship POTION models. Load them and use them with model2vec (https://github.com/MinishLab/model2vec) or sentence-transformers • 5 items • Updated 4 days ago • 10
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 12 days ago • 98
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated 11 days ago • 324
Eagle 2 Collection Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. • 9 items • Updated 15 days ago • 31
Search-o1: Agentic Search-Enhanced Large Reasoning Models Paper • 2501.05366 • Published 29 days ago • 91
TACO Models Collection This collection contains the best-performing TACO models based on LLaMA-3/Qwen2 and SigLIP/CLIP. • 3 items • Updated Dec 20, 2024 • 8
Dolphin 3.0 Collection Dolphin 3.0 is the next generation of the Dolphin series of instruct-tuned models, designed to be the ultimate general-purpose local model. • 9 items • Updated about 8 hours ago • 65
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 132