view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 22 days ago • 363
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 8 days ago • 89
Large Language Model Agent: A Survey on Methodology, Applications and Challenges Paper • 2503.21460 • Published 7 days ago • 67
view article Article LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone! 27 days ago • 48
view article Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality about 1 month ago • 71
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper • 2502.08910 • Published Feb 13 • 147
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published Feb 10 • 148
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 12 items • Updated Feb 20 • 72
HelpSteer2: Open-source dataset for training top-performing reward models Paper • 2406.08673 • Published Jun 12, 2024 • 19
Crossmodal-3600: A Massively Multilingual Multimodal Evaluation Dataset Paper • 2205.12522 • Published May 25, 2022 • 2
BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation Paper • 2402.03216 • Published Feb 5, 2024 • 5
CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings Paper • 2501.01257 • Published Jan 2 • 52