Parallel-R1: Towards Parallel Thinking via Reinforcement Learning Paper • 2509.07980 • Published 13 days ago • 95
Train Long, Think Short: Curriculum Learning for Efficient Reasoning Paper • 2508.08940 • Published Aug 12 • 26
Emergent Hierarchical Reasoning in LLMs through Reinforcement Learning Paper • 2509.03646 • Published 19 days ago • 29
view article Article mmBERT: ModernBERT goes Multilingual By orionweller and 5 others • 14 days ago • 93
Refusal Tokens: A Simple Way to Calibrate Refusals in Large Language Models Paper • 2412.06748 • Published Dec 9, 2024 • 2
ERank Collection A highly effective and efficient pointwise reranker built from a reasoning LLM, which excels across diverse relevance scenarios with low latency • 3 items • Updated 20 days ago • 7
PRELUDE: A Benchmark Designed to Require Global Comprehension and Reasoning over Long Contexts Paper • 2508.09848 • Published Aug 13 • 66
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing Paper • 2508.10881 • Published Aug 14 • 52
Phi-Ground Tech Report: Advancing Perception in GUI Grounding Paper • 2507.23779 • Published Jul 31 • 44
Step-3 is Large yet Affordable: Model-system Co-design for Cost-effective Decoding Paper • 2507.19427 • Published Jul 25 • 18
view article Article AI Companionship: Why We Need to Evaluate How AI Systems Handle Emotional Bonds By giadap and 2 others • Jul 21 • 20
view article Article Seq vs Seq: the Ettin Suite of Paired Encoders and Decoders By orionweller and 5 others • Jul 16 • 68
JinaVDR (Visual Document Retrieval) Collection max. ~1000 images and OCR text included • 42 items • Updated Jul 20 • 6
OpenCodeReasoning-II Collection Reasoning data for supervised finetuning of LLMs to advance code generation and critique • 5 items • Updated 7 days ago • 10