PILAF: Optimal Human Preference Sampling for Reward Modeling • Paper • 2502.04270 • Published about 15 hours ago • 1
Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis • Paper • 2502.04128 • Published about 18 hours ago • 5
Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization • Paper • 2502.04295 • Published about 14 hours ago • 3
BOLT: Bootstrap Long Chain-of-Thought in Language Models without Distillation • Paper • 2502.03860 • Published 1 day ago • 3
UltraIF: Advancing Instruction Following from the Wild • Paper • 2502.04153 • Published about 17 hours ago • 13
ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization • Paper • 2502.04306 • Published about 14 hours ago • 7
Large Language Model Guided Self-Debugging Code Generation • Paper • 2502.02928 • Published 2 days ago • 6
Riddle Me This! Stealthy Membership Inference for Retrieval-Augmented Generation • Paper • 2502.00306 • Published 6 days ago • 2
A Probabilistic Inference Approach to Inference-Time Scaling of LLMs using Particle-Based Monte Carlo Methods • Paper • 2502.01618 • Published 4 days ago • 5
Token Assorted: Mixing Latent and Text Tokens for Improved Language Model Reasoning • Paper • 2502.03275 • Published 1 day ago • 7
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model • Paper • 2502.02737 • Published 2 days ago • 107
Boosting Multimodal Reasoning with MCTS-Automated Structured Thinking • Paper • 2502.02339 • Published 3 days ago • 11
Text-to-CAD Generation Through Infusing Visual Feedback in Large Language Models • Paper • 2501.19054 • Published 7 days ago • 6
Sample, Scrutinize and Scale: Effective Inference-Time Search by Scaling Verification • Paper • 2502.01839 • Published 3 days ago • 3
Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models • Paper • 2501.19389 • Published 7 days ago • 2
Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial? • Paper • 2502.00674 • Published 5 days ago • 8