new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

by AK and the research community

Feb 21

Submitted by

akhaliq

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

·
17 authors

Submitted by

akhaliq

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

·
14 authors

Submitted by

akhaliq

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

·
95 authors

Submitted by

msalnikov

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

·
7 authors

Submitted by

akhaliq

S*: Test Time Scaling for Code Generation

·
9 authors

Submitted by

akhaliq

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

·
10 authors

Submitted by

basil2115

Discovering highly efficient low-weight quantum error-correcting codes with reinforcement learning

·
2 authors

Submitted by

tsq2000

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models

·
11 authors

Submitted by

vvibt

S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

·
9 authors

Submitted by

Minbyul

Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information

·
5 authors

Submitted by

xhyandwyy

PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC

·
11 authors

Submitted by

akhaliq

Dynamic Concepts Personalization from Single Videos

·
8 authors

Submitted by

akhaliq

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

·
11 authors

Submitted by

arkilpatel

How to Get Your LLM to Generate Challenging Problems for Evaluation

·
3 authors

Submitted by

Zheyuan22

NAVIG: Natural Language-guided Analysis with Vision Language Models for Image Geo-localization

·
4 authors

Submitted by

akhaliq

RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers

·
11 authors

Submitted by

yhshu

From RAG to Memory: Non-Parametric Continual Learning for Large Language Models

·
5 authors

Submitted by

akhaliq

AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO

·
2 authors

Submitted by

vansin

LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention

·
10 authors

Submitted by

YuchengShi

Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data

·
5 authors

Submitted by

chtmp223

CLIPPER: Compression enables long-context synthetic data generation

·
3 authors

Submitted by

wmying

Generating Skyline Datasets for Data Science Models

·
5 authors

Submitted by

Breadbang

LLM-based User Profile Management for Recommender System

·
2 authors

Submitted by

michiyasunaga

Multimodal RewardBench: Holistic Evaluation of Reward Models for Vision Language Models

·
3 authors

Submitted by

saadob12

How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild

·
3 authors

Submitted by

dwright37

Unstructured Evidence Attribution for Long Context Query Focused Summarization

·
5 authors

Submitted by

Ziruibest

Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework

·
9 authors

Submitted by

nielsr

Generating $π$-Functional Molecules Using STGG+ with Active Learning

·
5 authors

Submitted by

danielwusg

Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images

·
4 authors