new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Apr 15

Submitted by

Weiyun1025

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

·
47 authors

Submitted by

LIKirin

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

·
5 authors

7

Submitted by

cuijiaxing

Have we unified image generation and understanding yet? An empirical study of GPT-4o's image generation ability

·
3 authors

2

Submitted by

wenhu

VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning

·
6 authors

Submitted by

starriver030515

FUSION: Fully Integration of Vision-Language Representations for Deep Cross-Modal Understanding

·
7 authors

Submitted by

mponty

Iterative Self-Training for Code Generation via Reinforced Re-Ranking

·
3 authors

Submitted by

DogNeverSleep

Mavors: Multi-granularity Video Representation for Multimodal Large Language Model

·
15 authors

Submitted by

xhluca

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

·
10 authors

Submitted by

AIRobotZ

S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models

·
5 authors

Submitted by

ztwang

DUMP: Automated Distribution-Level Curriculum Learning for RL-based LLM Post-training

·
4 authors

2

Submitted by

isaacchung

MIEB: Massive Image Embedding Benchmark

·
10 authors

Submitted by

brucelyu

SocioVerse: A World Model for Social Simulation Powered by LLM Agents and A Pool of 10 Million Real-World Users

·
21 authors

3

Submitted by

leoozy

Breaking the Data Barrier -- Building GUI Agents Through Task Generalization

·
7 authors

Submitted by

Zhang199

TinyLLaVA-Video-R1: Towards Smaller LMMs for Video Reasoning

·
4 authors

3

Submitted by

akhaliq

M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

·
6 authors

Submitted by

yyamada

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

·
8 authors

3

Submitted by

codezakh

Executable Functional Abstractions: Inferring Generative Programs for Advanced Math Problems

·
5 authors

Submitted by

sewon

Reasoning Models Can Be Effective Without Thinking

·
6 authors

2

Submitted by

LibraTree

VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search

·
8 authors

Submitted by

parshinsh

LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models

·
6 authors

Submitted by

akhaliq

How new data permeates LLM knowledge and how to dilute it

·
8 authors

Submitted by

ChrisJuan

EmoAgent: Assessing and Safeguarding Human-AI Interaction for Mental Health Safety

·
10 authors

Submitted by

mqliu

LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models

·
11 authors

2

Submitted by

SteveZeyuZhang

3D CoCa: Contrastive Learners are 3D Captioners

·
4 authors

2

Submitted by

Rexhaif

DeepSeek vs. o3-mini: How Well can Reasoning LLMs Evaluate MT and Summarization?

·
8 authors

2

Submitted by

kpzhang996

MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models

·
20 authors

2

Submitted by

johnhalloran

MCP Safety Audit: LLMs with the Model Context Protocol Allow Major Security Exploits

·
2 authors

Submitted by

SteveZeyuZhang

DiffuMural: Restoring Dunhuang Murals with Multi-scale Diffusion

·
9 authors

2