new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Dec 22

Submitted by

taesiri

Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows

·
107 authors

Submitted by

birdxp

PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence

DeepCybo

2

Submitted by

taesiri

When Reasoning Meets Its Laws

·
7 authors

Submitted by

taesiri

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

ByteDance-Seed

Submitted by

shilongz

Both Semantics and Reconstruction Matter: Making Representation Encoders Ready for Text-to-Image Generation and Editing

adobe

Submitted by

cmhungsteve

4D-RGPT: Toward Region-level 4D Understanding via Perceptual Distillation

nvidia

1

Submitted by

shuaishuaicdp

Are We on the Right Way to Assessing LLM-as-a-Judge?

Submitted by

Chaoxu0309

An Anatomy of Vision-Language-Action Models: From Modules to Milestones and Challenges

IRootech

IROOTECH TECHNOLOGY

Submitted by

danielgilo

RadarGen: Automotive Radar Point Cloud Generation from Cameras

·
5 authors

Submitted by

tobiaslee

GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation

XiaomiMiMo

2

Submitted by

zhuzeyuan

Physics of Language Models: Part 4.1, Architecture Design and the Magic of Canon Layers

Submitted by

Jiaqi-hkust

Robust-R1: Degradation-Aware Reasoning for Robust Visual Understanding

·
10 authors

1

Submitted by

ljb121002

Turn-PPO: Turn-Level Advantage Estimation with PPO for Improved Multi-Turn RL in Agentic LLMs

·
6 authors

1

Submitted by

GSerussi

HERBench: A Benchmark for Multi-Evidence Integration in Video Question Answering

Insight-bgu

Submitted by

taesiri

Animate Any Character in Any World

·
5 authors

Submitted by

benjamin

Bolmo: Byteifying the Next Generation of Language Models

allenai

Ai2

Submitted by

taesiri

SWE-Bench++: A Framework for the Scalable Generation of Software Engineering Benchmarks from Open-Source Repositories

TuringEnterprises

Submitted by

senmaonk

StageVAR: Stage-Aware Acceleration for Visual Autoregressive Models

NankaiUniversity

Nankai University

Submitted by

yljblues

Meta-RL Induces Exploration in Language Agents

·
5 authors

Submitted by

JDihlmann

3D-RE-GEN: 3D Reconstruction of Indoor Scenes with a Generative Framework

·
3 authors

Submitted by

sahalshajim

A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos

MBZUAI

Mohamed Bin Zayed University of Artificial Intelligence

Submitted by

cohennoa

MineTheGap: Automatic Mining of Biases in Text-to-Image Models

·
4 authors