Daily Papers

Submitted by

Lingmin-Ran

31

TPDiff: Temporal Pyramid Video Diffusion Model

·
2 authors

1

Submitted by

akhaliq

27

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

·
8 authors

1

Submitted by

hyeonho-jeong-video

23

Reangle-A-Video: 4D Video Generation as Video-to-Video Translation

·
3 authors

1

Submitted by

GuyYariv

10

RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling

·
3 authors

1

Submitted by

yijunyang

10

GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training

·
6 authors

1

Submitted by

LihiShalmon

10

Motion Anything: Any to Motion Generation

·
10 authors

3

Submitted by

Devy1

6

Quantizing Large Language Models for Code Generation: A Differentiated Replication

·
5 authors

1

Submitted by

Asaf-Yehudai

5

WildIFEval: Instruction Following in the Wild

·
4 authors

2

Submitted by

KevinQHLin

4

VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary

·
2 authors

1

Submitted by

chen-yingfa

3

Cost-Optimal Grouped-Query Attention for Long-Context LLMs

·
5 authors

1

Submitted by

SingleZombie

3

Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space

·
4 authors

1

Submitted by

VityaVitalich

3

Self-Taught Self-Correction for Small Language Models

·
3 authors

1

Submitted by

ll-13

3

When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning

·
8 authors

1

Submitted by

gberta

2

BIMBA: Selective-Scan Compression for Long-Range Video Question Answering

·
5 authors

1

Submitted by

yrshi

2

Multimodal Language Modeling for High-Accuracy Single Cell Transcriptomics Analysis and Generation

·
7 authors

1

Submitted by

sakharamg

2

Multi Agent based Medical Assistant for Edge Devices

·
8 authors

1

Submitted by

Robot2050

1

MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System

·
8 authors

2

Submitted by

Sailorzzcc

-

Monte Carlo Diffusion for Generalizable Learning-Based RANSAC

·
4 authors

1

Submitted by

tkreiman

-

Understanding and Mitigating Distribution Shifts For Machine Learning Force Fields

·
2 authors

2

Submitted by

mspitzna

-

PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations?

·
3 authors

1

byAK and the research community

TPDiff: Temporal Pyramid Video Diffusion Model

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Reangle-A-Video: 4D Video Generation as Video-to-Video Translation

RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling

GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training

More Documents, Same Length: Isolating the Challenge of Multiple Documents in RAG

Motion Anything: Any to Motion Generation

Quantizing Large Language Models for Code Generation: A Differentiated Replication

WildIFEval: Instruction Following in the Wild

VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary

Cost-Optimal Grouped-Query Attention for Long-Context LLMs

Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space

Self-Taught Self-Correction for Small Language Models

When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning

BIMBA: Selective-Scan Compression for Long-Range Video Question Answering

Multimodal Language Modeling for High-Accuracy Single Cell Transcriptomics Analysis and Generation

Multi Agent based Medical Assistant for Edge Devices

MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System

Monte Carlo Diffusion for Generalizable Learning-Based RANSAC

Understanding and Mitigating Distribution Shifts For Machine Learning Force Fields

PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations?