Submitted by akhaliq 49 MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series · 45 authors 3
Submitted by akhaliq 22 Self-Exploring Language Models: Active Preference Elicitation for Online Alignment · 7 authors 1
Submitted by akhaliq 21 T2V-Turbo: Breaking the Quality Bottleneck of Video Consistency Model with Mixed Reward Feedback · 7 authors 1
Submitted by akhaliq 17 LLMs achieve adult human performance on higher-order theory of mind tasks · 10 authors 7
Submitted by akhaliq 14 Nearest Neighbor Speculative Decoding for LLM Generation and Attribution · 7 authors
Submitted by akhaliq 14 Offline Regularised Reinforcement Learning for Large Language Models Alignment · 18 authors
Submitted by akhaliq 12 EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture · 8 authors 1
Submitted by akhaliq 10 Value-Incentivized Preference Optimization: A Unified Approach to Online and Offline RLHF · 9 authors
Submitted by akhaliq 9 SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation · 7 authors
Submitted by akhaliq 8 Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and Fabrication · 8 authors