Implicit Search via Discrete Diffusion: A Study on Chess • Paper • 2502.19805 • Published 15 days ago • 1
Teaching Language Models to Critique via Reinforcement Learning • Paper • 2502.03492 • Published Feb 5 • 24
ZeroGen: Efficient Zero-shot Learning via Dataset Generation • Paper • 2202.07922 • Published Feb 16, 2022 • 1
Scaling Diffusion Language Models via Adaptation from Autoregressive Models • Paper • 2410.17891 • Published Oct 23, 2024 • 16
Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning • Paper • 2410.14157 • Published Oct 18, 2024
Language Versatilists vs. Specialists: An Empirical Revisiting on Multilingual Transfer Ability • Paper • 2306.06688 • Published Jun 11, 2023
Generating Data for Symbolic Language with Large Language Models • Paper • 2305.13917 • Published May 23, 2023
Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models • Paper • 2402.07754 • Published Feb 12, 2024
VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment • Paper • 2410.09421 • Published Oct 12, 2024
VLRewardBench: A Challenging Benchmark for Vision-Language Generative Reward Models • Paper • 2411.17451 • Published Nov 26, 2024 • 11
Calibrating Reasoning in Language Models with Internal Consistency • Paper • 2405.18711 • Published May 29, 2024 • 6
Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models • Paper • 2404.12387 • Published Apr 18, 2024 • 39
Silkie: Preference Distillation for Large Visual Language Models • Paper • 2312.10665 • Published Dec 17, 2023 • 11
Future-conditioned Unsupervised Pretraining for Decision Transformer • Paper • 2305.16683 • Published May 26, 2023
CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling • Paper • 2210.07661 • Published Oct 14, 2022