MLGym: A New Framework and Benchmark for Advancing AI Research Agents Paper • 2502.14499 • Published 21 days ago • 179
TabReD: A Benchmark of Tabular Machine Learning in-the-Wild Paper • 2406.19380 • Published Jun 27, 2024 • 49
SpecExec: Massively Parallel Speculative Decoding for Interactive LLM Inference on Consumer Devices Paper • 2406.02532 • Published Jun 4, 2024 • 13
Does Diffusion Beat GAN in Image Super Resolution? Paper • 2405.17261 • Published May 27, 2024 • 20
Learn Your Reference Model for Real Good Alignment Paper • 2404.09656 • Published Apr 15, 2024 • 84