Reward-Guided Speculative Decoding for Efficient LLM Reasoning Paper • 2501.19324 • Published Jan 31, 2025 • 40
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data Paper • 2405.14333 • Published May 23, 2024 • 41
Leveraging Training Data in Few-Shot Prompting for Numerical Reasoning Paper • 2305.18170 • Published May 29, 2023 • 2