DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 16 days ago • 302
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding? Paper • 2501.05510 • Published 29 days ago • 39
AIGS: Generating Science from AI-Powered Automated Falsification Paper • 2411.11910 • Published Nov 17, 2024
Enhancing Multilingual Capabilities of Large Language Models through Self-Distillation from Resource-Rich Languages Paper • 2402.12204 • Published Feb 19, 2024 • 1
Enabling Weak LLMs to Judge Response Reliability via Meta Ranking Paper • 2402.12146 • Published Feb 19, 2024
An Extensible Plug-and-Play Method for Multi-Aspect Controllable Text Generation Paper • 2212.09387 • Published Dec 19, 2022
Towards Unified Alignment Between Agents, Humans, and Environment Paper • 2402.07744 • Published Feb 12, 2024 • 3
Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization Paper • 2310.02170 • Published Oct 3, 2023 • 2
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments Paper • 2404.07972 • Published Apr 11, 2024 • 48
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence Paper • 2404.05892 • Published Apr 8, 2024 • 33
LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images Paper • 2403.11703 • Published Mar 18, 2024 • 17
Sherpa3D: Boosting High-Fidelity Text-to-3D Generation via Coarse 3D Prior Paper • 2312.06655 • Published Dec 11, 2023 • 24
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models Paper • 2312.04410 • Published Dec 7, 2023 • 15
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model Paper • 2311.13231 • Published Nov 22, 2023 • 27
Tuna: Instruction Tuning using Feedback from Large Language Models Paper • 2310.13385 • Published Oct 20, 2023 • 11
From Knowledge Distillation to Self-Knowledge Distillation: A Unified Approach with Normalized Loss and Customized Soft Labels Paper • 2303.13005 • Published Mar 23, 2023