R1 is out! And with a lot of other R1-related models...
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control Paper • 2501.03847 • Published about 1 month ago • 23
Instruction-based Image Manipulation by Watching How Things Move Paper • 2412.12087 • Published Dec 16, 2024 • 1
OpenAnimalTracks: A Dataset for Animal Track Recognition Paper • 2406.09647 • Published Jun 14, 2024
SBS Figures: Pre-training Figure QA from Stage-by-Stage Synthesized Images Paper • 2412.17606 • Published Dec 23, 2024 • 5
@s3nh Hey man, check your Discord! Got some news.
Offline Data Enhanced On-Policy Policy Gradient with Provable Guarantees Paper • 2311.08384 • Published Nov 14, 2023
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL Paper • 2402.19446 • Published Feb 29, 2024
Autonomous Evaluation and Refinement of Digital Agents Paper • 2404.06474 • Published Apr 9, 2024 • 2
$BT^2$: Backward-compatible Training with Basis Transformation Paper • 2211.03989 • Published Nov 8, 2022
DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning Paper • 2406.11896 • Published Jun 14, 2024 • 20
Aligning Large Language Models with Representation Editing: A Control Perspective Paper • 2406.05954 • Published Jun 10, 2024
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning Paper • 2405.10292 • Published May 16, 2024 • 1
FakeSpotter: A Simple yet Robust Baseline for Spotting AI-Synthesized Fake Faces Paper • 1909.06122 • Published Sep 13, 2019
Zero-Day Backdoor Attack against Text-to-Image Diffusion Models via Personalization Paper • 2305.10701 • Published May 18, 2023