UI-Venus Technical Report: Building High-performance UI Agents with RFT Paper • 2508.10833 • Published 19 days ago
XYLayoutLM: Towards Layout-Aware Multimodal Networks For Visually-Rich Document Understanding Paper • 2203.06947 • Published Mar 14, 2022
Mobile User Interface Element Detection Via Adaptively Prompt Tuning Paper • 2305.09699 • Published May 16, 2023
Backpropagation Path Search On Adversarial Transferability Paper • 2308.07625 • Published Aug 15, 2023
DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark Paper • 2405.19707 • Published May 30, 2024
E-ANT: A Large-Scale Dataset for Efficient Automatic GUI NavigaTion Paper • 2406.14250 • Published Jun 20, 2024