view article Article ScreenEnv: Deploy your full stack Desktop Agent By A-Mahla and 1 other β’ Jul 10 β’ 64
Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge Paper β’ 2506.21506 β’ Published Jun 26 β’ 51
view article Article π€ππ¬π₯οΈπ Kimi-VL-A3B-Thinking-2506: A Quick Navigation By moonshotai and 1 other β’ Jun 21 β’ 68
view article Article ScreenSuite - The most comprehensive evaluation suite for GUI Agents! Jun 6 β’ 53
view article Article Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H By Hcompany and 1 other β’ Jun 3 β’ 70
view article Article TinyAgents: A Minimal Experiment with Code Agents and MCP Tools By albertvillanova β’ May 16 β’ 30