Alfie Devine

alf16Devine

AI & ML interests

structured visual recognition

Recent Activity

upvoted a paper about 1 month ago

MOSPA: Human Motion Generation Driven by Spatial Audio

upvoted a paper about 1 month ago

SpatialTrackerV2: 3D Point Tracking Made Easy

upvoted a paper about 1 month ago

Seq vs Seq: An Open Suite of Paired Encoders and Decoders

Organizations

None yet

upvoted 20 papers about 1 month ago

MOSPA: Human Motion Generation Driven by Spatial Audio

Paper • 2507.11949 • Published Jul 16 • 23

SpatialTrackerV2: 3D Point Tracking Made Easy

Paper • 2507.12462 • Published Jul 16 • 16

Seq vs Seq: An Open Suite of Paired Encoders and Decoders

Paper • 2507.11412 • Published Jul 15 • 25

DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering

Paper • 2507.11527 • Published Jul 15 • 31

MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding

Paper • 2507.12463 • Published Jul 16 • 26

Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs

Paper • 2507.09477 • Published Jul 13 • 80

PhysX: Physical-Grounded 3D Asset Generation

Paper • 2507.12465 • Published Jul 16 • 43

SWE-Perf: Can Language Models Optimize Code Performance on Real-World Repositories?

Paper • 2507.12415 • Published Jul 16 • 41

Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning

Paper • 2507.14137 • Published Jul 18 • 34

A Data-Centric Framework for Addressing Phonetic and Prosodic Challenges in Russian Speech Generative Models

Paper • 2507.13563 • Published Jul 17 • 51

The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs

Paper • 2507.11097 • Published Jul 15 • 63

MiroMind-M1: An Open-Source Advancement in Mathematical Reasoning via Context-Aware Multi-Stage Policy Optimization

Paper • 2507.14683 • Published Jul 19 • 128

The Invisible Leash: Why RLVR May Not Escape Its Origin

Paper • 2507.14843 • Published Jul 20 • 84

Task-Specific Zero-shot Quantization-Aware Training for Object Detection

Paper • 2507.16782 • Published Jul 22 • 9

Iwin Transformer: Hierarchical Vision Transformer using Interleaved Windows

Paper • 2507.18405 • Published Jul 24 • 4

DriftMoE: A Mixture of Experts Approach to Handle Concept Drifts

Paper • 2507.18464 • Published Jul 24 • 11

LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization

Paper • 2507.15758 • Published Jul 21 • 34

Captain Cinema: Towards Short Movie Generation

Paper • 2507.18634 • Published Jul 24 • 40

Rep-MTL: Unleashing the Power of Representation-level Task Saliency for Multi-Task Learning

Paper • 2507.21049 • Published Jul 28 • 41

Geometric-Mean Policy Optimization

Paper • 2507.20673 • Published Jul 28 • 31