93 256

Mwangi PRO

Benson

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Multimodal OCR: Parse Anything from Documents

upvoted a paper 4 days ago

NLE: Non-autoregressive LLM-based ASR by Transcript Editing

upvoted a paper 4 days ago

MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss

View all activity

Organizations

None yet

upvoted 3 papers 4 days ago

Multimodal OCR: Parse Anything from Documents

Paper • 2603.13032 • Published 6 days ago • 29

NLE: Non-autoregressive LLM-based ASR by Transcript Editing

Paper • 2603.08397 • Published 10 days ago • 21

MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss

Paper • 2508.05772 • Published Aug 7, 2025 • 3

upvoted a paper 9 days ago

Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought

Paper • 2505.19877 • Published May 26, 2025 • 1

upvoted a paper 23 days ago

A Very Big Video Reasoning Suite

Paper • 2602.20159 • Published 24 days ago • 515

upvoted a collection about 1 month ago

HumanLM Models

Collection

https://humanlm.stanford.edu/ • 1 item • Updated Feb 13 • 1

upvoted a paper about 1 month ago

PaperBanana: Automating Academic Illustration for AI Scientists

Paper • 2601.23265 • Published Jan 30 • 219

upvoted 4 papers about 2 months ago

upvoted a collection 2 months ago

Personalized Reasoning

Collection

9 items • Updated Oct 15, 2025 • 5

upvoted a paper 2 months ago

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Paper • 2512.24271 • Published Dec 30, 2025 • 64

upvoted 2 articles 3 months ago

Article

How to make NeuTTS-air generate over 200 seconds of audio in a single second.

Nov 21, 2025

•

Article

LLM based Audio models

Dec 18, 2025

•

upvoted 5 papers 3 months ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published Dec 15, 2025 • 65

Graph of Verification: Structured Verification of LLM Reasoning with Directed Acyclic Graphs

Paper • 2506.12509 • Published Jun 14, 2025 • 2

Scaling Zero-Shot Reference-to-Video Generation

Paper • 2512.06905 • Published Dec 7, 2025 • 29

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published Dec 4, 2025 • 175

PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing

Paper • 2512.02589 • Published Dec 2, 2025 • 73

Mwangi PRO

AI & ML interests

Recent Activity

Organizations

Benson's activity

How to make NeuTTS-air generate over 200 seconds of audio in a single second.

LLM based Audio models