Taha Ansari

Tahahah

AI & ML interests

None yet

Recent Activity

liked a model 12 days ago

google/medasr

upvoted a paper 15 days ago

Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed

liked a model about 2 months ago

renderartist/technically-color-wan

View all activity

Organizations

upvoted a paper 15 days ago

Efficient-DLM: From Autoregressive to Diffusion Language Models, and Beyond in Speed

Paper • 2512.14067 • Published 16 days ago • 13

upvoted a paper 10 months ago

SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation

Paper • 2503.09641 • Published Mar 12, 2025 • 41

upvoted an article 10 months ago

Article

FastRTC: The Real-Time Communication Library for Python

Feb 25, 2025

•

172

upvoted 5 papers 11 months ago

DarwinLM: Evolutionary Structured Pruning of Large Language Models

Paper • 2502.07780 • Published Feb 11, 2025 • 18

CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation

Paper • 2502.08639 • Published Feb 12, 2025 • 43

upvoted an article 11 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4, 2025

•

1.31k

upvoted 5 papers 11 months ago

DeepFlow: Serverless Large Language Model Serving at Scale

Paper • 2501.14417 • Published Jan 24, 2025 • 3

DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation

Paper • 2501.16764 • Published Jan 28, 2025 • 22

DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning

Paper • 2411.04983 • Published Nov 7, 2024 • 13

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31, 2025 • 124

Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs

Paper • 2501.18585 • Published Jan 30, 2025 • 61

upvoted a paper about 1 year ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 48

upvoted a collection about 1 year ago

VILA-U-7B

Collection

VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation • 2 items • Updated Jul 3, 2025 • 5

upvoted a paper about 2 years ago

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Paper • 2312.17172 • Published Dec 28, 2023 • 30