3 17 9

Haokun Lin

Felix1023

https://felixmessi.github.io/

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

authored a paper 3 months ago

LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Image and Video Generation

upvoted a paper 3 months ago

LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Image and Video Generation

View all activity

Organizations

upvoted a paper about 1 month ago

TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

Paper • 2512.14698 • Published Dec 16, 2025 • 21

authored a paper 3 months ago

LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Image and Video Generation

Paper • 2508.03485 • Published Aug 5, 2025 • 2

upvoted 2 papers 3 months ago

LRQ-DiT: Log-Rotation Post-Training Quantization of Diffusion Transformers for Image and Video Generation

Paper • 2508.03485 • Published Aug 5, 2025 • 2

From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model

Paper • 2510.19871 • Published Oct 22, 2025 • 30

liked 2 models 3 months ago

ByteDance/Video-As-Prompt-CogVideoX-5B

Image-to-Video • Updated Oct 27, 2025 • 47 • 21

ByteDance/Video-As-Prompt-Wan2.1-14B

Image-to-Video • Updated Oct 27, 2025 • 63 • 46

upvoted a collection 3 months ago

Video-As-Prompt

Collection

The model zoo for "Video-As-Prompt: Unified Semantic Control for Video Generation" • 3 items • Updated Oct 27, 2025 • 13

liked a dataset 3 months ago

BianYx/VAP-Data

Viewer • Updated Oct 30, 2025 • 90.1k • 10.2k • 21

upvoted a paper 3 months ago

Video-As-Prompt: Unified Semantic Control for Video Generation

Paper • 2510.20888 • Published Oct 23, 2025 • 49

liked a model 3 months ago

JunhaoZhuang/FlashVSR

Video-to-Video • Updated Dec 10, 2025 • 169

upvoted a paper 5 months ago

HuMo: Human-Centric Video Generation via Collaborative Multi-Modal Conditioning

Paper • 2509.08519 • Published Sep 10, 2025 • 128

liked a model 5 months ago

TencentARC/IC-Custom

Image-to-Image • Updated Aug 31, 2025 • 6 • 16

upvoted a paper 5 months ago

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

Paper • 2508.20088 • Published Aug 27, 2025 • 21

New activity in TencentARC/TokLIP 5 months ago

Add pipeline tag and fix image path in model card

#1 opened 5 months ago by

nielsr

authored a paper 5 months ago

Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs

Paper • 2508.14896 • Published Aug 20, 2025 • 22

upvoted a paper 5 months ago

Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs

Paper • 2508.14896 • Published Aug 20, 2025 • 22

commented a paper 5 months ago

Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs

Paper • 2508.14896 • Published Aug 20, 2025 • 22 •

updated a model 5 months ago

TencentARC/TokLIP

Image-Text-to-Text • Updated Aug 21, 2025 • 13 • 13

liked a model 5 months ago

TencentARC/ToonComposer

Image-to-Video • Updated Aug 15, 2025 • 32

upvoted a paper 5 months ago

ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing

Paper • 2508.10881 • Published Aug 14, 2025 • 52

Haokun Lin

AI & ML interests

Recent Activity

Organizations

Felix1023's activity

Add pipeline tag and fix image path in model card