Zihan Liu

LiuZH-19

AI & ML interests

None yet

Organizations

None yet

LiuZH-19's activity

upvoted a paper 1 day ago

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Paper • 2502.08590 • Published 2 days ago • 34

liked a dataset 4 days ago

HKUSTAudio/Llasa_opensource_speech_data_160k_hours_tokenized

Updated 1 day ago • 383 • 21

upvoted a paper 4 days ago

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Paper • 2502.05173 • Published 7 days ago • 60

upvoted 2 papers about 1 month ago

OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Paper • 2501.05510 • Published Jan 9 • 39

Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction

Paper • 2501.03218 • Published Jan 6 • 35

upvoted a paper about 2 months ago

IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations

Paper • 2412.12083 • Published Dec 16, 2024 • 12

upvoted a paper 2 months ago

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Paper • 2412.09596 • Published Dec 12, 2024 • 94

upvoted 3 papers 4 months ago

PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction

Paper • 2410.17247 • Published Oct 22, 2024 • 45

MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models

Paper • 2410.17637 • Published Oct 23, 2024 • 34

SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree

Paper • 2410.16268 • Published Oct 21, 2024 • 67

liked a Space 5 months ago

Advanced MIDI Renderer

🎹

Transform and render any MIDI

liked a dataset 6 months ago

seungheondoh/LP-MusicCaps-MSD

Viewer • Updated Aug 1, 2023 • 514k • 70 • 31

liked a model 8 months ago

internlm/internlm-xcomposer2d5-7b

Visual Question Answering • Updated Jul 22, 2024 • 700k • 199

upvoted a paper 8 months ago

InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output

Paper • 2407.03320 • Published Jul 3, 2024 • 93

liked a Space 9 months ago

Vocals

🔥