Kyu Song

kyunocap

AI & ML interests

None yet

Organizations

None yet

kyunocap's activity

upvoted a paper 2 days ago

Magic 1-For-1: Generating One Minute Video Clips within One Minute

Paper • 2502.07701 • Published 3 days ago • 24

liked a model 4 days ago

DAMO-NLP-SG/VideoLLaMA3-7B

Visual Question Answering • Updated 15 days ago • 6.88k • 33

upvoted 2 papers 7 days ago

ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features

Paper • 2502.04320 • Published 8 days ago • 31

DynVFX: Augmenting Real Videos with Dynamic Content

Paper • 2502.03621 • Published 9 days ago • 27

upvoted 2 papers 9 days ago

LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer

Paper • 2502.01105 • Published 12 days ago • 16

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published 10 days ago • 162

upvoted a paper 10 days ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published 12 days ago • 171

liked a Space 16 days ago

1.13k

FLUX Prompt Generator

😻

Display a user interface for various tasks

upvoted a paper 16 days ago

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 20 days ago • 57

upvoted 2 papers 23 days ago

Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback

Paper • 2501.12895 • Published 24 days ago • 56

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 24 days ago • 318

upvoted a paper 28 days ago

Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published 29 days ago • 67

upvoted 5 papers about 1 month ago

SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training

Paper • 2412.09619 • Published Dec 12, 2024 • 23

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Paper • 2501.05874 • Published Jan 10 • 67

liked a Space about 1 month ago

233

TransPixar

😻

https://huggingface.co/papers/2501.03006

upvoted a paper about 1 month ago

VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLM

Paper • 2501.00599 • Published Dec 31, 2024 • 41

upvoted a paper 2 months ago

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 139