gao

ym9

ym9's activity

upvoted a paper about 14 hours ago

TPDiff: Temporal Pyramid Video Diffusion Model

Paper • 2503.09566 • Published 1 day ago • 32

upvoted a paper 9 days ago

How far can we go with ImageNet for Text-to-Image generation?

Paper • 2502.21318 • Published 13 days ago • 25

upvoted 2 papers about 1 month ago

Fast Video Generation with Sliding Tile Attention

Paper • 2502.04507 • Published Feb 6 • 49

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Paper • 2502.05173 • Published Feb 7 • 64

upvoted 3 papers about 2 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 346

Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Paper • 2501.12202 • Published Jan 21 • 35

TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space

Paper • 2501.12224 • Published Jan 21 • 46

liked a model about 2 months ago

TencentARC/flux-mini

Text-to-Image • Updated Nov 29, 2024 • 122 • 88

upvoted a paper 2 months ago

1.58-bit FLUX

Paper • 2412.18653 • Published Dec 24, 2024 • 80

upvoted 2 papers 3 months ago

Large Motion Video Autoencoding with Cross-modal Video VAE

Paper • 2412.17805 • Published Dec 23, 2024 • 24

TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation

Paper • 2412.03069 • Published Dec 4, 2024 • 32

liked a Space 4 months ago

478

Kolors Portrait With Flux

🤗

Kolors Portrait to keep face identity developed with Flux

upvoted a collection 7 months ago

Papers I want to read

Collection

Papers in my to-read list • 259 items • Updated Jan 10 • 30

upvoted a paper 8 months ago

MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions

Paper • 2407.06358 • Published Jul 8, 2024 • 19

liked a dataset 8 months ago

TencentARC/MiraData

Viewer • Updated Jul 19, 2024 • 475k • 283 • 29

liked a Space 10 months ago

493

Chat-with-GPT4o

🚀

Generate conversational responses using text input

liked 2 Spaces 11 months ago

807

Face to All

👨

AI filter for your portraits

1.36k

InstantMesh

📚

Create a 3D model from an image in 10 seconds!

upvoted a paper 12 months ago

GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation

Paper • 2403.14621 • Published Mar 21, 2024 • 16

liked a model about 1 year ago

huggyllama/llama-7b

Text Generation • Updated Jul 2, 2024 • 180k • 320