haikuoxin's picture

haikuoxin

haikuoxin

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams

upvoted a collection about 2 months ago

Flux Kontext LoRAs

liked a model 3 months ago

DFloat11/Qwen-Image-Edit-2509-DF11

View all activity

Organizations

None yet

upvoted a paper 1 day ago

InfiniteVGGT: Visual Geometry Grounded Transformer for Endless Streams

Paper • 2601.02281 • Published 4 days ago • 28

upvoted a collection about 2 months ago

Flux Kontext LoRAs

Flux Kontext LoRAs trained by the community • 9 items • Updated Jul 21, 2025 • 5

liked 4 models 3 months ago

DFloat11/Qwen-Image-Edit-2509-DF11

Updated Sep 30, 2025 • 58 • 20

lightx2v/Qwen-Image-Lightning

Text-to-Image • Updated Nov 3, 2025 • 537k • • 746

Insta360-Research/DiT360-Panorama-Image-Generation

Text-to-Image • Updated Oct 17, 2025 • 1.21k • 20

mit-han-lab/nunchaku-flux.1-kontext-dev

Image-to-Image • Updated Jul 21, 2025 • 11.8k • 167

liked a model 6 months ago

zhang0jhon/flux_wavelet_v2_sc

Text-to-Image • Updated Jun 3, 2025 • 3 • 5

liked a Space 7 months ago

Image Arena Leaderboard

Image Generation and Image Editing Arena & Leaderboard

liked a model 7 months ago

KevinHuang/DreamCube

Image-to-3D • Updated Jun 24, 2025 • 51 • 11

upvoted a paper 7 months ago

Splatting Physical Scenes: End-to-End Real-to-Sim from Imperfect Robot Data

Paper • 2506.04120 • Published Jun 4, 2025 • 7

upvoted a paper 9 months ago

SphereDiff: Tuning-free Omnidirectional Panoramic Image and Video Generation via Spherical Latent Representation

Paper • 2504.14396 • Published Apr 19, 2025 • 27

liked a model 9 months ago

ysmikey/Layerpano3D-FLUX-Panorama-LoRA

Text-to-Image • Updated Feb 8, 2025 • • 14

liked a model 10 months ago

tencent/Hunyuan3D-2

Image-to-3D • Updated Oct 17, 2025 • 62.3k • 1.69k

liked 2 Spaces 10 months ago

MIDI 3D

Image to Compositional 3D Scene Generation

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

upvoted a collection 10 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 10 days ago • 550

upvoted a collection 11 months ago

DeepSeek R1 (All Versions)

DeepSeek-R1-0528 is here! The most powerful reasoning open LLM, available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 37 items • Updated 16 days ago • 261

upvoted a paper about 1 year ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1, 2025 • 109

commented a paper about 1 year ago

LumiNet: Latent Intrinsics Meets Diffusion Models for Indoor Scene Relighting

Paper • 2412.00177 • Published Nov 29, 2024 • 8 •

upvoted a collection about 1 year ago

Relighting

6 items • Updated Dec 16, 2024 • 1