Submitted by akhaliq 95 Design2Code: How Far Are We From Automating Front-End Engineering? · 5 authors 2
Submitted by akhaliq 63 Scaling Rectified Flow Transformers for High-Resolution Image Synthesis · 17 authors 3
Submitted by akhaliq 30 OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on · 4 authors 2
Submitted by akhaliq 28 MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies · 7 authors 6
Submitted by akhaliq 19 DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models · 7 authors 2
Submitted by akhaliq 16 InfiMM-HD: A Leap Forward in High-Resolution Multimodal Understanding · 10 authors 1
Submitted by akhaliq 15 ResAdapter: Domain Consistent Resolution Adapter for Diffusion Models · 10 authors 1
Submitted by akhaliq 9 ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models · 8 authors 1
Submitted by akhaliq 8 Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation · 7 authors 1
Submitted by akhaliq 6 3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos · 6 authors