EpiDiff: Enhancing Multi-View Synthesis via Localized Epipolar-Constrained Diffusion Paper β’ 2312.06725 β’ Published Dec 11, 2023 β’ 1
Supervision Exists Everywhere: A Data Efficient Contrastive Language-Image Pre-training Paradigm Paper β’ 2110.05208 β’ Published Oct 11, 2021
GVGEN: Text-to-3D Generation with Volumetric Representation Paper β’ 2403.12957 β’ Published Mar 19, 2024 β’ 6
BEVBert: Multimodal Map Pre-training for Language-guided Navigation Paper β’ 2212.04385 β’ Published Dec 8, 2022
Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT Paper β’ 2406.18583 β’ Published Jun 5, 2024
GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models Paper β’ 2406.14550 β’ Published Jun 20, 2024 β’ 4
DreamCraft3D++: Efficient Hierarchical 3D Generation with Multi-Plane Reconstruction Model Paper β’ 2410.12928 β’ Published Oct 16, 2024
MIDI: Multi-Instance Diffusion for Single Image to 3D Scene Generation Paper β’ 2412.03558 β’ Published Dec 4, 2024 β’ 17
PSHuman: Photorealistic Single-view Human Reconstruction using Cross-Scale Diffusion Paper β’ 2409.10141 β’ Published Sep 16, 2024 β’ 1
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models Paper β’ 2502.06608 β’ Published Feb 10 β’ 33
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models Paper β’ 2502.06608 β’ Published Feb 10 β’ 33
TripoSG: High-Fidelity 3D Shape Synthesis using Large-Scale Rectified Flow Models Paper β’ 2502.06608 β’ Published Feb 10 β’ 33