Submitted by akhaliq 35 Octopus: Embodied Vision-Language Programmer from Environmental Feedback · 11 authors 4
Submitted by akhaliq 33 Lemur: Harmonizing Natural Language and Code for Language Agents · 16 authors 3
Submitted by akhaliq 17 GaussianDreamer: Fast Generation from Text to 3D Gaussian Splatting with Point Cloud Priors · 8 authors 2
Submitted by akhaliq 17 Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation · 7 authors 6
Submitted by akhaliq 16 HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion · 9 authors 1
Submitted by akhaliq 15 MotionDirector: Motion Customization of Text-to-Video Diffusion Models · 8 authors 5