(Urban) World Model - a JJ-TMT Collection

JJ-TMT 's Collections

(Urban) World Model

Abstract Spatial Intelligence

Urban Spatial Intelligence

(Urban) World Model

updated 5 days ago

CityGaussianV2: Efficient and Geometrically Accurate Reconstruction for Large-Scale Scenes

Paper • 2411.00771 • Published Nov 1, 2024 • 9
SynCity: Training-Free Generation of 3D Worlds

Paper • 2503.16420 • Published Mar 20 • 26
CityDreamer4D: Compositional Generative Model of Unbounded 4D Cities

Paper • 2501.08983 • Published Jan 15 • 20
Streetscapes: Large-scale Consistent Street View Generation Using Autoregressive Video Diffusion

Paper • 2407.13759 • Published Jul 18, 2024 • 18
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

Paper • 2506.03147 • Published Jun 3 • 58
nvidia/Nemotron-Personas

Viewer • Updated Jun 9 • 100k • 18.5k • 174
nvidia/PhysicalAI-Autonomous-Vehicle-Cosmos-Drive-Dreams

Updated Jun 15 • 10.1k • 18
GDAlab/GeoContext-v1

Viewer • Updated May 16 • 4.46k • 43
NUS-UAL/global-streetscapes

Preview • Updated Feb 19 • 1.26k • 30
GPS as a Control Signal for Image Generation

Paper • 2501.12390 • Published Jan 21 • 15
CrossViewDiff: A Cross-View Diffusion Model for Satellite-to-Street View Synthesis

Paper • 2408.14765 • Published Aug 27, 2024 • 15
Tera-AI/STRIDE

Viewer • Updated 21 days ago • 1.7M • 2.29k • 4
ai4ce/CityWalker

Preview • Updated Apr 23 • 155 • 6
Lixsp11/Sekai-Project

Viewer • Updated Jun 27 • 344k • 947 • 33
frodobots/FrodoBots-2K

Updated May 15, 2024 • 202 • 9
LongAnimation: Long Animation Generation with Dynamic Global-Local Memory

Paper • 2507.01945 • Published Jul 2 • 78
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge

Paper • 2507.04447 • Published Jul 6 • 43
Beyond Simple Edits: X-Planner for Complex Instruction-Based Image Editing

Paper • 2507.05259 • Published Jul 7 • 5
EarthCrafter: Scalable 3D Earth Generation via Dual-Sparse Latent Diffusion

Paper • 2507.16535 • Published Jul 22 • 20
MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh

Paper • 2508.01242 • Published Aug 2 • 9
Pixie: Fast and Generalizable Supervised Learning of 3D Physics from Pixels

Paper • 2508.17437 • Published 13 days ago • 33
MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds

Paper • 2508.14879 • Published 13 days ago • 63