Update README.md
Browse files
README.md
CHANGED
|
@@ -6,9 +6,9 @@ tags:
|
|
| 6 |
- 3d-generation
|
| 7 |
- rectified-flow
|
| 8 |
---
|
| 9 |
-
# TripoSG
|
| 10 |
|
| 11 |
-
TripoSG-scribble is a variant of TripoSG. TripoSG is a state-of-the-art image-to-3D generation foundation model that leverages large-scale rectified flow transformers to produce high-fidelity 3D shapes from single images.
|
| 12 |
|
| 13 |
## Model Description
|
| 14 |
|
|
@@ -20,7 +20,7 @@ TripoSG utilizes a novel architecture combining:
|
|
| 20 |
- Cross-attention mechanism for image feature condition
|
| 21 |
- 1.5B parameters operating on 2048 latent tokens
|
| 22 |
|
| 23 |
-
|
| 24 |
- TripoSG-scribble is a CFG-distilled model and should be used with CFG=0
|
| 25 |
- TripoSG-scribble is trained with 512 latent tokens
|
| 26 |
|
|
|
|
| 6 |
- 3d-generation
|
| 7 |
- rectified-flow
|
| 8 |
---
|
| 9 |
+
# TripoSG-scribble - Fast 3D Shape Prototyping with Scribble and Prompt
|
| 10 |
|
| 11 |
+
TripoSG-scribble converts a scribble image and a text prompt to a 3D shape. TripoSG-scribble is a variant of TripoSG. TripoSG is a state-of-the-art image-to-3D generation foundation model that leverages large-scale rectified flow transformers to produce high-fidelity 3D shapes from single images.
|
| 12 |
|
| 13 |
## Model Description
|
| 14 |
|
|
|
|
| 20 |
- Cross-attention mechanism for image feature condition
|
| 21 |
- 1.5B parameters operating on 2048 latent tokens
|
| 22 |
|
| 23 |
+
For inference efficiency, TripoSG-scribble is different from TripoSG in:
|
| 24 |
- TripoSG-scribble is a CFG-distilled model and should be used with CFG=0
|
| 25 |
- TripoSG-scribble is trained with 512 latent tokens
|
| 26 |
|