PseudoTerminal X
commited on
Commit
·
7a7ca6a
1
Parent(s):
4c5a6d7
Update README.md
Browse files
README.md
CHANGED
@@ -13,6 +13,14 @@ library_name: diffusers
|
|
13 |
<!-- Provide a quick summary of what the model is/does. [Optional] -->
|
14 |
stable-diffusion-2-1 (stabilityai/stable-diffusion-2-1) finetuned with different aspect ratios, into a photography model (ptx0/pseudo-real-beta).
|
15 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
16 |
## Background
|
17 |
|
18 |
The `ptx0/pseudo-real-beta` pretrained checkpoint had its unet trained for 4,200 steps and its text encoder trained for 15,600 steps at a batch size of 15 with 10 gradient accumulations, on a diverse dataset:
|
@@ -117,16 +125,7 @@ For usage, see - [How to Get Started with the Model](#how-to-get-started-with-th
|
|
117 |
1. Generated images looks like they are cropped from a larger image.
|
118 |
|
119 |
2. Generating non-square images creates weird results, due to the model being trained on square images.
|
120 |
-
Examples: (WIP)
|
121 |
|
122 |
-
<!--
|
123 |
-
| resolution | model | stable diffusion | flex diffusion |
|
124 |
-
|:---------------:|:-------:|:----------------------------:|:-----------------------------:|
|
125 |
-
| 576x1024 (9:16) | v2-1 |  |  |
|
126 |
-
| 576x1024 (9:16) | v2-base |  |  |
|
127 |
-
| 1024x576 (16:9) | v2-1 |  |  |
|
128 |
-
| 1024x576 (16:9) | v2-base |  |  |
|
129 |
-
-->
|
130 |
|
131 |
### Limitations:
|
132 |
1. It's trained on a small dataset, so its improvements may be limited.
|
|
|
13 |
<!-- Provide a quick summary of what the model is/does. [Optional] -->
|
14 |
stable-diffusion-2-1 (stabilityai/stable-diffusion-2-1) finetuned with different aspect ratios, into a photography model (ptx0/pseudo-real-beta).
|
15 |
|
16 |
+
|
17 |
+
<!--
|
18 |
+
| resolution | model | stable diffusion | pseudo-flex | realism-engine |
|
19 |
+
|:---------------:|:-------:|:----------------------------:|:-----------------------------:|:-------------------------------:
|
20 |
+
| 576x1024 (9:16) | v2-1 |  |  |  |
|
21 |
+
| 1280x720 (16:9) | v2-1 |  |  |  |
|
22 |
+
|
23 |
+
-->
|
24 |
## Background
|
25 |
|
26 |
The `ptx0/pseudo-real-beta` pretrained checkpoint had its unet trained for 4,200 steps and its text encoder trained for 15,600 steps at a batch size of 15 with 10 gradient accumulations, on a diverse dataset:
|
|
|
125 |
1. Generated images looks like they are cropped from a larger image.
|
126 |
|
127 |
2. Generating non-square images creates weird results, due to the model being trained on square images.
|
|
|
128 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
129 |
|
130 |
### Limitations:
|
131 |
1. It's trained on a small dataset, so its improvements may be limited.
|