bghira
/

pseudo-flex-base

StableDiffusionPipeline

stable-diffusion

stable-diffusion-2-1

Model card Files Files and versions Community

PseudoTerminal X commited on Jun 26, 2023

Commit

7a7ca6a

·

1 Parent(s): 4c5a6d7

Update README.md

Files changed (1) hide show

README.md +8 -9

README.md CHANGED Viewed

@@ -13,6 +13,14 @@ library_name: diffusers
 <!-- Provide a quick summary of what the model is/does. [Optional] -->
 stable-diffusion-2-1 (stabilityai/stable-diffusion-2-1) finetuned with different aspect ratios, into a photography model (ptx0/pseudo-real-beta).
 ## Background
 The `ptx0/pseudo-real-beta` pretrained checkpoint had its unet trained for 4,200 steps and its text encoder trained for 15,600 steps at a batch size of 15 with 10 gradient accumulations, on a diverse dataset:
@@ -117,16 +125,7 @@ For usage, see - [How to Get Started with the Model](#how-to-get-started-with-th
 1. Generated images looks like they are cropped from a larger image.
 2. Generating non-square images creates weird results, due to the model being trained on square images.
-Examples: (WIP)
-<!--
-| resolution      | model   |   stable diffusion           |   flex diffusion              |
-|:---------------:|:-------:|:----------------------------:|:-----------------------------:|
-| 576x1024 (9:16) | v2-1    | ![img](imgs/21-576-1024.png) | ![img](imgs/21f-576-1024.png) |
-| 576x1024 (9:16) | v2-base | ![img](imgs/2b-576-1024.png) | ![img](imgs/2bf-576-1024.png) |
-| 1024x576 (16:9) | v2-1    | ![img](imgs/21-1024-576.png) | ![img](imgs/21f-1024-576.png) |
-| 1024x576 (16:9) | v2-base | ![img](imgs/2b-1024-576.png) | ![img](imgs/2bf-1024-576.png) |
--->
 ### Limitations:
 1. It's trained on a small dataset, so its improvements may be limited.

 <!-- Provide a quick summary of what the model is/does. [Optional] -->
 stable-diffusion-2-1 (stabilityai/stable-diffusion-2-1) finetuned with different aspect ratios, into a photography model (ptx0/pseudo-real-beta).
+<!--
+| resolution      | model   |   stable diffusion           |   pseudo-flex                 |   realism-engine               |
+|:---------------:|:-------:|:----------------------------:|:-----------------------------:|:-------------------------------:
+| 576x1024 (9:16) | v2-1    | ![img](imgs/21-576-1024.png) | ![img](imgs/21f-576-1024.png) | ![img](imgs/ellen-flex.png)    |
+| 1280x720 (16:9) | v2-1    | ![img](imgs/ellen-base.png)  | ![img](imgs/ellen-flex.png)   | ![img](imgs/ellen-realism.png) |
+-->
 ## Background
 The `ptx0/pseudo-real-beta` pretrained checkpoint had its unet trained for 4,200 steps and its text encoder trained for 15,600 steps at a batch size of 15 with 10 gradient accumulations, on a diverse dataset:
 1. Generated images looks like they are cropped from a larger image.
 2. Generating non-square images creates weird results, due to the model being trained on square images.
 ### Limitations:
 1. It's trained on a small dataset, so its improvements may be limited.