Commit 88a3192 by Alon77777 · 1 Parent(s): 5c9b4ab

Update README.md

  1. README.md +6 -6
README.md CHANGED
@@ -8,7 +8,7 @@ We have trained the first multilingual Stable Diffusion (SD) model that supports
 
  As shown in Figure 1, the training process consists of two stages: concept alignment and quality improvement. We first replaced the original OpenCLIP in SD with the multilingual CLIP AltCLIP-m18 and froze its parameters. In the first stage, we trained the k,v matrices in the CrossAttention layer of the Unet model to align the concepts between text and image using 256\*256 image resolution. In the second stage, we trained all the parameters in the Unet model to improve the generation performance using 512\*512 image resolution.
 
- <img src="/imgs/model.png" alt="illustrate for AltDiffusion" style="zoom:35%;" />
+ <img src="./imgs/model.png" alt="illustrate for AltDiffusion" style="zoom:35%;" />
 
  <center>
  Fig. 1: Illustration of AltDiffusion
@@ -33,15 +33,15 @@ checkpoint we used is SD v2.1 512-base-ema. We also use Xformer and Efficient At
  ## Results
 
  ### Results in 18 languages
- ![boy](/imgs/boy.SVG)
+ ![boy](./imgs/boy.SVG)
 
- ![corgi_dog](/imgs/corgi_dog.SVG)
+ ![corgi_dog](./imgs/corgi_dog.SVG)
 
  ### Chinese results
 
- <img src="/imgs/chinese_samples.png" alt="chinese_samples" style="zoom:85%;" />
+ <img src="./imgs/chinese_samples.png" alt="chinese_samples" style="zoom:85%;" />
 
  ### Long-image results
- ![long1](/imgs/long1.SVG)
+ ![long1](./imgs/long1.SVG)
 
- ![long2](/imgs/long2.SVG)
+ ![long2](./imgs/long2.SVG)
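
Note on the path change: all six edited lines swap a root-relative image path (`/imgs/...`) for a document-relative one (`./imgs/...`). The sketch below shows how the two forms resolve under standard URL reference resolution. The page URL is a hypothetical placeholder, not the real location of this README, and how the hosting site rewrites relative links when rendering is not shown in this commit.

```python
from urllib.parse import urljoin

# Hypothetical page URL standing in for wherever README.md is rendered.
README_URL = "https://example.org/some-org/some-repo/blob/main/README.md"


def resolve(src: str) -> str:
    """Resolve an image reference against README_URL using standard URL rules."""
    return urljoin(README_URL, src)


# Root-relative: the whole path is replaced, starting from the host root.
old = resolve("/imgs/model.png")

# Document-relative: resolved against the directory that contains README.md.
new = resolve("./imgs/model.png")

print(old)
print(new)
```

Under these assumptions, the old form points at the host root and ignores the repository path, while the new form stays inside the directory that holds the README, which is presumably where the `imgs/` folder lives.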