OpenFace-CQUPT committed
Commit 9bf47fe · verified · 1 Parent(s): b5f75b0

Update README.md

Files changed (1)
  1. README.md +6 -4
README.md CHANGED
@@ -18,7 +18,9 @@ python pretrain.py > log.log
 
 ## Pre-trained Models
 
-We provide pretrained model weights for the [vit base version](https://huggingface.co/OpenFace-CQUPT/Facial-language-image-pretraining-model/tree/main/ckpt)
+We provide pretrained model weights :
+[FLIP Base](https://huggingface.co/OpenFace-CQUPT/Facial-language-image-pretraining-model/tree/main/ckpt)
+[FLIP Large] Coming soon......
 
 ## Datasets
 
@@ -29,19 +31,19 @@ Download the FaceCaption-15M dataset from [here](https://huggingface.co/dataset
 
 ### Task1: Text-Image Retrieval
 
-**Comparison with other classical pretrained models. All pretrained model backbones are frozen, with only the linear layer being fine-tuned. † represents the model pretrained on the LAION-Face [86] dataset; * represents the model pretrained on the FaceCaption dataset constructed without using LLM text generation.**
+**Table 1:** Comparison with other classical pretrained models. All pretrained model backbones are frozen, with only the linear layer being fine-tuned. † represents the model pretrained on the LAION-Face [86] dataset; * represents the model pretrained on the FaceCaption dataset constructed without using LLM text generation.
 
 ![](https://img.yutangli.net/img/202403181015142.png)
 
 ### Task2: Facial Attributes Prediction
 
-**Comparison with other classical models. † represents the model pre-trained on the original LAION-Face dataset.**
+**Table 2:** Comparison with other classical models. † represents the model pre-trained on the original LAION-Face dataset.
 
 ![image-20240318101126897](https://img.yutangli.net/img/202403181011115.png)
 
 ### Task3: Sketch Less Facial Image Retrieval
 
-**Comparative results with different baseline methods. † represents the model pre-trained on the LAION-Face dataset.**
+**Table 3:** Comparative results with different baseline methods. † represents the model pre-trained on the LAION-Face dataset.
 
 ![image-20240318101633671](https://img.yutangli.net/img/202403181016876.png)
 