paper accepted to ICCV 2025 🎉
README.md
CHANGED
@@ -35,7 +35,8 @@ This repository provides the official models for the following paper:
 [Zichuan Liu](https://scholar.google.com/citations?user=-H18WY8AAAAJ),
 [Hao Kang](https://scholar.google.com/citations?user=VeTCSyEAAAAJ),
 [Xin Lu](https://scholar.google.com/citations?user=mFC0wp8AAAAJ)<br />
-ByteDance Intelligent Creation
+ByteDance Intelligent Creation<br />
+**ICCV 2025**
 
 > **Abstract:** Achieving flexible and high-fidelity identity-preserved image generation remains formidable, particularly with advanced Diffusion Transformers (DiTs) like FLUX. We introduce **InfiniteYou (InfU)**, one of the earliest robust frameworks leveraging DiTs for this task. InfU addresses significant issues of existing methods, such as insufficient identity similarity, poor text-image alignment, and low generation quality and aesthetics. Central to InfU is InfuseNet, a component that injects identity features into the DiT base model via residual connections, enhancing identity similarity while maintaining generation capabilities. A multi-stage training strategy, including pretraining and supervised fine-tuning (SFT) with synthetic single-person-multiple-sample (SPMS) data, further improves text-image alignment, ameliorates image quality, and alleviates face copy-pasting. Extensive experiments demonstrate that InfU achieves state-of-the-art performance, surpassing existing baselines. In addition, the plug-and-play design of InfU ensures compatibility with various existing methods, offering a valuable contribution to the broader community.
 
@@ -96,11 +97,10 @@ This research aims to positively impact the field of Generative AI. Any usage of
 If you find InfiniteYou useful for your research or applications, please cite our paper:
 
 ```bibtex
-@
+@inproceedings{jiang2025infiniteyou,
 title={{InfiniteYou}: Flexible Photo Recrafting While Preserving Your Identity},
 author={Jiang, Liming and Yan, Qing and Jia, Yumin and Liu, Zichuan and Kang, Hao and Lu, Xin},
-
-volume={arXiv:2503.16418},
+booktitle={ICCV},
 year={2025}
 }
 ```