68 lines
2.3 KiB
Markdown
68 lines
2.3 KiB
Markdown
# IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models
|
|
|
|
<div align="center">
|
|
|
|
[**Project Page**](https://ip-adapter.github.io) **|** [**Paper (ArXiv)**](https://arxiv.org/abs/2308.06721)
|
|
</div>
|
|
|
|
---
|
|
|
|
|
|
## Introduction
|
|
|
|
we present IP-Adapter, an effective and lightweight
|
|
adapter to achieve image prompt capability for the pretrained
|
|
text-to-image diffusion models. An IP-Adapter
|
|
with only 22M parameters can achieve comparable or even
|
|
better performance to a fine-tuned image prompt model. IPAdapter
|
|
can be generalized not only to other custom models
|
|
fine-tuned from the same base model, but also to controllable
|
|
generation using existing controllable tools. Moreover, the image prompt
|
|
can also work well with the text prompt to accomplish multimodal
|
|
image generation.
|
|
|
|

|
|
|
|
## Release
|
|
- [2023/8/16] 🔥 We release the code and models.
|
|
|
|
|
|
## Dependencies
|
|
- diffusers >= 0.19.3
|
|
|
|
## Download Models
|
|
|
|
you can download models from [here](https://huggingface.co/h94/IP-Adapter). To run the demo, you should also download the following models:
|
|
- [runwayml/stable-diffusion-v1-5](https://huggingface.co/runwayml/stable-diffusion-v1-5)
|
|
- [stabilityai/sd-vae-ft-mse](https://huggingface.co/stabilityai/sd-vae-ft-mse)
|
|
- [SG161222/Realistic_Vision_V4.0_noVAE](https://huggingface.co/SG161222/Realistic_Vision_V4.0_noVAE)
|
|
|
|
## How to Use
|
|
|
|
- [**ip_adapter_demo**](ip_adapter_demo.ipynb): image variations, image-to-image, and inpainting with image prompt.
|
|
|
|

|
|
|
|

|
|
|
|

|
|
- [**ip_adapter_controlnet_demo**](ip_adapter_controlnet_demo.ipynb): structural generation with image prompt.
|
|
|
|

|
|
|
|
- [**ip_adapter_multimodal_prompts_demo**](ip_adapter_multimodal_prompts_demo.ipynb): generation with multimodal prompts.
|
|
|
|

|
|
|
|
|
|
## Citation
|
|
If you find IP-Adapter useful for your your research and applications, please cite using this BibTeX:
|
|
```bibtex
|
|
@article{ye2023ip-adapter,
|
|
title={IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models},
|
|
author={Ye, Hu and Zhang, Jun and Liu, Sibo and Han, Xiao and Yang, Wei},
|
|
booktitle={arXiv preprint arxiv:2308.06721},
|
|
year={2023}
|
|
}
|
|
```
|