SD.Next: Advanced Implementation of Stable Diffusion

a1111-webui generative-art img2img sd-xl sdnext stable-diffusion stable-diffusion-webui txt2img

Go to file

Vladimir Mandic f2a85858fd Merge pull request #2986 from vladmandic/dev merge dev to master		2024-03-19 10:45:03 -04:00
.github	update git action	2024-02-22 14:09:01 -05:00
.vscode	cleanup project lookup paths	2024-03-02 13:42:11 -05:00
cli	fix ipadapter batch runs	2024-03-15 09:25:14 -04:00
configs	add support for json configs per model component	2024-02-28 15:12:05 -05:00
extensions	add placeholders	2023-01-06 12:15:25 -05:00
extensions-builtin	fix ipadapter batch runs	2024-03-15 09:25:14 -04:00
html	ledits fix image resize	2024-03-14 09:53:01 -04:00
javascript	js linting	2024-03-18 22:14:28 -04:00
models	full refactor of reference models	2024-03-12 23:51:36 -04:00
modules	RealESRGAN use devices.device_esrgan	2024-03-18 23:02:36 +03:00
repositories	fix interrogate api	2024-02-10 09:13:07 -05:00
scripts	inpaint mask checks	2024-03-15 09:56:43 -04:00
train/templates	Add pre-commit.	2023-09-04 13:57:47 +02:00
wiki@d855aabbc4	fix control inpaint	2024-03-18 09:49:00 -04:00
.eslintrc.json	enable swtiching built-in themes on-the-fly	2024-01-10 11:58:09 -05:00
.gitignore	Ignore dot-folders in root dir	2024-01-04 14:57:52 -05:00
.gitmodules	cleanup repo	2024-02-10 10:54:54 -05:00
.markdownlint.json	Add pre-commit.	2023-09-04 13:57:47 +02:00
.pre-commit-config.yaml	Add pre-commit.	2023-09-04 13:57:47 +02:00
.pylintrc	cleanup project lookup paths	2024-03-02 13:42:11 -05:00
CHANGELOG.md	update changelog and readme	2024-03-19 10:39:03 -04:00
CITATION.cff	add cff	2023-12-12 15:55:41 -05:00
LICENSE.txt	global crlf to lf	2024-01-10 09:45:26 -05:00
README.md	update changelog and readme	2024-03-19 10:39:03 -04:00
SECURITY.md	change onboarding and remove download default model	2023-12-26 13:06:14 -05:00
TODO.md	add masking api	2024-03-01 13:46:13 -05:00
installer.py	fix dict	2024-03-17 20:07:08 -04:00
launch.py	refactor stable-cascade, fix taesd bf16, add skip-env cmd flag	2024-02-21 17:13:53 -05:00
motd	add motd	2023-10-09 08:46:48 -04:00
package.json	diffusers img2img and inpaint	2023-07-22 15:39:41 -04:00
pyproject.toml	cleanup project lookup paths	2024-03-02 13:42:11 -05:00
requirements.txt	update diffusers and accelerate	2024-03-14 13:48:55 -04:00
webui.bat	global crlf to lf	2024-01-10 09:45:26 -05:00
webui.ps1	update startup scripts	2023-11-04 08:22:57 -04:00
webui.py	update todo and startup logging	2024-02-22 21:05:39 -05:00
webui.sh	Lower VENV_LIB priority	2024-01-13 20:18:42 +03:00

README.md

SD.Next

Stable Diffusion implementation with advanced features

Wiki | Discord | Changelog

Notable features

All individual features are not listed here, instead check ChangeLog for full list of changes

Multiple backends!
▹ Diffusers | Original
Multiple diffusion models!
▹ Stable Diffusion 1.5/2.1 | SD-XL | LCM | Segmind | Kandinsky | Pixart-α | Stable Cascade | Würstchen | aMUSEd | DeepFloyd IF | UniDiffusion | SD-Distilled | BLiP Diffusion | KOALA | etc.
Built-in Control for Text, Image, Batch and video processing!
▹ ControlNet | ControlNet XS | Control LLLite | T2I Adapters | IP Adapters
Multiplatform!
▹ Windows | Linux | MacOS with CPU | nVidia | AMD | IntelArc | DirectML | OpenVINO | ONNX+Olive | ZLUDA
Platform specific autodetection and tuning performed on install
Optimized processing with latest torch developments with built-in support for torch.compile
and multiple compile backends: Triton, ZLUDA, StableFast, DeepCache, OpenVINO, NNCF, IPEX
Improved prompt parser
Enhanced Lora/LoCon/Lyco code supporting latest trends in training
Built-in queue management
Enterprise level logging and hardened API
Built in installer with automatic updates and dependency management
Modernized UI with theme support and number of built-in themes (dark and light)

Main text2image interface:

For screenshots and informations on other available themes, see Themes Wiki

Backend support

SD.Next supports two main backends: Diffusers and Original:

Diffusers: Based on new Huggingface Diffusers implementation
Supports all models listed below
This backend is set as default for new installations
See wiki article for more information
Original: Based on LDM reference implementation and significantly expanded on by A1111
This backend and is fully compatible with most existing functionality and extensions written for A1111 SDWebUI
Supports SD 1.x and SD 2.x models
All other model types such as SD-XL, LCM, PixArt, Segmind, Kandinsky, etc. require backend Diffusers

Model support

Additional models will be added as they become available and there is public interest in them

RunwayML Stable Diffusion 1.x and 2.x (all variants)
StabilityAI Stable Diffusion XL
StabilityAI Stable Video Diffusion Base, XT 1.0, XT 1.1
LCM: Latent Consistency Models
Playground v1, v2 256, v2 512, v2 1024 and latest v2.5
Stable Cascade Full and Lite
aMUSEd 256 256 and 512
Segmind Vega
Segmind SSD-1B
Segmind SegMoE SD and SD-XL
Kandinsky 2.1 and 2.2 and latest 3.0
PixArt-α XL 2 Medium and Large
Warp Wuerstchen
Tsinghua UniDiffusion
DeepFloyd IF Medium and Large
ModelScope T2V
Segmind SD Distilled (all variants)
BLIP-Diffusion
KOALA 700M
VGen

Also supported are modifiers such as:

LCM and Turbo (adversarial diffusion distillation) networks
All LoRA types such as LoCon, LyCORIS, HADA, IA3, Lokr, OFT
IP-Adapters for SD 1.5 and SD-XL
InstantID, FaceSwap, FaceID, PhotoMerge
AnimateDiff for SD 1.5

Examples

IP Adapters:

Color grading:

InstantID:

[!IMPORTANT]

Loading any model other than standard SD 1.x / SD 2.x requires use of backend Diffusers

Loading any other models using Original backend is not supported

Loading manually download model .safetensors files is supported for specified models only (typically SD 1.x / SD 2.x / SD-XL models only)

For all other model types, use backend Diffusers and use built in Model downloader or
select model from Networks -> Models -> Reference list in which case it will be auto-downloaded and loaded

Platform support

nVidia GPUs using CUDA libraries on both Windows and Linux
AMD GPUs using ROCm libraries on Linux
Support will be extended to Windows once AMD releases ROCm for Windows
Intel Arc GPUs using OneAPI with IPEX XPU libraries on both Windows and Linux
Any GPU compatible with DirectX on Windows using DirectML libraries
This includes support for AMD GPUs that are not supported by native ROCm libraries
Any GPU or device compatible with OpenVINO libraries on both Windows and Linux
Apple M1/M2 on OSX using built-in support in Torch with MPS optimizations
ONNX/Olive

Install

Step-by-step install guide
Advanced install notes
Common installation errors
FAQ
If you can't run us locally, try our friends at RunDuffusion!

[!TIP]

Server can run with or without virtual environment,
Recommended to use VENV to avoid library version conflicts with other applications

nVidia/CUDA / AMD/ROCm / Intel/OneAPI are auto-detected if present and available,
For any other use case such as DirectML, ONNX/Olive, OpenVINO specify required parameter explicitly
or wrong packages may be installed as installer will assume CPU-only environment

Full startup sequence is logged in sdnext.log,
so if you encounter any issues, please check it first

Run

Once SD.Next is installed, simply run webui.ps1 or webui.bat (Windows) or webui.sh (Linux or MacOS)

List of available parameters, run webui --help for the full & up-to-date list:

Server options:
  --config CONFIG                                    Use specific server configuration file, default: config.json
  --ui-config UI_CONFIG                              Use specific UI configuration file, default: ui-config.json
  --medvram                                          Split model stages and keep only active part in VRAM, default: False
  --lowvram                                          Split model components and keep only active part in VRAM, default: False
  --ckpt CKPT                                        Path to model checkpoint to load immediately, default: None
  --vae VAE                                          Path to VAE checkpoint to load immediately, default: None
  --data-dir DATA_DIR                                Base path where all user data is stored, default:
  --models-dir MODELS_DIR                            Base path where all models are stored, default: models
  --allow-code                                       Allow custom script execution, default: False
  --share                                            Enable UI accessible through Gradio site, default: False
  --insecure                                         Enable extensions tab regardless of other options, default: False
  --use-cpu USE_CPU [USE_CPU ...]                    Force use CPU for specified modules, default: []
  --listen                                           Launch web server using public IP address, default: False
  --port PORT                                        Launch web server with given server port, default: 7860
  --freeze                                           Disable editing settings
  --auth AUTH                                        Set access authentication like "user:pwd,user:pwd""
  --auth-file AUTH_FILE                              Set access authentication using file, default: None
  --autolaunch                                       Open the UI URL in the system's default browser upon launch
  --docs                                             Mount API docs, default: False
  --api-only                                         Run in API only mode without starting UI
  --api-log                                          Enable logging of all API requests, default: False
  --device-id DEVICE_ID                              Select the default CUDA device to use, default: None
  --cors-origins CORS_ORIGINS                        Allowed CORS origins as comma-separated list, default: None
  --cors-regex CORS_REGEX                            Allowed CORS origins as regular expression, default: None
  --tls-keyfile TLS_KEYFILE                          Enable TLS and specify key file, default: None
  --tls-certfile TLS_CERTFILE                        Enable TLS and specify cert file, default: None
  --tls-selfsign                                     Enable TLS with self-signed certificates, default: False
  --server-name SERVER_NAME                          Sets hostname of server, default: None
  --no-hashing                                       Disable hashing of checkpoints, default: False
  --no-metadata                                      Disable reading of metadata from models, default: False
  --disable-queue                                    Disable queues, default: False
  --subpath SUBPATH                                  Customize the URL subpath for usage with reverse proxy
  --backend {original,diffusers}                     force model pipeline type
  --allowed-paths ALLOWED_PATHS [ALLOWED_PATHS ...]  add additional paths to paths allowed for web access

Setup options:
  --reset                                            Reset main repository to latest version, default: False
  --upgrade                                          Upgrade main repository to latest version, default: False
  --requirements                                     Force re-check of requirements, default: False
  --quick                                            Bypass version checks, default: False
  --use-directml                                     Use DirectML if no compatible GPU is detected, default: False
  --use-openvino                                     Use Intel OpenVINO backend, default: False
  --use-ipex                                         Force use Intel OneAPI XPU backend, default: False
  --use-cuda                                         Force use nVidia CUDA backend, default: False
  --use-rocm                                         Force use AMD ROCm backend, default: False
  --use-zluda                                        Force use ZLUDA, AMD GPUs only, default: False
  --use-xformers                                     Force use xFormers cross-optimization, default: False
  --skip-requirements                                Skips checking and installing requirements, default: False
  --skip-extensions                                  Skips running individual extension installers, default: False
  --skip-git                                         Skips running all GIT operations, default: False
  --skip-torch                                       Skips running Torch checks, default: False
  --skip-all                                         Skips running all checks, default: False
  --skip-env                                         Skips setting of env variables during startup, default: False
  --experimental                                     Allow unsupported versions of libraries, default: False
  --reinstall                                        Force reinstallation of all requirements, default: False
  --test                                             Run test only and exit
  --version                                          Print version information
  --ignore                                           Ignore any errors and attempt to continue
  --safe                                             Run in safe mode with no user extensions

Logging options:
  --log LOG                                          Set log file, default: None
  --debug                                            Run installer with debug logging, default: False
  --profile                                          Run profiler, default: False

Notes

Control

SD.Next comes with built-in control for all types of text2image, image2image, video2video and batch processing

Control interface:

Control processors:

Masking:

Extensions

SD.Next comes with several extensions pre-installed:

ControlNet (active in backend: original only)
Agent Scheduler
Image Browser

Collab

We'd love to have additional maintainers (with comes with full repo rights). If you're interested, ping us!
In addition to general cross-platform code, desire is to have a lead for each of the main platforms
This should be fully cross-platform, but we'd really love to have additional contributors and/or maintainers to join and help lead the efforts on different platforms

Credits

Main credit goes to Automatic1111 WebUI for original codebase
Additional credits are listed in Credits
Licenses for modules are listed in Licenses

Evolution

OSS Stats

Docs

If you're unsure how to use a feature, best place to start is Wiki and if its not there,
check ChangeLog for when feature was first introduced as it will always have a short note on how to use it

README.md Unescape Escape