[![](https://img.shields.io/static/v1?label=Sponsor&message=%E2%9D%A4&logo=GitHub&color=%23fe8e86)](https://github.com/sponsors/vladmandic)
![Last Commit](https://img.shields.io/github/last-commit/vladmandic/human?style=flat-square&svg=true)
![License](https://img.shields.io/github/license/vladmandic/human?style=flat-square&svg=true)
![GitHub Status Checks](https://img.shields.io/github/checks-status/vladmandic/human/main?style=flat-square&svg=true)

# Stable Diffusion - Automatic

*Heavily opinionated custom fork of* <https://github.com/AUTOMATIC1111/stable-diffusion-webui>

Fork is kept as up-to-date with origin as time allows
All code changes are merged upstream whenever possible

The idea behind the fork is to enable latest technologies and advances in text-to-image generation
*Sometimes this is not the same as "as simple as possible to use"*
If you are looking for an amazing simple-to-use Stable Diffusion tool, I'd suggest [InvokeAI](https://invoke-ai.github.io/InvokeAI/) specifically due to its automated installer and ease of use

<br>

### Follow [Development updates](https://github.com/vladmandic/automatic/discussions/99) for daily updates on new features/fixes

<br>

![screenshot](javascript/black-orange.jpg)

<br>

## Notes

### Fork does differ in a few things

- New installer
- Advanced CUDA tuning
  Available in UI Settings
- Advanced environment tuning
- Optimized startup and models lazy-loading
- Built-in performance profiler
- Updated libraries to latest known compatible versions
- Includes opinionated **System** and **Options** configuration
- Does not rely on `Accelerate` as it only affects distributed systems
  Gradio web server is initialized much earlier while model load is done in the background
  Faster model loading plus ability to fallback on corrupt models
- Uses simplified folder structure
  e.g. `/train`, `/outputs/*`, `/models/*`, etc.
- Enhanced training templates
- Built-in `LoRA`, `LyCORIS`, `Custom Diffusion`, `Dreambooth` training
- Majority of settings configurable via UI without the need for command line flags
  e.g. cross-optimization methods, system folders, etc.
- New logger
- New error and exception handlers

### Optimizations

- Optimized for `Torch` 2.0
- Runs with `SDP` memory attention enabled by default if supported by system
  *Note*: `xFormers` and other cross-optimization methods are still available
- Auto-adjust parameters when running on **CPU** or **CUDA**
  *Note:* AMD and M1 platforms are supported, but without out-of-the-box optimizations
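
A minimal sketch of how the checks above could look, assuming `SDP` refers to Torch 2.0's `torch.nn.functional.scaled_dot_product_attention`. The helper names `sdp_attention_available` and `default_device` are hypothetical and are not taken from this fork's code:

```python
def sdp_attention_available() -> bool:
    """Return True if the installed Torch exposes scaled dot product attention."""
    try:
        import torch.nn.functional as F
    except (ImportError, OSError):
        # Torch is missing or failed to load: no SDP attention
        return False
    # The function was added in Torch 2.0; older versions do not have it
    return hasattr(F, "scaled_dot_product_attention")


def default_device() -> str:
    """Return "cuda" when a CUDA device is usable, otherwise "cpu"."""
    try:
        import torch
    except (ImportError, OSError):
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


if __name__ == "__main__":
    print("SDP attention available:", sdp_attention_available())
    print("Default device:", default_device())
```

Both helpers fall back to the conservative answer when Torch is not installed, so the script can be run before the fork's own installer has set anything up.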

### Integrated Extensions

Hand-picked list of extensions that are deeply integrated into core workflows:

- [System Info](https://github.com/vladmandic/sd-extension-system-info)
- [ControlNet](https://github.com/Mikubill/sd-webui-controlnet)
- [Image Browser](https://github.com/AlUlkesh/stable-diffusion-webui-images-browser)
- [LoRA](https://github.com/kohya-ss/sd-scripts) *(both training and inference)*
- [LyCORIS](https://github.com/KohakuBlueleaf/LyCORIS) *(both training and inference)*
- [Model Converter](https://github.com/Akegarasu/sd-webui-model-converter)
- [CLIP Interrogator](https://github.com/pharmapsychotic/clip-interrogator-ext)
- [Dynamic Thresholding](https://github.com/mcmonkeyprojects/sd-dynamic-thresholding)
- [Steps Animation](https://github.com/vladmandic/sd-extension-steps-animation)
- [Seed Travel](https://github.com/yownas/seed_travel)
- [Multi-Diffusion Upscaler](https://github.com/pkuliyi2015/multidiffusion-upscaler-for-automatic1111)

### User Interface

- Includes updated **UI**: reskinned and reorganized
  Black and orange dark theme with fixed width options panels and larger previews
- Includes support for **Gradio themes**
  *Settings* -> *User interface* -> *UI theme*
  Link to themes list & previews: <https://huggingface.co/spaces/gradio/theme-gallery>

### Removed

- Drops compatibility with older versions of `python` and requires **3.9** or **3.10**
- Drops localizations

### Integrated CLI/API tools

Fork adds extra functionality:

- New skin and UI layout
- Ships with set of **CLI** tools that rely on *SD API* for execution:
  e.g. `generate`, `train`, `bench`, etc.
  [Full list](<cli/>)

<br>

## Install

1. Install first:
**Python** & **Git**
2. If you have an NVIDIA GPU, install the NVIDIA CUDA toolkit:
<https://developer.nvidia.com/cuda-downloads>
3. Clone repository
`git clone https://github.com/vladmandic/automatic`
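
Before cloning, the prerequisites can be verified with a short script. This is only a sketch: the supported versions come from the **Removed** section above (Python **3.9** or **3.10**), while the helper names `python_supported` and `missing_prerequisites` are hypothetical and not part of this repository:

```python
import shutil
import sys

# Python versions this fork states it requires
SUPPORTED_PYTHON = {(3, 9), (3, 10)}


def python_supported(version=None) -> bool:
    """Check a (major, minor, ...) version tuple; defaults to the running interpreter."""
    major, minor = (version or sys.version_info)[:2]
    return (major, minor) in SUPPORTED_PYTHON


def missing_prerequisites(version=None, which=shutil.which) -> list:
    """Return the names of prerequisites that were not found."""
    problems = []
    if not python_supported(version):
        problems.append("python")
    # shutil.which returns None when the executable is not on PATH
    if which("git") is None:
        problems.append("git")
    return problems


if __name__ == "__main__":
    print(missing_prerequisites() or "all prerequisites found")
```

The CUDA toolkit is left out of the check on purpose, since it only applies to systems with an NVIDIA GPU.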

## Run

Run desired startup script to install dependencies and extensions and start server:

- `webui.bat` and `webui.sh`:
  Platform-specific wrapper scripts for Windows, Linux and OSX
  Starts `launch.py` in a Python virtual environment (venv)
  *Note*: Server can run without virtual environment, but it is recommended to use it to avoid library version conflicts with other applications
  **If you're unsure which launcher to use, this is the one you want**
- `launch.py`:
  Main startup script
  Can be used directly to start server in a manually activated `venv` or to run server without `venv`
- `setup.py`:
  Main installer, used by `launch.py`
  Can also be used directly to update repository or extensions
  If running manually, make sure to activate `venv` first (if used)
- `webui.py`:
  Main server script

Any of the above scripts can be used with `--help` to display detailed usage information and available parameters
For example:
> `webui.bat --help`

Full startup sequence is logged in `setup.log`, so if you encounter any issues, please check it first
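
For scripted startup, the choice of wrapper script can be sketched as below. The script names and the `--help` and `--upgrade` flags come from this README. The `launcher_command` helper is hypothetical, and it assumes the command is run from the repository root:

```python
import platform


def launcher_command(flags=(), system=None) -> list:
    """Build the argument list for the platform-specific wrapper script."""
    system = system or platform.system()
    # webui.bat on Windows, webui.sh on Linux and OSX
    script = "webui.bat" if system == "Windows" else "./webui.sh"
    return [script, *flags]


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch has no side effects
    print(" ".join(launcher_command(["--help"])))
```

The resulting list could be passed to `subprocess.run(..., check=True)` to start the server from another Python program.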

## Update

The launcher can perform automatic update of main repository, requirements, extensions and submodules:

- **Main repository**:
  Update is *not* performed by default, enable with `--upgrade` flag
- **Requirements**:
  Check is performed on each startup and missing requirements are auto-installed
  Can be skipped with `--skip-requirements` flag
- **Extensions and submodules**:
  Update is performed on each startup and installer for each extension is started
  Can be skipped with `--skip-extensions` flag
- **Quick mode**: Automatically enabled if timestamp of last successful setup is newer than actual repository version or version of newest extension
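
The quick mode rule can be illustrated with a small sketch. It is not the installer's actual code: `quick_mode` is a hypothetical function, timestamps are assumed to be plain numbers such as seconds since epoch, and the rule is read as "last setup is newer than both the repository and the newest extension":

```python
def quick_mode(last_setup, repo_version, extension_versions=()) -> bool:
    """Return True when nothing has changed since the last successful setup."""
    # The newest change among the repository and all extensions
    newest_change = max([repo_version, *extension_versions])
    return last_setup > newest_change


if __name__ == "__main__":
    # Setup finished at t=200, repository changed at t=100, one extension at t=150
    print(quick_mode(200.0, 100.0, [150.0]))
```

With this reading, a single extension updated after the last setup is enough to disable quick mode and trigger the full checks again.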

<br>

## Other

### Scripts

This repository comes with a large collection of scripts that can be used to process inputs, train, generate, and benchmark models
It also includes a number of auxiliary scripts that do not rely on **WebUI**, but can be used for end-to-end solutions such as extracting frames from videos
For full details see [Docs](cli/README.md)

<br>

### Docs

- Scripts are in [Scripts](cli/README.md)
- Everything else is in [Wiki](https://github.com/vladmandic/automatic/wiki)
- Except my current [TODO](TODO.md)

<br>