mirror of https://github.com/vladmandic/automatic
5.9 KiB
5.9 KiB
Publish extensions
sd-extension-aesthetic-scorersd-extension-steps-animationsd-extension-system-info
Investigating
Things I'm actively investigating...
Embeddings
Need to study more to determine best out-of-the-box settings:
- Train object
- Train style
- Impact of Hires fix:
- e.g 25 steps and denoising strength 0.25-0.7
- Impact of non-square target resolution
Hypernetworks
LORA
Ideas
Things I'm looking into...
Automatic
Stuff to be fixed...
- Reconnect WebUI
Tech that can be integrated as part of the core workflow...
- Custom watermark
- Embedding mixing
- DAAM
- Merge without distortion
- Weighted merges
- Prune models
- Use scripts from API
- Aesthetic gradients
- Face swap
- Custom diffusion
- LORA
- Hypernetworks
- Null-text inversion
- Custom diffusion
- Dream artist
Video Generation
- Deforum
- Latent blending
- VIDM: video implicit diffusion models
- Tune-a-Video
- Animator extension
- Prompt travel
- KLMC2 animation
- BOAAB-limit animation https://www.ajayjain.net/journey
- Disco diffusion
- Video killed the radio star
- Seed travel
- Google frame interpolation
- Prompt fusion
Experimental
Cool stuff that is not integrated anywhere...
- TensorRT
- GIT
- Bunch of stuff:https://pharmapsychotic.com/tools.html
- Prevalent colors to interrogate
- Auto-Sort inputs by face recognition
Updates
- core library updates:
- must run
./automatic.sh install - note: this is quite a big one so some testing is reccomended after upgrade
- must run
- non-trivial ui updates
- renamed scripts in
cli/modulesto be more descriptive
if you're using old script names, update them
for example,ffmpeg.pyis nowvideo-extract.py
also possible that there are some bugs due to broken import paths, so testing is welcome - updated script
process.py- new brightness dynamic range check
- new preview mode to run all checks but without saving images plus print a summary at the end
- updated scripts
models-preview.py- can generate lora previews, note that trigger keywords are inferred from model name so name models carefully
- can generate hypernetwork previews
- new script:
image-watermark.py- optionally strip exif from images
- add invisible watermark to images which persists even if user modifies image so we can always track it
- new script:
palette-extract.py- creates color palette wheel from image(s)
- new script:
extract-lora.py- extract lora from fine-tuned model
- updated
embedding-preview.py- skip existing previews or overwrite them
- expose variation seed in main ui
- integrated seed travel functionality into core
- integrated
pix2pixfunctionality to standardimg2imgworkflow- note: requires pix2pix model to be loaded
- integrated large
cfg scalevalues fix - integrated model converter
- tested
aesthetic gradientstraining, not worth it - updated
image browser
was broken for a while and maintainer is gone - initial work on queue management allowing to submit multiple requests to server
- initial work on
loraintegration
can render loras without extensions
can extract lora from fine-tuned model
training prototype in place intrain-lora.py, not optimized or integrated - initial work on
custom diffusionintegration
no testing so far - spent quite some time making stable-diffusion compatible with upcomming
pytorch2.0 release
and testingdynamotorch dynamic optimizer andtritonscript compiler