update changelog/todo

Signed-off-by: vladmandic <mandic00@live.com>
pull/4696/head
vladmandic 2026-03-20 18:06:27 +01:00
parent 93947d3901
commit 79bf0f8c1d
2 changed files with 16 additions and 5 deletions

View File

@ -61,7 +61,7 @@ But also many smaller quality-of-life improvements - for full details, see [Chan
- **localization** improved translation quality and new translations locales:
*en, en1, en2, en3, en4, hr, es, it, fr, de, pt, ru, zh, ja, ko, hi, ar, bn, ur, id, vi, tr, sr, po, he, xx, yy, qq, tlh*
yes, this now includes stuff like *latin, esperanto, arabic, hebrew, klingon* and a lot more!
and also intruce some pseudo-locales such as: *techno-babbel*, *for-n00bs*
and also introduce some pseudo-locales such as: *techno-babbel*, *for-n00bs*
*hint*: click on locale icon in bottom-left corner to cycle through available locales, or set default in *settings -> ui*
- **server settings** new section in *settings*
- **kanvas** add paste image from clipboard
@ -97,7 +97,7 @@ But also many smaller quality-of-life improvements - for full details, see [Chan
- refactor: move `rebmg` to core instead of extensions
- remove face restoration
- unified command line parsing
- use explcit image references in `gallery`, thanks @awsr
- use explicit icon image references in `gallery`, thanks @awsr
- launch use threads to async execute non-critical tasks
- switch from deprecated `pkg_resources` to `importlib`
- modernize typing and type annotations

17
TODO.md
View File

@ -2,12 +2,18 @@
## Release
- Release: bumb packages
- Release: implement `unload_auxiliary_models`
- Update **README**
- Bumb packages
- Implement `unload_auxiliary_models`
- Release **Launcher**
- Release **Enso**
- Update **ROCm**
- Tips **Color Grading**
## Internal
- Feature: Color grading in processing
- Feature: RIFE update
- Feature: RIFE in processing
- Feature: SeedVR2 in processing
- Deploy: Lite vs Expert mode
@ -47,6 +53,7 @@
TODO: Investigate which models are diffusers-compatible and prioritize!
### Image-Base
- [Chroma Zeta](https://huggingface.co/lodestones/Zeta-Chroma): Image and video generator for creative effects and professional filters
- [Chroma Radiance](https://huggingface.co/lodestones/Chroma1-Radiance): Pixel-space model eliminating VAE artifacts for high visual fidelity
- [Liquid](https://github.com/FoundationVision/Liquid): Unified vision-language auto-regressive generation paradigm
@ -56,6 +63,7 @@ TODO: Investigate which models are diffusers-compatible and prioritize!
- [Lumina-DiMOO](https://huggingface.co/Alpha-VLLM/Lumina-DiMOO): foundational multi-modal multi-task generation and understanding
### Image-Edit
- [Meituan LongCat-Image-Edit-Turbo](https://huggingface.co/meituan-longcat/LongCat-Image-Edit-Turbo):6B instruction-following image editing with high visual consistency
- [VIBE Image-Edit](https://huggingface.co/iitolstykh/VIBE-Image-Edit): (Sana+Qwen-VL)Fast visual instruction-based image editing framework
- [LucyEdit](https://github.com/huggingface/diffusers/pull/12340):Instruction-guided video editing while preserving motion and identity
@ -64,6 +72,7 @@ TODO: Investigate which models are diffusers-compatible and prioritize!
- [ByteDance DreamO](https://huggingface.co/ByteDance/DreamO): image customization framework for IP adaptation and virtual try-on
### Video
- [OpenMOSS MOVA](https://huggingface.co/OpenMOSS-Team/MOVA-720p): Unified foundation model for synchronized high-fidelity video and audio
- [Wan family (Wan2.1 / Wan2.2 variants)](https://huggingface.co/Wan-AI/Wan2.2-Animate-14B): MoE-based foundational tools for cinematic T2V/I2V/TI2V
example: [Wan2.1-T2V-14B-CausVid](https://huggingface.co/lightx2v/Wan2.1-T2V-14B-CausVid)
@ -86,6 +95,7 @@ TODO: Investigate which models are diffusers-compatible and prioritize!
- [Ming (inclusionAI)](https://github.com/inclusionAI/Ming): Unified multimodal model for processing text, audio, image, and video
### Other/Unsorted
- [DiffusionForcing](https://github.com/kwsong0113/diffusion-forcing-transformer): Full-sequence diffusion with autoregressive next-token prediction
- [Self-Forcing](https://github.com/guandeh17/Self-Forcing): Framework for improving temporal consistency in long-horizon video generation
- [SEVA](https://github.com/huggingface/diffusers/pull/11440): Stable Virtual Camera for novel view synthesis and 3D-consistent video
@ -102,6 +112,7 @@ TODO: Investigate which models are diffusers-compatible and prioritize!
- [ReNO](https://github.com/ExplainableML/ReNO): Reward-based Noise Optimization to improve text-to-image quality during inference
### Not Planned
- [Bria FIBO](https://huggingface.co/briaai/FIBO): Fully JSON based
- [Bria FiboEdit](https://github.com/huggingface/diffusers/commit/d7a1c31f4f85bae5a9e01cdce49bd7346bd8ccd6): Fully JSON based
- [LoRAdapter](https://github.com/CompVis/LoRAdapter): Not recently updated
@ -117,7 +128,7 @@ TODO: Investigate which models are diffusers-compatible and prioritize!
## Code TODO
> npm run todo
- fc: autodetect distilled based on model
- fc: autodetect tensor format based on model
- hypertile: vae breaks when using non-standard sizes