Commit Graph

7327 Commits (10bbbed218458b8a899aac2140ec738d8d716f05)

Author SHA1 Message Date
Vladimir Mandic 9389aa710a video implement sampler shift
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-17 14:43:41 -04:00
Vladimir Mandic f5711d0f90 fix hf downloader
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-17 13:42:28 -04:00
Seunghoon Lee 552c223569 use driver library, more checks for windows rocm 2025-10-18 01:17:30 +09:00
Disty0 f12caf81f9 SDNQ skip bad layers on svd and fix svd with dequantize_fp32 2025-10-17 17:25:50 +03:00
Vladimir Mandic e66d78f556 pre-merge cleanup
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-17 08:53:26 -04:00
Vladimir Mandic 4f336d3aab linting
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-16 19:39:05 -04:00
Vladimir Mandic a4bc61919d add explicit sync for vae preview
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-16 19:25:12 -04:00
Disty0 2cf9938d97 SDNQ fix sdxl unet quant config not getting saved 2025-10-17 00:08:17 +03:00
Vladimir Mandic 45c5091aa9 fix wan and add hf mirror setting
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-16 11:26:23 -04:00
Disty0 63aad89676 remove the unused state_dict arg 2025-10-16 16:29:23 +03:00
Vladimir Mandic 070edb20b0 update transformers and fix quant params
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-16 09:21:20 -04:00
Vladimir Mandic ffe2a9d148 add ltxvideo-0.9.8
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-15 14:49:31 -04:00
Vladimir Mandic 4452e03221 cleanup
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-15 13:39:56 -04:00
Disty0 845869079d Fix sdnq unset config 2025-10-14 17:58:09 +03:00
Disty0 6d64f4a2fd rename svd options 2025-10-14 17:48:01 +03:00
Vladimir Mandic 85a58ed5bf seedvr enable quant
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-14 10:23:12 -04:00
Disty0 4aee524ddf SDNQ add NaDiT keys 2025-10-14 17:18:58 +03:00
Vladimir Mandic 57230bbf3a update seedvr-7b config
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-14 09:35:00 -04:00
Vladimir Mandic b4d61a5a5c Merge pull request #4262 from nolbert82/master
Fix prompt scheduling embed reuse and add per-batch caching
2025-10-14 09:27:40 -04:00
Nolan GILBERT a0e8b04ec6 Fix empty prompt overriding 2025-10-14 15:25:28 +02:00
Nolan GILBERT db9297244c Fixed prompt scheduling + uses TE once per batch 2025-10-14 01:07:48 +02:00
Disty0 b601f0d402 SDNQ expose svd_steps and update module skip keys 2025-10-14 00:15:09 +03:00
Vladimir Mandic f3b4ef2551 simplify seedvr dependencies
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-13 10:20:42 -04:00
Vladimir Mandic 32014fbb9d seedvr requirements
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-13 10:04:01 -04:00
Disty0 d4d24214b3 SDNQ use a better way of loading pre quants and cleanup 2025-10-13 14:06:13 +03:00
Vladimir Mandic 2e4e741d47 seedvt2
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-12 15:35:08 -04:00
Vladimir Mandic 8d36a5aebb lint
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-11 18:51:07 -04:00
Vladimir Mandic eaa7dc119b prototype seedvr
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-11 18:35:43 -04:00
Disty0 a376f89fd6 Add type checking to SDNQConfig 2025-10-12 01:02:47 +03:00
Disty0 9206d9443e SDNQ add dequantize model 2025-10-12 00:00:53 +03:00
Disty0 9a8ba0fc90 SDNQ unset device specific configs on save 2025-10-11 19:24:09 +03:00
Vladimir Mandic 0ae4decadc hidream-e1.1
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-11 11:15:15 -04:00
Vladimir Mandic c0600ae960 update seq method
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-11 09:32:23 -04:00
Disty0 f7286c90d5 SDNQ add native pre-quant loader support to from_pretrained 2025-10-11 16:19:11 +03:00
Disty0 6bc83bc296 Prevent accelerate from splitting Linear and Conv layers and causing device mismatch errors 2025-10-11 03:19:30 +03:00
Disty0 0f785880ee SDNQ fix a singular bias not getting offloaded 2025-10-11 02:38:49 +03:00
Disty0 1d2775103f Fix Qwen VAE 2025-10-11 00:55:22 +03:00
Disty0 c7aba8589b SDNQ fix Qwen loading 2025-10-11 00:05:09 +03:00
Vladimir Mandic 5db54ffb55 set unique guidance labels
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-09 16:11:47 -04:00
Disty0 2a3deaa064 Check T5 keys before override 2025-10-09 22:46:27 +03:00
Disty0 6995d8c3c6 SDNQ fix T5 loading 2025-10-09 22:42:20 +03:00
Disty0 624e525fd1 Fix VAE config not being read correctly with SDNQ pre-quants 2025-10-09 20:30:23 +03:00
Disty0 612df3abbb cleanup 2025-10-09 20:09:34 +03:00
Disty0 a9de8ef152 cleanup 2025-10-09 19:58:57 +03:00
Disty0 e19fb2d833 SDNQ keep the quant configs inside the module subfolder, add dtype cast and don't send to GPU 2025-10-09 19:34:48 +03:00
Vladimir Mandic ee3e0aa978 add epoch to namegens and disable spellchecks
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-09 12:22:55 -04:00
Vladimir Mandic 70defe6d06 handle load shards
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-09 11:29:36 -04:00
Vladimir Mandic 9c4780a5e7 fix guidance metadata
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-09 09:25:49 -04:00
Vladimir Mandic 6907fcd320 speedup prequant model load
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-08 13:47:36 -04:00
Disty0 35277a79d3 cleanup x3 2025-10-08 01:21:11 +03:00
Disty0 9c16e2234a cleanup 2025-10-08 01:18:12 +03:00
Disty0 25303bb182 cleanup 2025-10-08 01:16:25 +03:00
Disty0 bdcd07f713 Add add_module_skip_keys to pre-load quant too 2025-10-08 01:11:40 +03:00
Disty0 7fdf400e8b cleanup 2025-10-08 00:41:04 +03:00
Disty0 df03ea9ba8 SDNQ add sdnq_post_load_quant and update Qwen keys 2025-10-08 00:29:36 +03:00
Vladimir Mandic 962cb7115d infra for full-model load/save with quant
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-07 14:30:45 -04:00
Vladimir Mandic e4120bd4d6 detect sdnq saved model
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-07 10:57:17 -04:00
Vladimir Mandic 0092a8b86b add quantization_config for post-load
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-07 10:14:31 -04:00
Vladimir Mandic 7fdc880a73 sdnq patches
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-07 09:43:34 -04:00
Vladimir Mandic 25e28050c3 update swagger docs
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-07 08:21:05 -04:00
Vladimir Mandic fe41d7da2a use shared llama
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-07 08:14:11 -04:00
Vladimir Mandic 3fe1d090e4 add configurable layers to taehv
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-06 19:13:30 -04:00
Disty0 1cd7b6d63a fix upcast scale check 2025-10-07 01:27:54 +03:00
Disty0 aa0c10440f SDNQ make the loader don't touch the model options by default 2025-10-07 00:15:23 +03:00
Disty0 5306376b2a improve contiguous mm performance 2025-10-06 19:05:46 +03:00
Disty0 be91bbff75 SDNQ add SVD support for Convs 2025-10-06 18:26:42 +03:00
Disty0 c931bf9efa SDNQ add dtype casting to loader 2025-10-06 17:44:52 +03:00
Disty0 5c042c5fb8 cleanup 2025-10-06 11:30:26 +03:00
Vladimir Mandic a315a004e9 linting
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-05 20:25:33 -04:00
Vladimir Mandic 28e3ae0480 experimental xomni
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-05 20:17:32 -04:00
Vladimir Mandic 3e47f3dd9a video prompt enhance
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-05 20:17:32 -04:00
Vladimir Mandic 58b0ab9da6 unified video save
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-05 20:17:32 -04:00
Vladimir Mandic d7d86ed286 ltx job tracking
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-05 20:17:32 -04:00
Vladimir Mandic 7dad90b385 video use shared t5
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-05 20:17:32 -04:00
Vladimir Mandic 50c3385cf9 fix ltx model selection
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-05 20:17:32 -04:00
Disty0 23f2deaa58 fix enable_quantized_mamtul 2025-10-06 02:04:28 +03:00
Disty0 1f81a37e8e Set the default svd rank to 32 2025-10-06 01:27:29 +03:00
Disty0 ebb26ac123 SDNQ make load file name configurable 2025-10-06 01:04:00 +03:00
Disty0 0acb571472 SDNQ add load and save model funcs 2025-10-06 00:57:23 +03:00
Disty0 9e52d0c1fb SDNQ add SVDQuant quantization method 2025-10-05 22:50:30 +03:00
Vladimir Mandic 268798a24e add framepack granular job tracking
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-05 10:39:55 -04:00
Disty0 428600613a SDNQ fix new transformers again 2025-10-05 15:30:15 +03:00
Disty0 a164f3e0c2 SDNQ Improve UINT3 and below quant speed 2025-10-05 03:12:05 +03:00
Vladimir Mandic 8325e886c7 add typo to legacy compatibility options
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-04 18:11:52 -04:00
Vladimir Mandic c530167cbe qwen multi-image edits
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-04 18:06:50 -04:00
Vladimir Mandic 8b698ed67f update qwen pruning and allow hf models in subfolders
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-04 15:49:20 -04:00
Disty0 f2e12a682f SDNQ remove use_contiguous_mm path in re_quant 2025-10-04 19:17:05 +03:00
Disty0 df142afe81 don't use triton mm for nvidia 2025-10-04 18:48:03 +03:00
Disty0 5c5d7d5a86 cleanup 2025-10-04 18:38:18 +03:00
Disty0 99113947bf SDNQ add RDNA2 INT8 support via Triton 2025-10-04 18:31:25 +03:00
Vladimir Mandic 54ae18a611 update nunchaku
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-03 12:23:24 -04:00
Disty0 95a7da7e75 SDNQ use non-contiguous re-quantize 2025-10-03 18:54:58 +03:00
Vladimir Mandic a6108dd6df add qwen pruning variants
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-03 11:36:14 -04:00
Disty0 54acf1760b Make SDNQ scales compatible with balanced offload 2025-10-03 18:13:55 +03:00
Disty0 c5cab96223 SDNQ simplify check_mats 2025-10-03 02:58:17 +03:00
Vladimir Mandic 7325f9dbae dont import guiders until needed
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-02 16:28:34 -04:00
Disty0 34c2a624aa SDNQ autodetect fp8 tw fallback and disable dynamic compile 2025-10-02 19:40:07 +03:00
Vladimir Mandic f245506bf2 cleanup hf login
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-02 08:17:35 -04:00
CalamitousFelicitousness 78711fb1d4 Merge branch 'dev' into patch-2 2025-10-01 20:58:58 +01:00
CalamitousFelicitousness 78820a14dc Allow VLM temp setting temperature to 0 2025-10-01 20:52:04 +01:00