Commit Graph

147 Commits (8049936e66d98e07da740793af8a58b84ef9d455)

Author SHA1 Message Date
Vladimir Mandic d43091f1fa lint set minimum to py310 and update rules Signed-off-by: Vladimir Mandic <mandic00@live.com> 2025-10-29 11:28:09 -04:00
Disty0 7bcc5fa29c SDNQ add HunyuanImage3ForCausalMM keys 2025-10-29 13:36:31 +03:00
Disty0 6c937c2747 Fix transformers using all the ram 2025-10-29 13:09:03 +03:00
Vladimir Mandic bc775f0530 add wan asymmetric vae upscaler Signed-off-by: Vladimir Mandic <mandic00@live.com> 2025-10-28 13:55:46 -04:00
Disty0 e6af602c0d handle files = str case 2025-10-27 21:40:18 +03:00
Disty0 a830c0a7e0 cleanup 2025-10-27 21:32:52 +03:00
Vladimir Mandic 5ab9a5a15d add sota model loader: runai streamer Signed-off-by: Vladimir Mandic <mandic00@live.com> 2025-10-27 14:20:10 -04:00
Disty0 2104bf8bb0 sdnq add wan keys 2025-10-25 15:34:14 +03:00
Disty0 b627617d14 SDNQ fix enable matmul after load 2025-10-19 17:25:02 +03:00
Disty0 f05c29175e cleanup 2025-10-19 02:09:25 +03:00
Disty0 758b006104 cleanup 2025-10-19 02:00:16 +03:00
Disty0 ef72edf18f SDNQ improve svd and low bit matmul perf 2025-10-19 00:06:07 +03:00
Disty0 f12caf81f9 SDNQ skip bad layers on svd and fix svd with dequantize_fp32 2025-10-17 17:25:50 +03:00
Vladimir Mandic 4f336d3aab linting Signed-off-by: Vladimir Mandic <mandic00@live.com> 2025-10-16 19:39:05 -04:00
Disty0 2cf9938d97 SDNQ fix sdxl unet quant config not getting saved 2025-10-17 00:08:17 +03:00
Disty0 63aad89676 remove the unused state_dict arg 2025-10-16 16:29:23 +03:00
Vladimir Mandic 070edb20b0 update transformers and fix quant params Signed-off-by: Vladimir Mandic <mandic00@live.com> 2025-10-16 09:21:20 -04:00
Disty0 845869079d Fix sdnq unset config 2025-10-14 17:58:09 +03:00
Disty0 4aee524ddf SDNQ add NaDiT keys 2025-10-14 17:18:58 +03:00
Disty0 b601f0d402 SDNQ expose svd_steps and update module skip keys 2025-10-14 00:15:09 +03:00
Disty0 d4d24214b3 SDNQ use a better way of loading pre quants and cleanup 2025-10-13 14:06:13 +03:00
Vladimir Mandic 2e4e741d47 seedvt2 Signed-off-by: Vladimir Mandic <mandic00@live.com> 2025-10-12 15:35:08 -04:00
Disty0 a376f89fd6 Add type checking to SDNQConfig 2025-10-12 01:02:47 +03:00
Disty0 9206d9443e SDNQ add dequantize model 2025-10-12 00:00:53 +03:00
Disty0 9a8ba0fc90 SDNQ unset device specific configs on save 2025-10-11 19:24:09 +03:00
Disty0 f7286c90d5 SDNQ add native pre-quant loader support to from_pretrained 2025-10-11 16:19:11 +03:00
Disty0 6bc83bc296 Prevent accelerate from splitting Linear and Conv layers and causing device mismatch errors 2025-10-11 03:19:30 +03:00
Disty0 0f785880ee SDNQ fix a singular bias not getting offloaded 2025-10-11 02:38:49 +03:00
Disty0 c7aba8589b SDNQ fix Qwen loading 2025-10-11 00:05:09 +03:00
Disty0 2a3deaa064 Check T5 keys before override 2025-10-09 22:46:27 +03:00
Disty0 6995d8c3c6 SDNQ fix T5 loading 2025-10-09 22:42:20 +03:00
Disty0 612df3abbb cleanup 2025-10-09 20:09:34 +03:00
Disty0 a9de8ef152 cleanup 2025-10-09 19:58:57 +03:00
Disty0 e19fb2d833 SDNQ keep the quant configs inside the module subfolder, add dtype cast and don't send to GPU 2025-10-09 19:34:48 +03:00
Vladimir Mandic 70defe6d06 handle load shards Signed-off-by: Vladimir Mandic <mandic00@live.com> 2025-10-09 11:29:36 -04:00
Vladimir Mandic 6907fcd320 speedup prequant model load Signed-off-by: Vladimir Mandic <mandic00@live.com> 2025-10-08 13:47:36 -04:00
Disty0 35277a79d3 cleanup x3 2025-10-08 01:21:11 +03:00
Disty0 9c16e2234a cleanup 2025-10-08 01:18:12 +03:00
Disty0 25303bb182 cleanup 2025-10-08 01:16:25 +03:00
Disty0 bdcd07f713 Add add_module_skip_keys to pre-load quant too 2025-10-08 01:11:40 +03:00
Disty0 7fdf400e8b cleanup 2025-10-08 00:41:04 +03:00
Disty0 df03ea9ba8 SDNQ add sdnq_post_load_quant and update Qwen keys 2025-10-08 00:29:36 +03:00
Vladimir Mandic 962cb7115d infra for full-model load/save with quant Signed-off-by: Vladimir Mandic <mandic00@live.com> 2025-10-07 14:30:45 -04:00
Vladimir Mandic 7fdc880a73 sdnq patches Signed-off-by: Vladimir Mandic <mandic00@live.com> 2025-10-07 09:43:34 -04:00
Disty0 1cd7b6d63a fix upcast scale check 2025-10-07 01:27:54 +03:00
Disty0 aa0c10440f SDNQ make the loader don't touch the model options by default 2025-10-07 00:15:23 +03:00
Disty0 5306376b2a improve contiguous mm performance 2025-10-06 19:05:46 +03:00
Disty0 be91bbff75 SDNQ add SVD support for Convs 2025-10-06 18:26:42 +03:00
Disty0 c931bf9efa SDNQ add dtype casting to loader 2025-10-06 17:44:52 +03:00
Disty0 5c042c5fb8 cleanup 2025-10-06 11:30:26 +03:00