Commit Graph

64 Commits (0e0b607cfaabf59167e3ab2a44d491b4e4e1b08e)

Author SHA1 Message Date
Disty0 30d21f420f Allow all types in dyn quant 2026-03-11 13:46:24 +03:00
Disty0 d9e628574a SDNQ add 15, 13, 11 and 9 bit support 2026-03-11 03:39:02 +03:00
Disty0 9fbe70195f SDNQ add 14, 12 and 10 bit support 2026-03-09 23:05:08 +03:00
Disty0 0160f6b3ef add sdnq variants of pytorch native fp formats too 2026-02-24 20:08:41 +03:00
Disty0 78efbc7e85 update sdnq 2026-02-24 19:47:30 +03:00
Disty0 f439d51ea7 SDNQ update CosmosTransformer3DModel keys 2026-02-04 10:19:23 +03:00
Disty0 df30ba28ac SDNQ clamp weight_std and add CosmosTransformer3DModel keys 2026-02-04 00:31:00 +03:00
Disty0 8d6bfcd827 Update SDNQ 2026-01-23 14:39:07 +03:00
Disty0 784cda80aa update sdnq 2026-01-14 16:23:26 +03:00
Disty0 47dcab3522 update sdnq 2026-01-09 00:34:32 +03:00
Seunghoon Lee 49965dfda8
get_hip_arch_name -> get_hip_agent, use amdhip64_7.dll served within rocm package 2026-01-03 21:00:36 +09:00
vladmandic b9c18452f2 unify hip get arch name
Signed-off-by: vladmandic <mandic00@live.com>
2026-01-03 08:22:19 +01:00
Disty0 8e34866238 SDNQ fix outdated PyTorch 2025-12-30 21:29:41 +03:00
Disty0 5e934a12a2 sdnq cleanup unused args 2025-12-28 20:08:58 +03:00
Disty0 b852ff42ef SDNQ fix wrong fp8 mm type is set 2025-12-27 17:27:05 +03:00
Disty0 448e7b7735 replace the default fp6 type 2025-12-27 02:10:12 +03:00
Disty0 761fb82685 fix missing comma 2025-12-26 21:27:57 +03:00
Disty0 fd6c89a626 cleanup 2025-12-26 21:16:55 +03:00
Disty0 4a4784eafa SDNQ add new stack of custom floating point types and remove irrelevant qtypes from the ui list 2025-12-26 20:09:17 +03:00
Disty0 471b6dc1b7 SDNQ add siglip_embedder to ZImage skip keys 2025-12-23 04:32:54 +03:00
Disty0 ce8b6d138c SDNQ remove forced uint4 from convs and cleanup 2025-12-13 01:32:52 +03:00
Disty0 d4e2cbb826 SDNQ fix torch.compile always being active 2025-12-08 18:15:08 +03:00
Disty0 7a6356f8eb SDNQ fix transformers v5 and check for torch._dynamo.config.disable 2025-12-08 00:36:15 +03:00
Disty0 1cfb61809f cleanup 2025-12-05 18:40:49 +03:00
Disty0 5b86bef796 SDNQ add longcat keys 2025-12-05 18:37:20 +03:00
Disty0 7aa1bfdc70 Add get_modules_to_not_convert from transformers v5 2025-12-02 01:01:51 +03:00
Disty0 01a0f6b356 Warn and disable quantized matmul if triton is not available 2025-11-29 01:34:54 +03:00
Disty0 55cf627ac6 add version to sdnq 2025-11-28 00:45:24 +03:00
Disty0 16c429711c update lumina and z_image keys 2025-11-26 23:22:44 +03:00
Disty0 679060bd00 SDNQ add lumina and z_image keys 2025-11-26 22:51:15 +03:00
Disty0 da3c439059 SDNQ fix _tied_weights_keys is dict case 2025-11-25 19:37:46 +03:00
Disty0 aeb71d172e SDNQ add Flux2Transformer2DModel keys 2025-11-25 19:22:02 +03:00
vladmandic 9658a330b2 lint
Signed-off-by: vladmandic <mandic00@live.com>
2025-11-23 13:29:03 -05:00
Disty0 25d05b1445 SDNQ catch all exceptions on triton import 2025-11-22 14:48:55 +03:00
Disty0 b6e9332cfe SDNQ de-couple matmul dtype and add fp16 matmul 2025-11-22 02:16:20 +03:00
Disty0 1745ed53f8 Refactor SDNQDequantizer 2025-11-18 01:42:58 +03:00
Disty0 a4378a79e4 fix typo 2025-11-04 14:30:52 +03:00
Disty0 8ad53ed4b3 SDNQ update keys 2025-11-04 14:29:44 +03:00
Disty0 76d699dc09 SDNQ add common keys 2025-10-31 00:21:54 +03:00
Disty0 da3d183f96 add Emu3ForCausalLM keys 2025-10-30 23:44:05 +03:00
Disty0 b9435257c4 SDNQ add chrono keys 2025-10-30 23:33:38 +03:00
Disty0 7bcc5fa29c SDNQ add HunyuanImage3ForCausalMM keys 2025-10-29 13:36:31 +03:00
Disty0 2104bf8bb0 sdnq add wan keys 2025-10-25 15:34:14 +03:00
Disty0 4aee524ddf SDNQ add NaDiT keys 2025-10-14 17:18:58 +03:00
Disty0 b601f0d402 SDNQ expose svd_steps and update module skip keys 2025-10-14 00:15:09 +03:00
Disty0 a376f89fd6 Add type checking to SDNQConfig 2025-10-12 01:02:47 +03:00
Disty0 df142afe81 don't use triton mm for nvidia 2025-10-04 18:48:03 +03:00
Disty0 5c5d7d5a86 cleanup 2025-10-04 18:38:18 +03:00
Disty0 99113947bf SDNQ add RDNA2 INT8 support via Triton 2025-10-04 18:31:25 +03:00
Disty0 34c2a624aa SDNQ autodetect fp8 tw fallback and disable dynamic compile 2025-10-02 19:40:07 +03:00