1375 Commits (5d2f5dd6e7dbacd3e2ba648517548b43fdde80cc)

Author SHA1 Message Date
Vladimir Mandic bff75f0db3 add omnigen2
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-06-27 20:20:43 -04:00
Disty0 71f7474de2 Unify quant options 2025-06-27 21:05:14 +03:00
Disty0 3b8ced444c Add auto quantization mode 2025-06-27 18:54:15 +03:00
Vladimir Mandic 42b3e08e65 Control add setting to run hires with or without control
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-06-25 18:43:29 -04:00
Vladimir Mandic 5b486a6ef1 sdnq add xyz grid support, improve offloading compatibility
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-06-25 15:32:37 -04:00
Disty0 cb4684cbeb SDNQ add separate quant mode option for Text Encoders 2025-06-13 12:42:57 +03:00
Disty0 5e013fb154 SDNQ optimize input quantization and use the word quantize instead of compress 2025-06-12 12:06:57 +03:00
Disty0 33fadf946b SDNQ add 7 bit support 2025-06-10 11:33:06 +03:00
Disty0 58b646e7f2 SDNQ add 5-bit and 3-bit quantization support 2025-06-10 01:48:51 +03:00
Disty0 2ccc76ab91 Increase medvram mode to 12 GB and update wiki 2025-06-06 15:27:30 +03:00
Disty0 e25890bb1d SDNQ INT8 matmul support for Conv2d 2025-06-05 13:36:49 +03:00
Disty0 b146025a5e SDNQ add int2 2025-06-02 00:17:39 +03:00
Disty0 839295f79a Add fp8 fnuz to sdnq options 2025-06-01 23:10:08 +03:00
Disty0 539fae3234 Update naming 2025-06-01 21:01:56 +03:00
Disty0 c85cc6b397 SDNQ enable quant with GPU by default and don't do unnecessary clones 2025-05-30 15:21:29 +03:00
Disty0 29bd2af779 SDNQ add 6-bit support 2025-05-30 12:20:13 +03:00
Disty0 d1491962d9 One bit 2025-05-30 05:41:02 +03:00
Disty0 3c8be0f55f SDNQ add uint2 2025-05-30 04:47:29 +03:00
Disty0 d8e8f47ce5 SDNQ add an option to toggle quantize with GPU 2025-05-28 15:18:39 +03:00
Disty0 1961e88c13 Set SDPA as the default on all backends and enable Dyn SDPA on ROCm, DML, CPU and MPS 2025-05-28 13:42:29 +03:00
Disty0 b724cd7c57 Update changelog 2025-05-27 21:21:42 +03:00
Disty0 5d3c1832b2 SDNQ add FP8 quants 2025-05-27 20:29:15 +03:00
Disty0 dece497f10 Refactor SDNQ to use weights_dtype and rename decompress_int8_matmul to use_quantized_matmul 2025-05-27 15:49:21 +03:00
Disty0 79bb348927 SDNQ sort quant schemes by recommended order 2025-05-27 13:06:17 +03:00
Disty0 ccf9deaf28 Move SDNQ to the top of the settings list 2025-05-26 18:30:50 +03:00
Disty0 4453efee76 Rename NNCF to SDNQ and rename quant schemes 2025-05-26 02:39:51 +03:00
Disty0 85f00f9edb Enable dyn atten by default for ROCm 2025-05-23 18:24:50 +03:00
Vladimir Mandic 4157336238 rename vae
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-15 08:59:49 -04:00
Disty0 18c10883b8 Move NNCF above in the settings list 2025-05-14 05:19:11 +03:00
Disty0 f4e3a81a84 NNCF experimental direct INT8 MatMul support 2025-05-12 21:41:49 +03:00
Vladimir Mandic 91080f349f latent-diffusion-upscale n-steps
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-12 11:12:36 -04:00
Disty0 4eedeab9f8 NNCF use group size instead of number of groups and set default group size for int4 to 64 2025-05-11 20:38:01 +03:00
Disty0 0673689d5b NNCF set the default group size to 128 for INT4 2025-05-11 08:45:27 +03:00
Disty0 03a6d7f9bf NNCF add number of quantization groups 2025-05-11 05:55:58 +03:00
Disty0 b0e5a6c4df Add devices.has_triton() and enable NNCF compile if triton is available 2025-05-09 22:24:36 +03:00
Disty0 a4d4462e2a NNCF add decompress using torch.compile option 2025-05-09 21:02:24 +03:00
Seunghoon Lee 45c0bd6ec6 basic windows native pytorch support 2025-05-09 22:23:07 +09:00
Vladimir Mandic 808462fdab update changelog
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-09 08:18:40 -04:00
Vladimir Mandic 55b1cb8c8b lower default teacache threshold
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-08 10:14:15 -04:00
Disty0 dfebc909eb Disable cuDNN benchmark on ROCm and add cudnn_benchmark_limit option 2025-05-08 13:27:06 +03:00
Disty0 a57c7087b8 Make NNCF INT4 quant run 75% faster and don't force fp32 decompress 2025-05-07 20:34:07 +03:00
Vladimir Mandic 5261c55890 fix lora legacy disabled
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-07 11:48:21 -04:00
Disty0 f4dfe20bc1 Add sigmoid beta scheduler 2025-05-04 17:42:22 +03:00
Vladimir Mandic 473f394f97 fix save style
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-04-30 09:18:41 -04:00
Vladimir Mandic ff649291b5 lint fixes
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-04-28 10:55:21 -04:00
Vladimir Mandic 1b341dd809 setting to enable/disable clip skip editing
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-04-28 09:44:51 -04:00
Vladimir Mandic 5b68979226 update option names
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-04-26 13:46:33 -04:00
Vladimir Mandic 5647d782f8 configurable restore metadata settings and params
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-04-26 10:45:49 -04:00
Disty0 74d4093e74 NNCF disable quant conv by default 2025-04-23 16:31:27 +03:00
Vladimir Mandic 641e1e52b3 fix config save
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-04-23 09:10:53 -04:00