Commit Graph

30 Commits (2e4e741d47c61730d34e79ae8dae4f7995f7b4a7)

Author SHA1 Message Date
Disty0 5c042c5fb8 cleanup 2025-10-06 11:30:26 +03:00
Vladimir Mandic a315a004e9 linting
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-10-05 20:25:33 -04:00
Disty0 9e52d0c1fb SDNQ add SVDQuant quantization method 2025-10-05 22:50:30 +03:00
Disty0 54acf1760b Make SDNQ scales compatible with balanced offload 2025-10-03 18:13:55 +03:00
Disty0 afb3a5a06d SDNQ move non_blocking to quant config 2025-08-11 15:07:02 +03:00
Disty0 86cd272b96 SDNQ fix Dora 2025-06-18 16:24:42 +03:00
Disty0 25fc0094a9 SDNQ use quantize_device and return_device args and fix decompress_fp32 always being on 2025-06-14 21:29:08 +03:00
Disty0 2ba64abcde Cleanup 2025-06-14 00:54:18 +03:00
Disty0 5e013fb154 SDNQ optimize input quantization and use the word quantize instead of compress 2025-06-12 12:06:57 +03:00
Disty0 5eed9135e3 Split SDNQ into multiple files and linting 2025-06-10 03:18:25 +03:00
Disty0 976f0ba61f Cleanup 2025-06-05 20:59:58 +03:00
Disty0 90324f9c8c SDNQ fix lora with quant matmul 2025-05-29 18:25:12 +03:00
Disty0 dece497f10 Refactor SDNQ to use weights_dtype and rename decompress_int8_matmul to use_quantized_matmul 2025-05-27 15:49:21 +03:00
Disty0 280be31883 SDNQ fix Lora change 2025-05-27 00:08:32 +03:00
Disty0 84ddfb2868 SDNQ fix lora apply 2025-05-26 22:39:20 +03:00
Disty0 687c50dcc8 SDNQ fix Lora 2025-05-26 19:48:45 +03:00
Disty0 91bb07f650 SDNQ remove unused args and simplify decompressors 2025-05-26 15:51:53 +03:00
Disty0 4453efee76 Rename NNCF to SDNQ and rename quant schemes 2025-05-26 02:39:51 +03:00
Disty0 2d79380bd7 NNCF implement better layer hijacks and remove all NNCF imports 2025-05-26 01:12:28 +03:00
Vladimir Mandic 5c0e3b635c update diffusers and lint/changelog/todo
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-14 13:30:52 -04:00
Disty0 f4e3a81a84 NNCF experimental direct INT8 MatMul support 2025-05-12 21:41:49 +03:00
Disty0 9cfdc3c079 Remove NNCF device hijack 2025-05-11 18:30:10 +03:00
Disty0 75d169bc1c Fix NNCF Lora with model offload 2025-04-23 17:13:08 +03:00
Disty0 f1d8543cae NNCF lora support 2025-04-23 15:44:09 +03:00
Vladimir Mandic 84a24fb681 lora restore weights to orig device on apply
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-04-07 07:49:18 -04:00
Vladimir Mandic d30b1cb1c8 lora improvements
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-04-06 13:02:58 -04:00
Vladimir Mandic 8725cfc488 lora obey device
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-04-03 14:31:39 -04:00
Vladimir Mandic 5c6c1465f4 fix style apply params
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-04-03 10:03:48 -04:00
Vladimir Mandic 6430f7006f add monitor cli option and finish lora refactor
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-04-01 13:39:47 -04:00
Vladimir Mandic b5031a5eba lora modularize code
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-04-01 13:39:47 -04:00