Commit Graph

9922 Commits (4e4557d81c57128a9d847b19463b1ed67dd7c8d8)

Author SHA1 Message Date
Disty0 4e4557d81c NNCF set min matmul shape to 32 2025-05-13 18:50:23 +03:00
Disty0 b9ad55857d NNCF INT8 MatMul don't force FP32 with FP16 scales 2025-05-13 05:08:22 +03:00
Disty0 129c701b3d NNCF use torch.compile directly on int8_matmul instead of sub functions 2025-05-13 04:28:37 +03:00
Disty0 f1eefe97a4 NNCF use inplace ops 2025-05-13 03:49:30 +03:00
Vladimir Mandic b23aed746d update changelog
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-12 20:33:16 -04:00
Vladimir Mandic 47862fef08 prompt enhance nsfw allow/disallow
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-12 20:33:16 -04:00
Disty0 860bbe1856
Merge pull request #3933 from vladmandic/master
OpenVINO force device
2025-05-13 03:30:33 +03:00
Disty0 95fa636f58
Merge pull request #3932 from chrismuzyn/master
When using the openvino backend, do not look for an nvidia gpu.
2025-05-13 03:29:38 +03:00
Disty0 98ccce68a1
Merge branch 'master' into master 2025-05-13 03:28:51 +03:00
Vladimir Mandic 5142fed8e4
Merge pull request #3931 from vladmandic/dev
dev merge
2025-05-12 19:19:43 -04:00
Vladimir Mandic 82f56e7b01
Merge branch 'master' into dev 2025-05-12 19:19:34 -04:00
Vladimir Mandic 01a038746d fix networks display
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-12 19:17:42 -04:00
chrismuzyn 299d189276 When using the openvino backend, do not look for an nvidia gpu. 2025-05-12 19:14:26 -04:00
Disty0 f4e3a81a84 NNCF experimental direct INT8 MatMul support 2025-05-12 21:41:49 +03:00
Vladimir Mandic c3def9b601 update readme
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-12 14:22:09 -04:00
Vladimir Mandic 62da781959
Merge pull request #3929 from vladmandic/dev
merge dev
2025-05-12 11:17:13 -04:00
Vladimir Mandic 91080f349f latent-diffusion-upscale n-steps
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-12 11:12:36 -04:00
Vladimir Mandic b9bf4e5c0f prompt-enhance-i2i
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-12 09:59:40 -04:00
Vladimir Mandic 40d3b1bfc2 update requirements
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-12 09:30:15 -04:00
Vladimir Mandic 8051c0824a update changelog
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-12 09:27:44 -04:00
Vladimir Mandic df8c45e0ff update changelog
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-11 14:58:57 -04:00
Vladimir Mandic a6e6c6ce5e support sdxl packaged without vae
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-11 14:49:27 -04:00
Disty0 4eedeab9f8 NNCF use group size instead of number of groups and set default group size for int4 to 64 2025-05-11 20:38:01 +03:00
Disty0 03d05b6243 NNCF fix very large number of groups 2025-05-11 18:43:19 +03:00
Disty0 9cfdc3c079 Remove NNCF device hijack 2025-05-11 18:30:10 +03:00
Disty0 0673689d5b NNCF set the default group size to 128 for INT4 2025-05-11 08:45:27 +03:00
Disty0 4af570bfd4 Cleanup 2025-05-11 08:10:13 +03:00
Disty0 8d27d34969 NNCF don't clip the zero_point 2025-05-11 07:57:51 +03:00
Disty0 98308fa187 NNCF fix group size calculation 2025-05-11 07:08:21 +03:00
Disty0 020e1aa374 Cleanup 2025-05-11 06:56:36 +03:00
Disty0 03a6d7f9bf NNCF add number of quantization groups 2025-05-11 05:55:58 +03:00
Vladimir Mandic fbaca247ef update logging and changelog
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-10 20:20:02 -04:00
Vladimir Mandic 76e6f02cb9 update wiki
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-10 10:47:04 -04:00
Vladimir Mandic 6036ff2105 modernui framepack and history tabs
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-10 10:21:37 -04:00
Vladimir Mandic aa04dd900a fix modernui disabled
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-10 08:53:30 -04:00
Vladimir Mandic 7306c470af update todo/changelog
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-10 08:40:52 -04:00
Vladimir Mandic f0d81ee1e0 cleanup
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-10 08:28:19 -04:00
Seunghoon Lee 6d2b5fc37c
Merge pull request #3921 from alexsarmiento/patch-1
Update fwd_prefill.py
2025-05-10 11:47:26 +09:00
Disty0 a1491a660c Cleanup 2025-05-09 23:36:50 +03:00
Disty0 1ee9832e05 NNCF silence the pytorch version warning 2025-05-09 23:16:55 +03:00
Disty0 b0e5a6c4df Add devices.has_triton() and enable NNCF compile if triton is available 2025-05-09 22:24:36 +03:00
AS 399a630778
Merge branch 'dev' into patch-1 2025-05-09 11:18:25 -07:00
Disty0 a4d4462e2a NNCF add decompress using toch.compile option 2025-05-09 21:02:24 +03:00
Seunghoon Lee 45c0bd6ec6
basic windows native pytorch support 2025-05-09 22:23:07 +09:00
Vladimir Mandic 51716bbaba update diffusers
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-09 09:04:21 -04:00
Vladimir Mandic 808462fdab update changelog
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-05-09 08:18:40 -04:00
Seunghoon Lee a27abf7363
zluda fix old cards 2025-05-09 20:02:45 +09:00
Disty0 9b641becd5 IPEX fix torch.compile with tensor.to 2025-05-09 11:07:51 +03:00
Seunghoon Lee 18f58e13f8
Update fwd_prefill.py 2025-05-09 13:36:01 +09:00
Vladimir Mandic 9e18d2f658
Merge branch 'dev' into patch-1 2025-05-08 23:21:24 -04:00