Disty0
|
4e4557d81c
|
NNCF set min matmul shape to 32
|
2025-05-13 18:50:23 +03:00 |
Disty0
|
b9ad55857d
|
NNCF INT8 MatMul don't force FP32 with FP16 scales
|
2025-05-13 05:08:22 +03:00 |
Disty0
|
129c701b3d
|
NNCF use torch.compile directly on int8_matmul instead of sub functions
|
2025-05-13 04:28:37 +03:00 |
Disty0
|
f1eefe97a4
|
NNCF use inplace ops
|
2025-05-13 03:49:30 +03:00 |
Vladimir Mandic
|
47862fef08
|
prompt enhance nsfw allow/disallow
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-12 20:33:16 -04:00 |
Disty0
|
98ccce68a1
|
Merge branch 'master' into master
|
2025-05-13 03:28:51 +03:00 |
chrismuzyn
|
299d189276
|
When using the openvino backend, do not look for an nvidia gpu.
|
2025-05-12 19:14:26 -04:00 |
Disty0
|
f4e3a81a84
|
NNCF experimental direct INT8 MatMul support
|
2025-05-12 21:41:49 +03:00 |
Vladimir Mandic
|
91080f349f
|
latent-diffusion-upscale n-steps
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-12 11:12:36 -04:00 |
Vladimir Mandic
|
a6e6c6ce5e
|
support sdxl packaged without vae
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-11 14:49:27 -04:00 |
Disty0
|
4eedeab9f8
|
NNCF use group size instead of number of groups and set default group size for int4 to 64
|
2025-05-11 20:38:01 +03:00 |
Disty0
|
03d05b6243
|
NNCF fix very large number of groups
|
2025-05-11 18:43:19 +03:00 |
Disty0
|
9cfdc3c079
|
Remove NNCF device hijack
|
2025-05-11 18:30:10 +03:00 |
Disty0
|
0673689d5b
|
NNCF set the default group size to 128 for INT4
|
2025-05-11 08:45:27 +03:00 |
Disty0
|
4af570bfd4
|
Cleanup
|
2025-05-11 08:10:13 +03:00 |
Disty0
|
8d27d34969
|
NNCF don't clip the zero_point
|
2025-05-11 07:57:51 +03:00 |
Disty0
|
98308fa187
|
NNCF fix group size calculation
|
2025-05-11 07:08:21 +03:00 |
Disty0
|
020e1aa374
|
Cleanup
|
2025-05-11 06:56:36 +03:00 |
Disty0
|
03a6d7f9bf
|
NNCF add number of quantization groups
|
2025-05-11 05:55:58 +03:00 |
Vladimir Mandic
|
6036ff2105
|
modernui framepack and history tabs
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-10 10:21:37 -04:00 |
Vladimir Mandic
|
aa04dd900a
|
fix modernui disabled
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-10 08:53:30 -04:00 |
Vladimir Mandic
|
f0d81ee1e0
|
cleanup
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-10 08:28:19 -04:00 |
Seunghoon Lee
|
6d2b5fc37c
|
Merge pull request #3921 from alexsarmiento/patch-1
Update fwd_prefill.py
|
2025-05-10 11:47:26 +09:00 |
Disty0
|
a1491a660c
|
Cleanup
|
2025-05-09 23:36:50 +03:00 |
Disty0
|
1ee9832e05
|
NNCF silence the pytorch version warning
|
2025-05-09 23:16:55 +03:00 |
Disty0
|
b0e5a6c4df
|
Add devices.has_triton() and enable NNCF compile if triton is available
|
2025-05-09 22:24:36 +03:00 |
AS
|
399a630778
|
Merge branch 'dev' into patch-1
|
2025-05-09 11:18:25 -07:00 |
Disty0
|
a4d4462e2a
|
NNCF add decompress using torch.compile option
|
2025-05-09 21:02:24 +03:00 |
Seunghoon Lee
|
45c0bd6ec6
|
basic windows native pytorch support
|
2025-05-09 22:23:07 +09:00 |
Vladimir Mandic
|
51716bbaba
|
update diffusers
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-09 09:04:21 -04:00 |
Vladimir Mandic
|
808462fdab
|
update changelog
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-09 08:18:40 -04:00 |
Seunghoon Lee
|
a27abf7363
|
zluda fix old cards
|
2025-05-09 20:02:45 +09:00 |
Disty0
|
9b641becd5
|
IPEX fix torch.compile with tensor.to
|
2025-05-09 11:07:51 +03:00 |
Seunghoon Lee
|
18f58e13f8
|
Update fwd_prefill.py
|
2025-05-09 13:36:01 +09:00 |
Vladimir Mandic
|
9e18d2f658
|
Merge branch 'dev' into patch-1
|
2025-05-08 23:21:24 -04:00 |
AS
|
5e561560ea
|
Update fwd_prefill.py
Remove cudagraph from triton autotune. Buggy under zluda
|
2025-05-08 19:32:51 -07:00 |
Vladimir Mandic
|
6489e4c37d
|
prompt-enhance api support and img2img support
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-08 15:31:07 -04:00 |
Vladimir Mandic
|
55b1cb8c8b
|
lower default teacache threshold
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-08 10:14:15 -04:00 |
Vladimir Mandic
|
432a5977a7
|
adetailer fix enable-disable
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-08 10:10:43 -04:00 |
Disty0
|
f3aa3b4574
|
NNCF remove T5 hijack from pre quant mode
|
2025-05-08 14:11:19 +03:00 |
Disty0
|
dfebc909eb
|
Disable cuDNN benchmark on ROCm and add cudnn_benchmark_limit option
|
2025-05-08 13:27:06 +03:00 |
Vladimir Mandic
|
8433f685e7
|
clear-cache on model unload
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-07 20:22:30 -04:00 |
Disty0
|
b6d2aa7fd8
|
NNCF more optimizations
|
2025-05-07 21:50:31 +03:00 |
Disty0
|
a57c7087b8
|
Make NNCF INT4 quant run 75% faster and don't force fp32 decompress
|
2025-05-07 20:34:07 +03:00 |
Vladimir Mandic
|
5261c55890
|
fix lora legacy disabled
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-07 11:48:21 -04:00 |
Vladimir Mandic
|
78e22350b9
|
add api get-checkpoint
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-07 10:31:06 -04:00 |
Seunghoon Lee
|
2a4eb86d10
|
zluda 3.9.4 & update flash attention 2
|
2025-05-07 00:39:40 +09:00 |
Vladimir Mandic
|
83fc68ece3
|
update requirements
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-06 09:47:13 -04:00 |
Vladimir Mandic
|
02fd0aee06
|
fix control refiner
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-06 09:06:50 -04:00 |
Vladimir Mandic
|
1da43e4f62
|
api b64 error handler
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-05-05 22:28:03 -04:00 |