Commit Graph

254 Commits (10bbbed218458b8a899aac2140ec738d8d716f05)

Author SHA1 Message Date
Disty0 b14e8f9a5f Don't assume Cuda on devices.same_device() 2024-10-14 17:23:51 +03:00
Vladimir Mandic c2ab0b11c3 check te device
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-10-14 09:29:33 -04:00
Disty0 9c05124c33 Add devices.has_xpu() 2024-10-13 15:13:38 +03:00
Disty0 011d9c3348 Move device backend initialization to shared.py 2024-10-13 14:56:28 +03:00
Disty0 84f8ab4076 Fix IPEX 2024-10-13 14:24:55 +03:00
Vladimir Mandic 0c54c235cb add sageattention
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-10-12 15:43:34 -04:00
Disty0 2e2cb43406 Make SDPA hijacks chainable and add Sage Attention 2024-10-12 21:19:38 +03:00
Vladimir Mandic ea0dfebe2d better handle any quant lib requirements
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-10-12 13:36:16 -04:00
Vladimir Mandic 3bbcc33181 add detailer
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-10-07 09:32:56 -04:00
Vladimir Mandic c21a10b7c9 add bf16 override for directml
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-09-29 21:34:59 -04:00
Vladimir Mandic b31d02ba1d cleanup
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-09-29 20:28:01 -04:00
Vladimir Mandic 47755dce6b refactor devices
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-09-29 20:17:03 -04:00
Vladimir Mandic fe94edf781 set default cuda dtype to auto
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-09-29 14:16:14 -04:00
Vladimir Mandic 174add0c3a restore dtype after upcast complete
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-09-29 13:22:08 -04:00
Vladimir Mandic 92f2a2902f improve profiling 2024-09-23 11:07:24 -04:00
Vladimir Mandic fe93ad6929 refactor xyzgrid 2024-09-13 19:22:02 -04:00
Disty0 586e5384b5 Update IPEX to 2.3 on Linux 2024-09-10 19:13:42 +03:00
Vladimir Mandic b4df9a4de1 jumbo update with flux.1 refactor, see changelog for details 2024-09-01 22:56:15 -04:00
Vladimir Mandic 5ed58ac7cc end-to-end update flux, see changelog and wiki 2024-08-28 08:04:24 -04:00
Disty0 a3f26c9df0 Convert Dynamic Attention SDP to a global SDP option 2024-08-18 01:44:27 +03:00
Disty0 f2769c0449 ROCm flash atten fall back to sdpa with fp32 inputs 2024-07-23 01:13:55 +03:00
xedis 809444eb5c fix typo 2024-07-03 23:11:08 -07:00
Vladimir Mandic 16ab1a0af7 lint updates 2024-06-26 08:58:22 -04:00
Vladimir Mandic b036c2fc3b improve gc threshold 2024-06-21 12:57:17 -04:00
Disty0 092a326c09 Add torch_gc to state.nextjob, vae and upscale 2024-06-20 14:47:30 +03:00
Vladimir Mandic a1f53add94 fix typos 2024-06-16 17:00:35 -04:00
Vladimir Mandic 6d6f1de295 additional python 3.12 compatibility 2024-06-08 14:14:48 -04:00
Vladimir Mandic db9718eee6 add torch full deterministic mode 2024-06-07 09:26:51 -04:00
Vladimir Mandic d63f35e298 add cudaMallocAsync 2024-05-16 18:02:55 -04:00
Vladimir Mandic 40d5fdfdfd maybe fix slerp 2024-04-03 17:51:06 -04:00
Vladimir Mandic 834cb1b665 run fp16/bf16 test only once 2024-04-03 10:26:29 -04:00
Vladimir Mandic 9873178897 add extra_network_reference setting, refactor geninfo parser 2024-03-11 11:15:51 -04:00
Disty0 06149c4a41 ROCm add Flash Attention support 2024-03-10 00:11:50 +03:00
Vladimir Mandic ee7517dfb8 expose sdp options 2024-02-19 08:29:24 -05:00
Vladimir Mandic 1b3028b667 minor update 2024-02-15 09:13:34 -05:00
Vladimir Mandic d5a4f43f43 post release jumbo update 2024-02-08 12:10:32 -05:00
Disty0 0f829b2d04 Make OpenVINO compatible with IPEX venv 2024-01-08 03:00:18 +03:00
Disty0 8f70b7d08c Add DISABLE_VENV_LIBS env variable 2024-01-08 02:04:55 +03:00
Vladimir Mandic 17b30a320e enable batched taesd 2024-01-03 10:38:30 -05:00
Disty0 068f0a7d71 Return CPU device with OpenVINO on MAC 2024-01-02 20:20:36 +03:00
Vladimir Mandic 70bfe4ced8 enable gc on ram threshold 2023-12-31 08:15:22 -05:00
Vladimir Mandic 439542d3df redesign profiler 2023-12-03 11:27:24 -05:00
Disty0 bd141bbfeb IPEX decrease Torch GC Threshold to 80 2023-11-21 16:47:35 +03:00
Seunghoon Lee 36bef98cd5 Show device information log for DirectML. 2023-10-17 12:24:27 +09:00
Vladimir Mandic 2ec797472b add hypertile 2023-10-06 16:10:56 -04:00
Disty0 dc31dcbc1c Cleanup 2023-09-29 18:29:44 +03:00
Disty0 72a33d5247 Update device logging 2023-09-29 18:28:04 +03:00
Disty0 6184a8cb5c IPEX and DML fix Cuda error 2023-09-28 20:51:11 +03:00
Disty0 21d53b6ac8 Cleanup 2023-09-28 19:53:52 +03:00
Disty0 7a3c1da954 Add OpenVINO device logging 2023-09-28 19:33:14 +03:00
Disty0 5edf481c8d Add Torch GC threshold slider 2023-09-28 14:38:22 +03:00
Vladimir Mandic 0afcfe6097 logger early init 2023-09-23 23:44:34 -04:00
Disty0 550b7056ac IPEX fix SDPA and reduce torch_gc force to 90% 2023-09-18 15:36:14 +03:00
Vladimir Mandic 484dae8dbd upgrade diffusers 2023-09-14 09:38:17 -04:00
Vladimir Mandic 76c444fbc8 cleanup 2023-09-13 11:48:13 -04:00
Vladimir Mandic f8fcb6f853 fix original hires non-latent 2023-09-10 18:30:20 -04:00
Vladimir Mandic 250d1bf2fb update hints 2023-09-10 13:05:31 -04:00
Disty0 34ee67477e Fix BF16 and FP32 logging 2023-09-08 23:49:49 +03:00
Vladimir Mandic 29d88cf557 cleanup logging 2023-09-08 13:29:33 -04:00
Vladimir Mandic f36c1eb476 jumbo patch 2023-09-08 13:01:20 -04:00
Vladimir Mandic 8fd96d0f30 catch directml and ipex initialization errors 2023-09-07 07:27:54 -04:00
Vladimir Mandic df65df3f36 minor fixes 2023-08-30 09:45:47 -04:00
Vladimir Mandic 48c0ce9b2b fix model lookups 2023-08-27 08:01:29 +00:00
Vladimir Mandic 6a4d4ea5b7 update logging and model hashing 2023-08-22 18:28:09 +00:00
Disty0 f9718f068c Separate OpenVINO from IPEX 2023-08-19 17:52:15 +03:00
Disty0 209f9a19c6 IPEX fixes 2023-08-16 18:56:50 +03:00
Vladimir Mandic 88fff06c9e downgrade warn to info 2023-08-13 10:58:02 +00:00
Vladimir Mandic 4234555566 update 2023-08-05 14:37:09 +00:00
Seunghoon Lee 0f44332e5c Make sequential CPU offload available for non-CUDA
Add settings override for DirectML.
Move `devices.set_cuda_params()` to correct line.
2023-07-28 23:11:57 +09:00
Disty0 f38d5a91bf Move ipex fixes into its own folder 2023-07-28 10:58:45 +03:00
Disty0 38dcca7399 ipex cleanup 2023-07-28 01:35:02 +03:00
Nuullll 6acb3ef131 [IPEX] Fix batch_norm for Tiled VAE
Tiled VAE invokes `torch.nn.functional.batch_norm` without providing the
`weight` and `bias` parameters, so the torch backend creates default empty
tensors for them but bails out with a "tensor does not have a device" error.

This patch overrides the `weight` and `bias` parameters to all-ones and
all-zeros if they are `None`.
2023-07-27 22:56:19 +08:00
Seunghoon Lee 43b9c52bd4 Fix image corruption in half mode with embeddings.
(DirectML)
2023-07-25 17:02:55 +09:00
Seunghoon Lee d4aa840a77 DirectML update.
DirectML reuses GPU memory instead of returning it,
so this also prints "practical" GPU memory utilization.
2023-07-24 16:10:01 +09:00
Vladimir Mandic b31fa98669 fixes... 2023-07-21 09:28:02 -04:00
Disty0 7959dbceac ipex fix cuda error with openpose 2023-07-21 15:28:07 +03:00
Disty0 57d1d3ed16 Fix Kandinsky safety_checker and compile 2023-07-20 14:29:15 +03:00
Disty0 88794e3724 ipex fix cuda error when using pin memory 2023-07-17 17:14:13 +03:00
Vladimir Mandic e2b33b81d3 fix diffusers samplers 2023-07-15 22:40:03 -04:00
Disty0 f773c782fa ipex cleanup 2023-07-16 01:39:40 +03:00
Seunghoon Lee 0a52c44e73 DirectML rework & provide GPU memory usage (AMD only). 2023-07-15 18:55:38 +09:00
Disty0 14d1136fe7 Fix ipex memstats 2023-07-14 18:09:07 +03:00
Disty0 2a9133bfec IPEX rework 2023-07-14 17:33:24 +03:00
Disty0 c3a4293f22 Disable torch_gc for IPEX in WSL2 2023-07-12 13:02:42 +03:00
Disty0 2bce86a50a Replace empty_cache with torch_gc 2023-07-12 12:45:21 +03:00
Disty0 562ca33275 Fix Diffusers _conv_forward dtype error with IPEX 2023-07-12 02:03:45 +03:00
Vladimir Mandic db30f5faec update changelog 2023-07-08 14:22:51 -04:00
Vladimir Mandic 2a21196061 Merge branch 'master' into dev 2023-07-08 13:35:25 -04:00
Vladimir Mandic 89a7ea6a3f overall quality fixes 2023-07-08 09:49:41 -04:00
Disty0 205b516487 Fix diffusers_sdxl on ipex 2023-07-07 22:41:26 +03:00
Disty0 3bcca6f92b Patch torch.Generator again 2023-07-06 02:51:27 +03:00
Disty0 422c60c787 Patch torch.Generator 2023-07-05 20:49:39 +03:00
Disty0 99284ff020 Cleanup 2023-07-05 12:43:15 +03:00
Disty0 a62d9b0ca4 Cleanup 2023-07-05 12:39:34 +03:00
Nuullll 860bf8e2bf [IPEX] Support SDE samplers
This is a workaround since the `torch.Generator()` API doesn't support the
`xpu` backend at the moment, so it is replaced with the
`torch.xpu.Generator()` API provided by IPEX.
2023-07-05 15:48:58 +08:00
Disty0 45d50bd106 Remove cpu=xpu with ipex 2023-07-05 00:12:07 +03:00
Disty0 966eed8dd9 Autodetect IPEX 2023-07-04 23:37:36 +03:00
Vladimir Mandic b216a35ddd update diffusers and extra networks 2023-07-04 09:28:48 -04:00
Vladimir Mandic 2a41bf1406 fix styles 2023-06-27 09:04:42 -04:00
Disty0 102503a3a4 Fix ControlNet and change to sub-quad on ipex 2023-06-27 15:17:13 +03:00