119 Commits (cfcfa4e4d28dcce91eb029c8a70d01042161403f)

Author SHA1 Message Date
Disty0 51c46339d6 IPEX fix Flux and HiDream 2025-04-11 22:17:18 +03:00
Disty0 90f887ac4a Add dim checks to ck flash atten and fix dim check on dyn atten 2025-03-25 03:50:21 +03:00
Disty0 4fce7c279e IPEX add dg2 to arch list 2025-03-14 18:56:03 +03:00
Disty0 f2409d38c4 IPEX use get_arch_list from torch.xpu 2025-03-14 18:52:13 +03:00
Disty0 cd9cb51a05 IPEX fix torch.compile on PyTorch 2.8 2025-03-14 02:03:33 +03:00
Disty0 1974172296 IPEX fix triton 2025-03-13 23:53:04 +03:00
Disty0 953a15a218 IPEX fix PyTorch 2.7 compatibility 2025-03-11 16:35:48 +03:00
Disty0 67f495454c IPEX fix torch.cuda.device 2025-03-06 22:55:03 +03:00
Disty0 88a2f95fe4 Fix IPEX 2.3 x2 2025-03-06 20:36:53 +03:00
Disty0 04f61d8355 IPEX fix PyTorch 2.3 2025-03-06 20:24:22 +03:00
Disty0 0e01c1b9f0 IPEX fix UntypedStorage 2025-03-06 19:49:08 +03:00
Disty0 955acaa149 IPEX fix dtype mismatch on tensor.to when input is a tensor 2025-03-05 23:02:49 +03:00
Disty0 6f68f91efe IPEX cleanup hijacks and add torch.eye 2025-03-05 22:27:29 +03:00
Disty0 5d18bd6fe3 IPEX fix Flux 2025-02-27 23:52:25 +03:00
Disty0 bb8368e52e Revert torch.xpu.mem_get_info 2025-02-14 18:04:50 +03:00
Disty0 1ffa12430c IPEX use native torch.xpu.mem_get_info 2025-02-14 17:54:17 +03:00
Disty0 e193a92fe1 IPEX fix torch.cuda.synchronize 2025-01-31 18:46:07 +03:00
Disty0 50f73c9553 Update ipex dyn atten defaults 2025-01-29 00:43:22 +03:00
Disty0 b0ecdf2f1c Better dyn atten sdpa usage estimation logic 2025-01-26 16:57:26 +03:00
Disty0 a21bc9d9cd Make IPEX dyn atten care only about the sdpa memory usage 2025-01-26 04:45:05 +03:00
Disty0 d266143f20 Cleanup 2025-01-25 21:46:23 +03:00
Disty0 02ecc6710d Fix missing unsqueeze 2025-01-25 21:43:38 +03:00
Disty0 a770b1c888 More correct Dynamic Atten SDPA implementation and deprecate IPEX Diffusers attention 2025-01-25 21:33:42 +03:00
Disty0 af35296a68 IPEX 4GB alloc detection and log driver version 2025-01-22 18:15:25 +03:00
Disty0 9f3a34d0a3 IPEX enable FP64 emulation 2025-01-22 17:02:22 +03:00
Disty0 facfe2be6b IPEX make IPEX_FORCE_ATTENTION_SLICE=-1 disable Dynamic Atten 2025-01-18 22:10:34 +03:00
Disty0 042f5f9fed IPEX disable interpolate cpu fallback with IPEX 2.5 2024-12-22 03:04:40 +03:00
Disty0 1649e4e3c4 IPEX dupe conv2d fix for conv1d and conv3d too 2024-12-21 01:25:49 +03:00
Disty0 fc63de6958 IPEX enable empty_cache fix for WSL on PyTorch 2024-12-17 22:43:30 +03:00
Disty0 7d7bcb9684 Fix balanced offload with Cascade 2024-12-13 23:01:53 +03:00
Disty0 8a6e5617ba Fix IPEX 2.5 2024-12-13 22:05:13 +03:00
Disty0 7a213fe69a IPEX fix Flux 2024-12-13 00:12:40 +03:00
Disty0 85c3a76c1a Dynamic Atten use expand instead of repeat 2024-10-25 21:06:46 +03:00
Disty0 7d56d5b72c Dynamic atten don't repeat the last shape of atten masks 2024-10-25 13:45:20 +03:00
Disty0 3e268e7cea Cleanup 2024-10-24 21:53:33 +03:00
Disty0 3195e8ad1f Dynamic Atten fix OmniGen 2024-10-24 21:22:06 +03:00
Disty0 1d51ae3baa Intel fix torch.compile 2024-10-18 14:46:43 +03:00
Disty0 702846c75c Intel fix Triton 2024-10-18 14:01:54 +03:00
Disty0 065155f5e6 IPEX fixes 2024-10-18 01:12:28 +03:00
Disty0 6c11002420 PyTorch 2.5 XPU support 2024-10-17 23:11:52 +03:00
Vladimir Mandic 0c54c235cb add sageattention (Signed-off-by: Vladimir Mandic <mandic00@live.com>) 2024-10-12 15:43:34 -04:00
Disty0 f62dddf3ca Add Cascade APG 2024-10-04 20:51:04 +03:00
Disty0 4de0480d9f Upcast bf16 fftn to fp32 2024-09-23 22:53:40 +03:00
Disty0 31b66cfa4a IPEX fix FreeU 2024-09-23 21:43:01 +03:00
Disty0 91bdd3b384 fix return xpu with torch.device 2024-09-10 19:20:49 +03:00
Disty0 586e5384b5 Update IPEX to 2.3 on Linux 2024-09-10 19:13:42 +03:00
Disty0 5e1da44bba IPEX fix custom_fwd x2 2024-08-18 05:22:11 +03:00
Disty0 b862400ea2 IPEX fix AMP custom_fwd 2024-08-18 05:11:00 +03:00
Disty0 26d1d42b06 IPEX update interpolate hijack 2024-08-12 10:47:00 +03:00
Disty0 dcedcbae9d IPEX fixes 2024-08-12 03:01:46 +03:00