Commit Graph

28 Commits (3af9d3bb501c1ef34bb9a2fa0df9bb75bc3830e2)

Author SHA1 Message Date
Seunghoon Lee 6f7fe5e7cc fix directml generator. 2024-09-20 10:09:00 +09:00
Disty0 e3b087b6c0 Add balanced offload mode and make offload modes a single choice list 2024-08-11 17:27:30 +03:00
Seunghoon Lee ff2c1db1cc
fix code formatting under modules/dml 2024-02-05 22:43:10 +09:00
Vladimir Mandic 1de3c6af0f add amused model support and update reference list 2023-12-27 10:15:36 -05:00
Vladimir Mandic 564d04d9f4
Merge pull request #2296 from vladmandic/master
update dml
2023-10-08 07:26:55 -04:00
Seunghoon Lee e5f8b7f0a4
update dml 2023-10-08 02:19:56 +09:00
Vladimir Mandic 0afcfe6097 logger early init 2023-09-23 23:44:34 -04:00
Vladimir Mandic 496bdf7c55 fix backend switch 2023-09-15 08:39:25 -04:00
Vladimir Mandic 8fd96d0f30 catch directml and ipex initialization errors 2023-09-07 07:27:54 -04:00
Hameer Abbasi 4c7792ed01
Add pre-commit. 2023-09-04 13:57:47 +02:00
Seunghoon Lee d711880aa9
New option for DirectML: memory stats provider.
1. Performance Counter.
    Get vram size allocated to & used by python.exe from pdh.dll.
    Generation can be slower than atiadlxx.
    Use memory less greedy then atiadlxx.
    Windows only.
2. atiadlxx.
    Get max vram size and available vram size from AMD GPU driver (atiadlxx.dll).
    Use memory more greedy than Performance Counter.
    Windows & WSL are supported.
3. None.
    Assume available vram size is 8GB.
    Use memory regardless of current vram usage.
2023-08-01 01:58:04 +09:00
Seunghoon Lee 42c6147ac8
cleanup 2023-07-29 13:48:28 +09:00
Seunghoon Lee 47f2f50574
Restore Python 3.9 compatibility. (DirectML) 2023-07-29 12:08:22 +09:00
Seunghoon Lee 77de9cd093
Fix medvram with DirectML. 2023-07-28 23:18:28 +09:00
Seunghoon Lee 0f44332e5c
Make sequential CPU offload available for non-CUDA
Add settings override for DirectML.
Move `devices.set_cuda_params()` to correct line.
2023-07-28 23:11:57 +09:00
Seunghoon Lee d4aa840a77
DirectML update.
DirectML reuses GPU memory instead of returning it.
So prints "practical" GPU memory utilization too.
2023-07-24 16:10:01 +09:00
Seunghoon Lee 0a52c44e73
DirectML rework & provide GPU memory usage (AMD only). 2023-07-15 18:55:38 +09:00
Seunghoon Lee 578f206bf7
DirectML rework. 2023-07-15 16:19:38 +09:00
Vladimir Mandic 2a21196061
Merge branch 'master' into dev 2023-07-08 13:35:25 -04:00
Seunghoon Lee 99c7f23c89
remove useless methods. 2023-07-09 01:29:04 +09:00
Seunghoon Lee 993de932ab
Add an opts override for DirectML. 2023-07-06 01:35:00 +09:00
Vladimir Mandic cb307399dd jumbo merge 2023-06-13 11:59:56 -04:00
Vladimir Mandic 5134471bc8 dml autocast 2023-05-14 13:24:59 -04:00
Vladimir Mandic c46f0dbdeb fix 2023-05-13 12:55:50 -04:00
Vladimir Mandic d96ab6a1ae update directml 2023-05-13 11:21:11 -04:00
Seunghoon Lee d2d5011bd3
Implement memory estimation for AMDGPUs.
Stable.
2023-04-26 17:44:32 +09:00
Seunghoon Lee df0e89be48
fix.
Unstable & need more test.
2023-04-26 12:45:44 +09:00
Seunghoon Lee 09ae33cdf7
Implement torch.dml.
VERY UNSTABLE & NOT TESTED.
2023-04-26 12:21:44 +09:00