Hameer Abbasi
4c7792ed01
Add pre-commit.
2023-09-04 13:57:47 +02:00
Seunghoon Lee
d711880aa9
New option for DirectML: memory stats provider.
...
1. Performance Counter.
Gets the VRAM size allocated to and used by python.exe from pdh.dll.
Generation can be slower than with atiadlxx.
Uses memory less greedily than atiadlxx.
Windows only.
2. atiadlxx.
Gets the maximum and available VRAM sizes from the AMD GPU driver (atiadlxx.dll).
Uses memory more greedily than Performance Counter.
Windows & WSL are supported.
3. None.
Assumes the available VRAM size is 8 GB.
Uses memory regardless of current VRAM usage.
2023-08-01 01:58:04 +09:00
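The three providers described in the commit above differ mainly in where the free/total figures come from. The following is a minimal sketch of that selection logic, not the code from the commit: `MemoryStats`, `get_memory_stats`, and the `query` callable are hypothetical names, and the actual calls into pdh.dll or atiadlxx.dll are replaced by an injected function so the example runs anywhere.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple

GIB = 1024 ** 3
ASSUMED_VRAM_BYTES = 8 * GIB  # the "None" provider assumes 8 GB is available


@dataclass
class MemoryStats:
    free: int   # bytes the caller may still allocate
    total: int  # bytes the provider reports as the ceiling


def get_memory_stats(
    provider: str,
    query: Optional[Callable[[], Tuple[int, int]]] = None,
) -> MemoryStats:
    """Return memory stats for the chosen provider.

    `query` is a hypothetical stand-in for the platform call:
    - "Performance Counter": returns (allocated_to_process, used_by_process)
    - "atiadlxx":            returns (max_vram, available_vram)
    """
    if provider == "Performance Counter":
        allocated, used = query()
        # Only what is already allocated to the process counts as the budget.
        return MemoryStats(free=max(allocated - used, 0), total=allocated)
    if provider == "atiadlxx":
        total, available = query()
        # The driver reports device-wide figures, so the budget is larger.
        return MemoryStats(free=available, total=total)
    if provider == "None":
        # No measurement at all: ignore current usage and assume 8 GB.
        return MemoryStats(free=ASSUMED_VRAM_BYTES, total=ASSUMED_VRAM_BYTES)
    raise ValueError(f"unknown memory stats provider: {provider!r}")
```

Under these assumptions, the "Performance Counter" path is the conservative one because its budget is bounded by what the process already holds, while "None" never throttles below the assumed 8 GB.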
Seunghoon Lee
42c6147ac8
cleanup
2023-07-29 13:48:28 +09:00
Seunghoon Lee
47f2f50574
Restore Python 3.9 compatibility. (DirectML)
2023-07-29 12:08:22 +09:00
Seunghoon Lee
77de9cd093
Fix medvram with DirectML.
2023-07-28 23:18:28 +09:00
Seunghoon Lee
0f44332e5c
Make sequential CPU offload available for non-CUDA devices.
...
Add settings override for DirectML.
Move `devices.set_cuda_params()` to the correct line.
2023-07-28 23:11:57 +09:00
Seunghoon Lee
d4aa840a77
DirectML update.
...
DirectML reuses GPU memory instead of returning it,
so this also prints the "practical" GPU memory utilization.
2023-07-24 16:10:01 +09:00
Seunghoon Lee
0a52c44e73
DirectML rework & provide GPU memory usage (AMD only).
2023-07-15 18:55:38 +09:00
Seunghoon Lee
578f206bf7
DirectML rework.
2023-07-15 16:19:38 +09:00
Vladimir Mandic
2a21196061
Merge branch 'master' into dev
2023-07-08 13:35:25 -04:00
Seunghoon Lee
99c7f23c89
remove useless methods.
2023-07-09 01:29:04 +09:00
Seunghoon Lee
993de932ab
Add an opts override for DirectML.
2023-07-06 01:35:00 +09:00
Vladimir Mandic
cb307399dd
jumbo merge
2023-06-13 11:59:56 -04:00
Vladimir Mandic
5134471bc8
dml autocast
2023-05-14 13:24:59 -04:00
Vladimir Mandic
c46f0dbdeb
fix
2023-05-13 12:55:50 -04:00
Vladimir Mandic
d96ab6a1ae
update directml
2023-05-13 11:21:11 -04:00
Seunghoon Lee
d2d5011bd3
Implement memory estimation for AMD GPUs.
...
Stable.
2023-04-26 17:44:32 +09:00
Seunghoon Lee
df0e89be48
fix.
...
Unstable & needs more testing.
2023-04-26 12:45:44 +09:00
Seunghoon Lee
09ae33cdf7
Implement torch.dml.
...
VERY UNSTABLE & NOT TESTED.
2023-04-26 12:21:44 +09:00