17 Commits (85cc55af51d8ae4e5bbae0a088f7aabcfd7a8111)

Author SHA1 Message Date
Seunghoon Lee d597d5912d
use bitmasking for agent detection 2024-10-26 13:40:54 +09:00
Vladimir Mandic e0d702a3dd
add gfx autodetect options (Signed-off-by: Vladimir Mandic <mandic00@live.com>) 2024-10-07 11:38:10 -04:00
Seunghoon Lee 470ea10e95
fix hipinfo 2024-09-27 10:24:55 +09:00
Seunghoon Lee b74af75622
add more known devices 2024-09-27 09:42:13 +09:00
Seunghoon Lee 96922d70a7
rocm&zluda handle apu 2024-09-26 13:34:11 +09:00
Seunghoon Lee 1395f5bf9e
add triton backend flash-attn (experimental) 2024-09-24 12:53:18 +09:00
Seunghoon Lee e246e55734
rocm install flash-attn if needed 2024-09-24 12:27:43 +09:00
Seunghoon Lee 9c4213e4a8
tcmalloc experiment 2024-08-09 16:11:12 +09:00
Seunghoon Lee 864263f570
accurate wsl check 2024-08-09 10:38:28 +09:00
Seunghoon Lee c490b9cb51
fix first launch 2024-08-09 00:45:12 +09:00
Seunghoon Lee 61961f5d6d
hipblaslt check torch version 2024-08-09 00:35:46 +09:00
Seunghoon Lee 4492dedf16
fix hip path detection 2024-07-28 00:55:40 +09:00
Seunghoon Lee bdf6501f40
fix 2024-07-28 00:34:54 +09:00
Seunghoon Lee 5a9474180d
rocm.py 2024-07-28 00:00:19 +09:00
Seunghoon Lee 25c3c6107e
fix 2024-07-22 17:04:44 +09:00
Seunghoon Lee 7fff2a76cd
fix linux 2024-07-22 15:33:04 +09:00
Seunghoon Lee 0284c77ae5
refactor rocm & zluda 2024-07-22 15:29:49 +09:00