Commit Graph

29 Commits (77e0eecf58c0896121fbafa77e8a8a069657c765)

Author SHA1 Message Date
Disty0 74b6edf2df revert gfx1101 2025-06-11 19:25:05 +03:00
Disty0 6aa5c08fb0 Cleanup and update changelog 2025-06-11 16:03:33 +03:00
Disty0 71be3c7d45 ROCm don't override gfx with gfx1100 and gfx1101 + rocm 6.4 2025-06-11 15:47:25 +03:00
Disty0 df6b13ea47 Don't set gfx override with RX 9000 and above 2025-06-11 15:09:03 +03:00
Disty0 bd2d9d1677 Python 3.13 support 2025-06-09 22:58:08 +03:00
Disty0 748bdbf437 Update rocm flash attention repo with navi rotary fix 2025-04-08 00:15:57 +03:00
Seunghoon Lee 0eaa2c0378
rocm wsl better arch detection 2025-04-01 21:14:02 +09:00
Disty0 dc551bfd85 Set blaslt_tensile_libpath to empty string instead of None 2025-02-15 18:04:42 +03:00
Seunghoon Lee 13e9f4f3e1
flash attn triton is already merged into main 2025-02-14 23:21:56 +09:00
Disty0 f94196bcd1 Rename ROCm Flash atten hijack to CK Flash atten and enable AOTriton memory and flash atten by default 2025-02-13 22:01:06 +03:00
Seunghoon Lee 389bb78b8e
Fix bug. 2025-01-11 23:41:17 +09:00
Seunghoon Lee e43d8e9448
check hipblaslt availability in windows 2025-01-11 23:36:51 +09:00
Seunghoon Lee d597d5912d
use bitmasking for agent detection 2024-10-26 13:40:54 +09:00
Vladimir Mandic e0d702a3dd add gfx autodetect options
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-10-07 11:38:10 -04:00
Seunghoon Lee 470ea10e95
fix hipinfo 2024-09-27 10:24:55 +09:00
Seunghoon Lee b74af75622
add more known devices 2024-09-27 09:42:13 +09:00
Seunghoon Lee 96922d70a7
rocm&zluda handle apu 2024-09-26 13:34:11 +09:00
Seunghoon Lee 1395f5bf9e
add triton backend flash-attn (experimental) 2024-09-24 12:53:18 +09:00
Seunghoon Lee e246e55734
rocm install flash-attn if needed 2024-09-24 12:27:43 +09:00
Seunghoon Lee 9c4213e4a8
tcmalloc experiment 2024-08-09 16:11:12 +09:00
Seunghoon Lee 864263f570
accurate wsl check 2024-08-09 10:38:28 +09:00
Seunghoon Lee c490b9cb51
fix first launch 2024-08-09 00:45:12 +09:00
Seunghoon Lee 61961f5d6d
hipblaslt check torch version 2024-08-09 00:35:46 +09:00
Seunghoon Lee 4492dedf16
fix hip path detection 2024-07-28 00:55:40 +09:00
Seunghoon Lee bdf6501f40
fix 2024-07-28 00:34:54 +09:00
Seunghoon Lee 5a9474180d
rocm.py 2024-07-28 00:00:19 +09:00
Seunghoon Lee 25c3c6107e
fix 2024-07-22 17:04:44 +09:00
Seunghoon Lee 7fff2a76cd
fix linux 2024-07-22 15:33:04 +09:00
Seunghoon Lee 0284c77ae5
refactor rocm & zluda 2024-07-22 15:29:49 +09:00