Commit Graph

8 Commits (4ecec822e0f75dafc19380832a44c8ade3c68a80)

Author  SHA1        Message                                            Date
Disty0  9e52d0c1fb  SDNQ add SVDQuant quantization method              2025-10-05 22:50:30 +03:00
Disty0  99113947bf  SDNQ add RDNA2 INT8 support via Triton             2025-10-04 18:31:25 +03:00
Disty0  54acf1760b  Make SDNQ scales compatible with balanced offload  2025-10-03 18:13:55 +03:00
Disty0  c5cab96223  SDNQ simplify check_mats                           2025-10-03 02:58:17 +03:00
Disty0  03382bdd4c  SDNQ simplify check_mats                           2025-10-01 01:35:51 +03:00
Disty0  0c1d34721c  SDNQ use contiguous for intel                      2025-09-30 02:37:58 +03:00
Disty0  6b67a9d0c4  SDNQ add check_mats to matmul                      2025-09-30 01:58:13 +03:00
Disty0  c3d007b02c  SDNQ split forward.py into layers and cleanup      2025-08-02 17:36:55 +03:00