Commit Graph

6 Commits (4e74172f3a25e9993d01686c5ecc3a62a27192b5)

Author SHA1 Message Date
Disty0 bbb345cf44 Fix bias dtype mismatch 2025-08-30 02:31:41 +03:00
Disty0 6c36433a14 SDNQ fix row-wise FP8 matmul with fp32 and fp16 inputs 2025-08-30 02:27:15 +03:00
Disty0 8460be662c SDNQ use inplace transpose and use view instead of reshape 2025-08-17 05:07:55 +03:00
Disty0 9992338187 sdnq fix convs 2025-08-11 23:24:13 +03:00
Disty0 dc7b25d387 Cleanup SDNQ and add SDNQ_USE_TENSORWISE_FP8_MATMUL env var 2025-08-11 14:50:17 +03:00
Disty0 c3d007b02c SDNQ split forward.py into layers and cleanup 2025-08-02 17:36:55 +03:00