automatic/modules/sdnq
Disty0 33726b5019 Update triton mm config 2026-04-16 20:51:10 +03:00
layers           SDNQ add nn.Embedding quantization support and add Gemma4 keys  2026-04-10 18:46:58 +03:00
packed_int       SDNQ add ufp aliases                                            2026-03-27 21:31:36 +03:00
__init__.py      pull sdnq version from .common                                  2025-11-28 01:10:05 +03:00
common.py        SDNQ update RDNA2 detection                                     2026-04-16 08:34:17 +03:00
dequantizer.py   SDNQ add 15, 13, 11 and 9 bit support                           2026-03-11 03:39:02 +03:00
file_loader.py   RUF013 updates                                                  2026-03-24 05:48:19 -07:00
forward.py       SDNQ add nn.Embedding quantization support and add Gemma4 keys  2026-04-10 18:46:58 +03:00
loader.py        SDNQ add fp8 mm info to modules_quant_config                    2026-04-16 00:18:42 +03:00
packed_float.py  SDNQ add 15, 13, 11 and 9 bit support                           2026-03-11 03:39:02 +03:00
quantizer.py     SDNQ fix post load quant                                        2026-04-16 00:38:54 +03:00
triton_mm.py     Update triton mm config                                         2026-04-16 20:51:10 +03:00