automatic/modules/sdnq
Disty0 471b6dc1b7 SDNQ add siglip_embedder to ZImage skip keys 2025-12-23 04:32:54 +03:00
..
layers SDNQ fix fp16 mm with fp8 weights and improve stochastic rounding performance 2025-12-09 17:41:29 +03:00
__init__.py pull sdnq version from .common 2025-11-28 01:10:05 +03:00
common.py SDNQ add siglip_embedder to ZImage skip keys 2025-12-23 04:32:54 +03:00
dequantizer.py SDNQ remove forced uint4 from convs and cleanup 2025-12-13 01:32:52 +03:00
file_loader.py SDNQ remove forced uint4 from convs and cleanup 2025-12-13 01:32:52 +03:00
forward.py SDNQ de-couple matmul dtype and add fp16 matmul 2025-11-22 02:16:20 +03:00
loader.py SDNQ remove forced uint4 from convs and cleanup 2025-12-13 01:32:52 +03:00
packed_int.py SDNQ Improve UINT3 and below quant speed 2025-10-05 03:12:05 +03:00
quantizer.py SDNQ remove forced uint4 from convs and cleanup 2025-12-13 01:32:52 +03:00
triton_mm.py SDNQ remove forced uint4 from convs and cleanup 2025-12-13 01:32:52 +03:00