automatic/modules/sdnq
Disty0 4a4784eafa SDNQ add new stack of custom floating point types and remove irrelevant qtypes from the ui list 2025-12-26 20:09:17 +03:00
layers SDNQ fix fp16 mm with fp8 weights and improve stochastic rounding performance 2025-12-09 17:41:29 +03:00
__init__.py pull sdnq version from .common 2025-11-28 01:10:05 +03:00
common.py SDNQ add new stack of custom floating point types and remove irrelevant qtypes from the ui list 2025-12-26 20:09:17 +03:00
dequantizer.py SDNQ add new stack of custom floating point types and remove irrelevant qtypes from the ui list 2025-12-26 20:09:17 +03:00
file_loader.py SDNQ remove forced uint4 from convs and cleanup 2025-12-13 01:32:52 +03:00
forward.py SDNQ de-couple matmul dtype and add fp16 matmul 2025-11-22 02:16:20 +03:00
loader.py SDNQ add new stack of custom floating point types and remove irrelevant qtypes from the ui list 2025-12-26 20:09:17 +03:00
packed_float.py SDNQ add new stack of custom floating point types and remove irrelevant qtypes from the ui list 2025-12-26 20:09:17 +03:00
packed_int.py SDNQ Improve UINT3 and below quant speed 2025-10-05 03:12:05 +03:00
quantizer.py SDNQ add new stack of custom floating point types and remove irrelevant qtypes from the ui list 2025-12-26 20:09:17 +03:00
triton_mm.py SDNQ remove forced uint4 from convs and cleanup 2025-12-13 01:32:52 +03:00