|
layers
|
SDNQ fix Qwen loading
|
2025-10-11 00:05:09 +03:00 |
|
common.py
|
Add type checking to SDNQConfig
|
2025-10-12 01:02:47 +03:00 |
|
dequantizer.py
|
SDNQ add dequantize model
|
2025-10-12 00:00:53 +03:00 |
|
forward.py
|
cleanup
|
2025-08-02 17:41:53 +03:00 |
|
loader.py
|
SDNQ unset device specific configs on save
|
2025-10-11 19:24:09 +03:00 |
|
packed_int.py
|
SDNQ Improve UINT3 and below quant speed
|
2025-10-05 03:12:05 +03:00 |
|
quantizer.py
|
seedvt2
|
2025-10-12 15:35:08 -04:00 |
|
triton_mm.py
|
SDNQ add RDNA2 INT8 support via Triton
|
2025-10-04 18:31:25 +03:00 |