We demonstrate that AutoRound achieves state-of-the-art (SOTA) or near-SOTA accuracy under INT4 (W4A4) quantization, where both weights and activations are quantized to 4 bits.
Check out the accuracy data at https://github.com/intel/auto-round/blob/main/docs/int4_acc.md
This capability is currently a research-only feature: models cannot yet be exported for production use.