nunchaku

Nunchaku is a high-performance inference engine optimized for 4-bit neural networks.