A Survey of Quantization Methods for Efficient Neural Network Inference, Yu Wang, Zhaohui Li, Xiaoming Liu, Xiaofeng Shi, Hongyan Liu, Xin Li, 2023. ACM Computing Surveys, Vol. 55 (Association for Computing Machinery). DOI: 10.1145/3547348 - Comprehensive overview of quantization techniques for neural networks, relevant for different bit depths and post-training methods.