Tesla P4 And P40 GPUs Boost Deep Learning Inference Performance With INT8, TensorRT Support
Nvidia continues to beat on deep learning GPUs with the release of two new “inference” GPUs, the Tesla P4 and the Tesla P40. The pair are the 16nm FinFET direct successors to Tesla M4 and M40, with much improved performance and support for 8-bit (INT8) operations. Deep learning consists of two steps: training and inference. […]