Because the dynamic network pruning algorithm will not reduce the actual scale of the network model, and there will be no obvious reasoning acceleration when running on GPU, it is necessary to design a specific hardware accelerator for the dynamic network pruning algorithm, and hardware some steps according to the flow details of the algorithm, so that the deep neural network using the dynamic pruning algorithm can realize obvious reasoning acceleration on the mobile side.
正在翻译中..