Abstract: With the explosive growth in the number of parameters in deep neural networks (DNNs), sparsity-centric algorithm and hardware designs have become critical for low-latency AI serving systems.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results