Abstract: The strong lottery ticket hypothesis (LTH) postulates that any neural network can be approximated by simply pruning a sufficiently larger network of random weights. Recent work establishes that the strong LTH is true if the random network to be pruned is a large poly-factor wider than the target one. This polynomial over-parameterization is at odds with experimental research that achieves good approximation by pruning networks that are only a small factor wider than the target one. In this talk, I will explain how we close this gap and offer an exponential improvement to the over-parameterization requirement. I will give a sketch of the proof that any target network can be approximated by pruning a random one that is only a logarithmic factor wider. This is possible by establishing a connection between pruning random ReLU networks and random instances of the weakly NP-hard SubsetSum problem. Our work points to a striking, universal phenomenon: neural network training is equivalent to pruning slightly overparameterized networks of random weights. I will conclude by sharing hints of a general framework indicating the existence of good pruned networks for a variety of activation functions and architectures, even in the case where both the initialization weights and the activations are binary.

Bio: Dimitris Papailiopoulos is an Assistant Professor of Electrical and Computer Engineering and Computer Sciences (by courtesy) at the University of Wisconsin-Madison, a faculty fellow of the Grainger Institute for Engineering, and a faculty affiliate at the Wisconsin Institute for Discovery.
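To give a concrete feel for the SubsetSum connection, here is a minimal Python sketch. It is an illustration only, not the construction from the talk or the underlying paper: it samples n random values and checks how closely arbitrary targets can be matched by some subset sum. The sample size n, the Uniform[-1, 1] distribution, and the target values are assumptions chosen for the demo; the takeaway is that the achievable error shrinks rapidly with n, which is why only logarithmically many random weights per target weight suffice.

```python
# Illustrative only: random subset-sum approximation, the combinatorial
# phenomenon behind pruning arguments of this kind. With n i.i.d.
# Uniform[-1, 1] samples, an arbitrary target in [-1, 1] can typically be
# matched by some subset sum with error that decays exponentially in n.
import itertools
import random

def best_subset_sum_error(samples, target):
    """Smallest |sum(subset) - target| over all subsets of `samples`."""
    best = abs(target)  # error of the empty subset
    for r in range(1, len(samples) + 1):
        for subset in itertools.combinations(samples, r):
            best = min(best, abs(sum(subset) - target))
    return best

random.seed(0)
n = 16  # number of random weights available to the pruner (demo value)
samples = [random.uniform(-1.0, 1.0) for _ in range(n)]
for target in (0.7, -0.33, 0.05):  # arbitrary "target network" weights
    err = best_subset_sum_error(samples, target)
    print(f"target = {target:+.2f}   best subset-sum error = {err:.2e}")
```

Note that the brute-force enumeration over all 2^n subsets is only feasible for small n; the existence argument sketched in the talk does not require finding the subset efficiently, only showing that a good one exists with high probability.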