notesum.ai
Published at December 4A Granger-Causal Perspective on Gradient Descent with Application to Pruning
cs.LG
Released Date: December 4, 2024
Authors: Aditya Shah1, Aditya Challa2, Sravan Danda2, Archana Mathur3, Snehanshu Saha2
Aff.: 1Google Search, Google Austin, Texas, USA; 2APPCAIR and CS&IS, BITS Pilani KK Birla Goa Campus, India; 3Nitte Meenakshi Institute of Technology, Yelahanka, Bangalore, India
| Name | Explanation | Typical Values |
| Number of epochs for training the network after which the pruning is performed | 5-10 | |
| Number of iterations of pruning to be performed | 2-10 | |
| Number of epochs for training the network to collect the data required for causal pruning | 2-10 | |
| Number of epochs for training the network after pruning to evaluate the performance | 100-300 | |
| L1_coeff | The regularization parameter used for LassoRegression | to (log space) |