Benefits of stochastic weight averaging in developing neural network radiation scheme for numerical weather prediction