Weight initialization plays a significant role in training deep feedforward neural networks. Xavier Glorot and Yoshua Bengio showed that initializing weights from a standard normal distribution (mean 0, variance 1) contributes to unstable gradients: as signals propagate through many layers, activations saturate and gradients can vanish or explode. New initialization schemes were introduced to address this. This video discusses these methods, how they differ, and the activation functions each works best with.
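As a concrete illustration, here is a minimal NumPy sketch of two of the schemes the video covers: Glorot (Xavier) uniform initialization, commonly paired with tanh or sigmoid activations, and He (Kaiming) normal initialization, commonly paired with ReLU. The function names and the `rng` parameter are my own choices for this sketch, not part of any particular library's API.

```python
import numpy as np

def glorot_uniform(fan_in, fan_out, rng):
    # Glorot/Xavier uniform: sample from U(-limit, limit) with
    # limit = sqrt(6 / (fan_in + fan_out)), so the variance of
    # activations and gradients stays roughly constant across layers.
    limit = np.sqrt(6.0 / (fan_in + fan_out))
    return rng.uniform(-limit, limit, size=(fan_in, fan_out))

def he_normal(fan_in, fan_out, rng):
    # He/Kaiming normal: std = sqrt(2 / fan_in), which compensates
    # for ReLU zeroing out roughly half of the activations.
    std = np.sqrt(2.0 / fan_in)
    return rng.normal(0.0, std, size=(fan_in, fan_out))

rng = np.random.default_rng(0)
W1 = glorot_uniform(256, 128, rng)
W2 = he_normal(256, 128, rng)
print(W1.shape)                 # (256, 128)
print(abs(W1).max() <= np.sqrt(6.0 / (256 + 128)))  # True
```

Compare the resulting weight scales: both schemes shrink the spread of the initial weights as the layer gets wider, which is exactly what keeps the gradient magnitudes stable in deep networks.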