-- Nov 2 In-Class Exercise Thread
Our First Layer was
Conv2D(64, (3,3), padding="same", input_shape=(64, 64, 1))
Input m: 64 * 64 = 4096
Output n: No. of filters * Output shape = 64 * (64 * 64) = 262144
Therefore, using Glorot and Bengio initialization
`W_(i,j) ~ U (-(6/(4096+262144))^0.5, (6/(4096+262144))^0.5)`
`W_(i,j) ~ U (-0.00475, 0.00475)`
(
Edited: 2021-11-03)
Our First Layer was
Conv2D(64, (3,3), padding="same", input_shape=(64, 64, 1))
Input m: 64 * 64 = 4096
Output n: No. of filters * Output shape = 64 * (64 * 64) = 262144
Therefore, using Glorot and Bengio initialization
@BT@W_(i,j) ~ U (-(6/(4096+262144))^0.5, (6/(4096+262144))^0.5)@BT@
@BT@W_(i,j) ~ U (-0.00475, 0.00475)@BT@