Loss surface with multiple valleys
This post is a follow-up of 1. We start off with an eye-catching plot, representing the functioning of an optimiser using the stochastic gradient method. The plot is explained in more detail further below. Visualisation of a loss surface with multiple minima. The surface is in gray, the exemplary path …