How LSTMs solve the vanishing gradient problem