Epsilon-greedy exploration policy
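
The rule named above can be illustrated with a minimal sketch, assuming a discrete action set and a list of estimated action values. The function name `epsilon_greedy` and the parameters `q_values`, `epsilon`, and `rng` are hypothetical and chosen only for illustration; they do not come from the source.

```python
import random
from typing import Optional, Sequence


def epsilon_greedy(q_values: Sequence[float], epsilon: float,
                   rng: Optional[random.Random] = None) -> int:
    """Return an action index: random with probability epsilon, else greedy.

    All names here are illustrative assumptions, not taken from the source.
    """
    if not q_values:
        raise ValueError("q_values must be non-empty")
    if not 0.0 <= epsilon <= 1.0:
        raise ValueError("epsilon must lie in [0, 1]")
    rng = rng or random
    # Explore: with probability epsilon, pick uniformly among all actions.
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    # Exploit: otherwise pick the action with the highest estimated value
    # (ties go to the lowest index).
    return max(range(len(q_values)), key=lambda a: q_values[a])
```

With `epsilon = 0.0` the sketch always returns the greedy action, and with `epsilon = 1.0` it samples uniformly from all actions. Passing a seeded `random.Random` instance as `rng` makes the choices reproducible.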