Understanding categorical cross entropy loss