To the best of my knowledge, cross-entropy is consistent with MLE (maximum likelihood estimation). Assume we have data from a binomial distribution; if we do MLE on the data, t
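
As a rough sketch of that claim, assuming i.i.d. 0/1 outcomes (the single-trial case of the binomial): the average binary cross-entropy is just the negative log-likelihood divided by the sample size, so both should be minimized by the same `p`. The data, the grid search, and the function names below are made up for illustration.

```python
import math

def bernoulli_nll(p, data):
    """Negative log-likelihood of i.i.d. 0/1 observations under Bernoulli(p)."""
    return -sum(math.log(p) if y == 1 else math.log(1.0 - p) for y in data)

def mean_cross_entropy(p, data):
    """Average binary cross-entropy between the labels and a constant prediction p."""
    total = sum(y * math.log(p) + (1 - y) * math.log(1.0 - p) for y in data)
    return -total / len(data)

# Hypothetical sample: 7 successes out of 10 trials.
data = [1, 0, 1, 1, 0, 1, 0, 1, 1, 1]

# Coarse grid search over p in (0, 1); a stand-in for a real optimizer.
grid = [i / 1000 for i in range(1, 1000)]

p_mle = min(grid, key=lambda p: bernoulli_nll(p, data))
p_ce = min(grid, key=lambda p: mean_cross_entropy(p, data))

print("argmin of negative log-likelihood:", p_mle)
print("argmin of mean cross-entropy:     ", p_ce)
print("sample mean:                      ", sum(data) / len(data))
```

Since the two objectives differ only by the constant factor `1 / len(data)`, the grid search picks the same `p` for both, and that value matches the sample mean, which is the closed-form MLE for a Bernoulli parameter.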