I have some categorical data where the x-value is categorical and y-value is numerical. There are chances the y-values will be the same for multiple data-points and if the x