For this practice exercise I'm only supposed to use NumPy, so I can't just use scikit-learn.
I've loaded the data set and managed to split it into positive and nega