all, i am trying to create from scratch (without use of sklearn libs) to create 5 samples (len of df / 5) such that each one has the same proportion of target variable (1\'s