I\'m working on a binary classification problem with a large image dataset. I have annotations with the faces bounding boxes in a json file witch contains the image file pat