I am trying to train a model that classifies images. The problem is that the images have different sizes. How should I format my images, or change my model architecture, to handle this?
Try adding a spatial pyramid pooling (SPP) layer after your last convolution layer, so that the FC layers always receive a fixed-length vector as input, whatever the image size. During training, resize every image in the dataset to one particular size for an epoch, then switch to a different size for the next epoch and continue training.
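Here is a rough sketch of that idea in PyTorch, in case it helps. It is not a reference implementation: the class names `SpatialPyramidPooling` and `SPPNet`, the pyramid levels (1, 2, 4), the tiny conv stack, the image sizes, and the random stand-in data are all assumptions made for illustration. The pooling itself relies on `torch.nn.functional.adaptive_max_pool2d`, which pools to a fixed output grid for any input height and width.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class SpatialPyramidPooling(nn.Module):
    """Pools a (N, C, H, W) feature map into a fixed-length vector."""

    def __init__(self, levels=(1, 2, 4)):  # assumed pyramid levels
        super().__init__()
        self.levels = levels

    def forward(self, x):
        n = x.size(0)
        # Each level pools to a k x k grid regardless of H and W,
        # then flattens to (N, C * k * k).
        pooled = [F.adaptive_max_pool2d(x, k).reshape(n, -1) for k in self.levels]
        return torch.cat(pooled, dim=1)


class SPPNet(nn.Module):
    """Hypothetical small classifier: conv layers -> SPP -> FC."""

    def __init__(self, num_classes=10, levels=(1, 2, 4)):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=3, padding=1), nn.ReLU(), nn.MaxPool2d(2),
            nn.Conv2d(16, 32, kernel_size=3, padding=1), nn.ReLU(),
        )
        self.spp = SpatialPyramidPooling(levels)
        # 32 channels times (1 + 4 + 16) bins, independent of image size.
        spp_dim = 32 * sum(k * k for k in levels)
        self.classifier = nn.Linear(spp_dim, num_classes)

    def forward(self, x):
        return self.classifier(self.spp(self.features(x)))


def train_multi_size(model, sizes=(64, 96), epochs=2, batch=4, num_classes=10):
    """One image size per epoch, cycling through `sizes`. Uses random data."""
    opt = torch.optim.SGD(model.parameters(), lr=0.01)
    losses = []
    for epoch in range(epochs):
        size = sizes[epoch % len(sizes)]
        # Stand-in for a real DataLoader that resizes images to `size`.
        images = torch.randn(batch, 3, size, size)
        labels = torch.randint(0, num_classes, (batch,))
        loss = F.cross_entropy(model(images), labels)
        opt.zero_grad()
        loss.backward()
        opt.step()
        losses.append(loss.item())
    return losses


model = SPPNet()
print(model(torch.randn(2, 3, 64, 64)).shape)   # same output shape...
print(model(torch.randn(2, 3, 50, 70)).shape)   # ...for a different input size
```

Within a single batch the images still need to share one size so they can be stacked into a tensor, which is why the size changes per epoch rather than per image.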