I'm currently working on video classification. I want to classify multiple frames of a video with a pretrained network (VGG16) and then average the results of the sing