A colleague and I are currently trying to implement a variation of knowledge distillation in TensorFlow 2. Hinton wrote a paper on online knowledge distillation which we want t