Here\'s some research I have done so far:
- I have used Google Vision API to detect various face landmarks.
Here\'s the reference: https://developers.google.com/vision/intro
You need to either predict x,y,z coordinates(check out this demo), either use x,y predictions but then find parameters of universal 3d-model & camera that will give the closest projection of current x,y.