Augmented Reality Rendering through Camera Pose Estimation
I rendered two virtual objects, a drill and a bottle, into a video by first estimating the camera pose from the AprilTag in the scene. The pose is computed with either a P3P (Perspective-3-Point, three point correspondences) or a PnP (Perspective-n-Point, n point correspondences) solver, both implemented from scratch rather than taken from OpenCV.
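To make the pose-estimation step concrete, here is a minimal sketch of one common way to solve PnP for a planar target such as a tag. It is not necessarily the method used in this project. It assumes the four tag corners lie in the world plane Z = 0, that the intrinsic matrix `K` is known, and that the pose maps world points to camera coordinates as `x_cam = R @ X_world + t`. The function names (`estimate_homography`, `pnp_planar`, `project`) and all numeric values are hypothetical and chosen only for illustration.

```python
import numpy as np

def estimate_homography(world_xy, pixels):
    """Direct linear transform: find H such that pixel ~ H @ [X, Y, 1]."""
    rows = []
    for (X, Y), (u, v) in zip(world_xy, pixels):
        rows.append([-X, -Y, -1, 0, 0, 0, u * X, u * Y, u])
        rows.append([0, 0, 0, -X, -Y, -1, v * X, v * Y, v])
    # The homography is the null vector of the stacked constraints.
    _, _, Vt = np.linalg.svd(np.asarray(rows, dtype=float))
    return Vt[-1].reshape(3, 3)

def pnp_planar(world_xy, pixels, K):
    """Recover (R, t) from coplanar points (Z = 0) via a homography."""
    H = estimate_homography(world_xy, pixels)
    M = np.linalg.inv(K) @ H           # proportional to [r1 r2 t]
    if M[2, 2] < 0:                    # keep the tag in front of the camera
        M = -M
    # Build an approximate rotation and project it onto SO(3) with an SVD.
    A = np.column_stack([M[:, 0], M[:, 1], np.cross(M[:, 0], M[:, 1])])
    U, _, Vt = np.linalg.svd(A)
    D = np.diag([1.0, 1.0, np.linalg.det(U @ Vt)])
    R = U @ D @ Vt
    t = M[:, 2] / np.linalg.norm(M[:, 0])  # undo the homography scale
    return R, t

def project(points_world, R, t, K):
    """Project Nx3 world points to Nx2 pixels with the estimated pose."""
    cam = points_world @ R.T + t
    pix = cam @ K.T
    return pix[:, :2] / pix[:, 2:3]

# Hypothetical setup: a 14 cm tag centred at the world origin.
K = np.array([[800.0, 0.0, 320.0], [0.0, 800.0, 240.0], [0.0, 0.0, 1.0]])
corners = np.array([[-0.07, -0.07], [0.07, -0.07], [0.07, 0.07], [-0.07, 0.07]])

# Synthetic ground-truth pose, used only to generate example pixel corners.
a = 0.3
R_true = np.array([[np.cos(a), 0.0, np.sin(a)],
                   [0.0, 1.0, 0.0],
                   [-np.sin(a), 0.0, np.cos(a)]])
t_true = np.array([0.05, -0.02, 0.6])
corners_3d = np.column_stack([corners, np.zeros(4)])
pixels = project(corners_3d, R_true, t_true, K)

R_est, t_est = pnp_planar(corners, pixels, K)
```

Once `R_est` and `t_est` are known, the same `project` helper can map the vertices of a virtual mesh, expressed in the tag's world frame, into the image for rendering. With noisy detections, a nonlinear refinement of the reprojection error is usually added on top of this closed-form estimate.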
The code can be found here.