Do you think it is possible to obtain both the image and the depth information of the objects in a picture from a single shot with a conventional camera (without using stereo vision)?
It is now possible using coded apertures :)
In the following paper I briefly explain how this is possible:
IMAGE AND DEPTH FROM A CONVENTIONAL CAMERA USING CODED APERTURE
And here are the slides of my presentation: PowerPoint Slides Presentation