Ashish Tiwari, Mihir Sutariya, and Shanmuganathan Raman
Computer Vision, Imaging, and Graphics (CVIG) Lab, IIT Gandhinagar, India
LIPIDS Framework
Abstract
Photometric stereo is a powerful method for obtaining per-pixel surface normals from differently illuminated images of an object. While several methods address photometric stereo with different image (or light) counts ranging from one to two to a hundred, very few focus on learning optimal lighting configuration. Finding an optimal configuration is challenging due to the vast number of possible lighting directions. Moreover, exhaustively sampling all possibilities is impractical due to time and resource constraints. Photometric stereo methods have demonstrated promising performance on existing datasets, which feature limited light directions sparsely sampled from the light space. Therefore, can we optimally utilize these datasets for illumination planning? In this work, we introduce LIPIDS - Learning-based Illumination Planning In Discretized light Space to achieve minimal and optimal lighting configurations for photometric stereo under arbitrary light distribution. We propose a Light Sampling Network (LSNet) that optimizes lighting direction for a fixed number of lights by minimizing the normal loss through a normal regression network. The learned light configurations can directly estimate surface normals during inference, even using an off-the-shelf photometric stereo method. Extensive qualitative and quantitative analyses on synthetic and real-world datasets show that photometric stereo under learned lighting configurations through LIPIDS either surpasses or is nearly comparable to existing illumination planning methods across different photometric stereo backbones.
Schematic of the LIPIDS framework
Evolution of the learned lighting directions over different epochs
Acknowledgements
Special thanks to the anonymous reviewers for their constructive feedback on our work. We would like to acknowledge the generous support of the Prime Minister Research Fellowship (PMRF) grant and the Jibaben Patel Chair in Artificial Intelligence for the completion of this work.