Non-Rigid Structure-from-Motion via Differential Geometry with Recoverable Conformal Scale

Code coming soon!

Non-Rigid Structure-from-Motion

(potential to be mapping part of deformable SLAM)

Non-rigid structure-from-motion (NRSfM) addresses the challenge of reconstructing the 3D shapes of deforming objects from multiple calibrated monocular images. It is a fundamental problem in 3D computer vision, with applications ranging from entertainment to modern surgery. The NRSfM algorithm reconstructs 3D shapes in local camera coordinates, inherently intertwining camera motion with object deformations. This concept closely aligns with deformable visual SLAM, which aims to localize a robot and map its environment, even in dynamically deforming scenarios. NRSfM has the potential to play a pivotal role in overcoming the mapping challenges associated with deformable visual SLAM. By integrating tools for robot pose estimation, NRSfM can enhance mapping consistency through information fusion, paving the way for significant advancements in deformable visual SLAM research.

Theoretical Basis

Metric Preservation

Rotational Invariant Connection

Framework

Pre-Step. Variable optimization using isometric assumption

Step 1. Second-order terms optimization

Step 2. First-order terms optimization using conformal assumption

Step 3. Depth computation

Step 4. Conformal scale optimization

Loop and terminal conditions

Dense 3D point cloud with texture

Simulation and Experimental Results

T-shirt

Flag dataset

The mean \% 3D errors are respectively 1.5364\% (Ours), 2.6631\% (Diff), 1.6289\% (infP), 3.0996\% (Ch17), 3.0309\% (SDP17), and 1.5710\% (Go20).

Rug dataset

The mean % 3D errors of all the images are respectively 1.2511% (Ours), 1.970% (Diff), 1.5286% (infP), 2.082% (Ch17), 2.1748% (SDP17), and 1.4296% (Go20).

KinectPaper dataset

The mean % 3D errors are respectively 0.7011% (Ours), 3.1577% (Diff), 1.2106% (infP), 1.9164% (Ch17), 2.0804% (SDP17), and 0.9294% (Go20).

NRSfM challenge datasets

Deformable SLAM Datasets

Conclusion

This paper introduces a novel theoretical framework, Con-NRSfM, for addressing conformal NRSfM problems. The framework leverages the rotational invariance of connections to express conformal constraints without relying on assumptions about surface geometry. The proposed formulation is solved using selected image warps, a separable parallel graph optimization strategy, and a self-supervised convolutional network. Our method has been extensively tested on a wide range of synthetic and real datasets featuring diverse baseline viewpoints and deformations. The results demonstrate that Con-NRSfM outperforms state-of-the-art methods in both accuracy and robustness.

Page updated

Google Sites

Report abuse