SemanticHouseCat3D is a synthetic dataset for semantic matching where each category directory contains a number of rendered frames for each objects. The typical structure of a category directory is the following:
๐ category
๐ depth # depth maps of each frame
๐ผ๏ธ000000.png
๐ผ๏ธ000001.png
๐ผ๏ธ000002.png
...
๐ instance # instance masks of each frame
ย ย ย ๏ธ๐ผ๏ธ000000_mask.png
๐ผ๏ธ000001_mask.png
๐ผ๏ธ000002_mask.png
...
๐ nocs # nocs maps of each frame
๐ผ๏ธ000000.pngย
๐ผ๏ธ000001.png
๐ผ๏ธ000002.png
...ย
๐ normals # mesh normals maps of each frame
๐ผ๏ธ000000.pngย
๐ผ๏ธ000001.png
๐ผ๏ธ000002.png
...
๐ pcd # point clouds of each frame
๐ผ๏ธ000000.npy
๐ผ๏ธ000001.npy
๐ผ๏ธ000002.npy
...
๐ rgb # RGB image of each frame
๐ผ๏ธ000000.png
๐ผ๏ธ000001.png
๐ผ๏ธ000002.png
...
๐ frame_metadata.json # extra information about each frame, eg.: object name, ID, 6D pose, scale
The main information about the point pairs annotations is contained in the PairAnnotations directory. Each file is named following the structure "PairID-SourceID-TargetID:category.json"
For questions and more information, send an email to sebeto@acin.tuwien.ac.at