Suppose we want to use tabletop to perform object detection tasks in our system. Even though it is easy to use (as explained in the tutorial), understanding how tabletop works surely offers more satisfaction.
By the end of this read, we hope to have made clear what tabletop is, how tabletop relates to the ecto framework, and how to manage tabletop's configuration file.
What is tabletop?
Tabletop is a ROS project that performs planar surface detection and object detection.
To perform these tasks, tabletop relies on the assumption that the objects to be recognized rest on a planar surface such as a table.
The sensor data that tabletop uses consists of a point cloud from narrow stereo or Kinect cameras. From this input, tabletop first detects the planar surface, then segments the point clusters sitting on that surface, and finally compares each cluster to the object meshes stored in its database.
The output of these components consists of the location of the table, the identified point clusters, and, for those clusters found to be similar to database objects, the corresponding database object id and fitting pose.
Tabletop uses a CouchDB database that contains details about the objects to be recognized. These details include the object id (generated automatically when the object is added to the database), the object's name, the object's author, the object's tags, and the object's 3D mesh. This 3D mesh is used by tabletop during its recognition step. Setting up CouchDB is explained here, and managing objects is explained here.
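For a quick check that the database is reachable and populated, you can query CouchDB's plain HTTP API directly. The sketch below is a minimal example, assuming the default root URL and collection name used in the configuration file shown later; the listed documents carry the object metadata described above.
=============================================
#!/usr/bin/env python3
# Minimal sketch: list the documents stored in tabletop's CouchDB
# collection via CouchDB's HTTP API. The root URL and collection name
# are the ones from the example configuration below; adapt as needed.
import json
from urllib.request import urlopen

DB_ROOT = 'http://localhost:5984'    # 'root' in the configuration file
COLLECTION = 'object_recognition'    # 'collection' in the configuration file

with urlopen('%s/%s/_all_docs' % (DB_ROOT, COLLECTION)) as response:
    result = json.load(response)

# Each row is one stored document; object documents hold the metadata
# described above (name, author, tags, mesh).
for row in result['rows']:
    print(row['id'])
=============================================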
How to manage tabletop's input parameters?
tabletop manages a number of parameters, such as the input image streams, database details, and detection thresholds. All of these parameters are defined in a configuration file, passed to the tabletop command via its -c option. The configuration file must be in YAML format. An example configuration for tabletop that processes the images published by the openni_launch package is shown below:
=============================================
source1:
  type: RosKinect
  module: 'object_recognition_ros.io'
  parameters:
    rgb_frame_id: camera_rgb_optical_frame
    rgb_image_topic: /camera/rgb/image_rect_color
    rgb_camera_info: /camera/rgb/camera_info
    depth_image_topic: /camera/depth_registered/image_raw
    depth_camera_info: /camera/depth_registered/camera_info

sink1:
  type: TablePublisher
  module: 'object_recognition_tabletop'
  inputs: [source1]

sink2:
  type: Publisher
  module: 'object_recognition_ros.io'
  inputs: [source1]

pipeline1:
  type: TabletopTableDetector
  module: 'object_recognition_tabletop'
  inputs: [source1]
  outputs: [sink1]
  parameters:
    table_detector:
      min_table_size: 4000
      plane_threshold: 0.01

pipeline2:
  type: TabletopObjectDetector
  module: 'object_recognition_tabletop'
  inputs: [source1, pipeline1]
  outputs: [sink2]
  parameters:
    object_ids: 'all'
    tabletop_object_ids: 'REDUCED_MODEL_SET'
    threshold: 0.85
    db:
      type: CouchDB
      root: http://localhost:5984
      collection: object_recognition
=============================================
'source1' defines the image topics that tabletop needs for its detection steps. Basically, tabletop needs a depth image topic, a color image topic, and the camera information messages for these images as input.
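If tabletop produces no output, a common culprit is a mismatch between these topic names and the ones your camera driver actually publishes. The following sketch (assuming a running ROS master, with the topic names taken from the example configuration above) checks that the expected inputs are present:
=============================================
#!/usr/bin/env python
# Sanity check: verify that the input topics named in 'source1' are
# currently being published (e.g. by openni_launch).
import rospy

EXPECTED = [
    '/camera/rgb/image_rect_color',
    '/camera/rgb/camera_info',
    '/camera/depth_registered/image_raw',
    '/camera/depth_registered/camera_info',
]

if __name__ == '__main__':
    rospy.init_node('tabletop_input_check', anonymous=True)
    published = set(name for name, _type in rospy.get_published_topics())
    for topic in EXPECTED:
        print('%-45s %s' % (topic, 'OK' if topic in published else 'MISSING'))
=============================================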
'sink1' and 'sink2' define how the output of tabletop can be processed further. In this example, they take care of publishing tabletop's detection results as ROS messages.
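To get a feel for what these sinks emit, a node can simply subscribe to the published topics. The sketch below assumes the topic names 'table_array' and 'recognized_object_array', which these publishers commonly use (check with rostopic list on your system), and requires the object_recognition_msgs package:
=============================================
#!/usr/bin/env python
# Minimal listener sketch for tabletop's published results.
import rospy
from object_recognition_msgs.msg import RecognizedObjectArray, TableArray

def objects_cb(msg):
    for obj in msg.objects:
        # obj.type.key is the database object id, obj.confidence the fit score
        rospy.loginfo('object %s (confidence %.2f)', obj.type.key, obj.confidence)

def tables_cb(msg):
    rospy.loginfo('detected %d table(s)', len(msg.tables))

if __name__ == '__main__':
    rospy.init_node('tabletop_result_listener')
    rospy.Subscriber('recognized_object_array', RecognizedObjectArray, objects_cb)
    rospy.Subscriber('table_array', TableArray, tables_cb)
    rospy.spin()
=============================================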
'pipeline1' takes care of detecting planar surfaces.
'pipeline2' takes care of detecting objects on the main (detected) surface.
You can modify these parameters (the input image topics, the detection thresholds, the URI of the CouchDB to be used, etc.) according to the setup of your project.
Example:
<Video of tabletop execution with openni_launch and nao_camera>