Research

A probabilistic model for planar sliding of objects with unknown material properties: Identification and robust planning

Changkyu Song and Abdeslam Boularias, A Probabilistic Model for Planar Sliding of Objects with Unknown Material Properties, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2020), Oct. 25-29th, 2020, Las Vegas, Nevada, USA [PDF] [Video] [Project Page]

This paper introduces a new technique for learning probabilistic models of mass and friction distributions of unknown objects, and performing robust sliding actions by using the learned models. The proposed method is executed in two consecutive phases. In the exploration phase, a table-top object is poked by a robot from different angles. The observed motions of the object are compared against simulated motions with various hypothesized mass and friction models. The simulation-to-reality gap is then differentiated with respect to the unknown mass and friction parameters, and the analytically computed gradient is used to optimize those parameters. Since it is difficult to disentangle the mass from the friction coefficients in low-data and quasi-motion regimes, our approach retains a set of locally optimal pairs of mass and friction models. A probability distribution on the models is computed based on the relative accuracy of each pair of models. In the exploitation phase, a probabilistic planner is used to select a goal configuration and waypoints that are stable with a high confidence. The proposed technique is evaluated on real objects and using a real manipulator. The results show that this technique can not only identify accurately mass and friction coefficients of non-uniform heterogeneous objects, but can also be used to successfully slide an unknown object to the edge of a table and pick it up from there, without any human assistance or feedback.

Learning to Slide Unknown Objects with Differentiable Physics Simulations

Changkyu Song and Abdeslam Boularias, Learning to Slide Unknown Objects with Differentiable Physics Simulations, Robotics: Science and Systems (R:SS), July 14-16th, 2020, Oregon State University at Corvallis, Oregon, USA. [PDF] [Video] [Project Page]

We propose a new technique for pushing an unknown object from an initial configuration to a goal configuration with stability constraints. The proposed method leverages recent progress in differentiable physics models to learn unknown mechanical properties of pushed objects, such as their distributions of mass and coefficients of friction. The proposed learning technique computes the gradient of the distance between predicted poses of objects and their actual observed poses, and utilizes that gradient to search for values of the mechanical properties that reduce the reality gap. The proposed approach is also utilized to optimize a policy to efficiently push an object toward the desired goal configuration. Experiments with real objects using a real robot to gather data show that the proposed approach can identify mechanical properties of heterogeneous objects from a small number of pushing actions.

Object Rearrangement with Nested Nonprehensile Manipulation Actions

Changkyu Song and Abdeslam Boularias, Object Rearrangement with Nested Nonprehensile Manipulation Actions, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2019) Macau, China. [PDF] [Video] [Project Page]

This paper considers the problem of rearrangement planning, i.e finding a sequence of manipulation actions that displace multiple objects from an initial configuration to a given goal configuration. Rearrangement is a critical skill for robots so that they can effectively operate in confined spaces that contain clutter. Examples of tasks that require rearrangement include packing objects inside a bin, wherein objects need to lay according to a predefined pattern. In tight bins, collision-free grasps are often unavailable. Nonprehensile actions, such as pushing and sliding, are preferred because they can be performed using minimalistic end-effectors that can easily be inserted in the bin. Rearrangement with nonprehensile actions is a challenging problem as it requires reasoning about object interactions in a combinatorially large configuration space of multiple objects. This work revisits several existing rearrangement planning techniques and introduces a new one that exploits nested nonprehensile actions by pushing several similar objects simultaneously along the same path, which removes the need to rearrange each object individually. Experiments in simulation and using a real Kuka robotic arm show the ability of the proposed approach to solve difficult rearrangement tasks while reducing the length of the end-effector's trajectories.

Inferring 3D Shapes of Unknown Rigid Objects in Clutter through Inverse Physics Reasoning with Monte Carlo Tree Search

Changkyu Song and Abdeslam Boularias, Inferring 3D Shapes of Unknown Rigid Objects in Clutter through Inverse Physics Reasoning, IEEE Robotics and Automation Letters (RA-L 2018) and presented in IEEE International Conference on Robotics and Automation (ICRA 2019) Montreal, Canada. [PDF] [Poster] [Video] [Article in Tech Xplore] [Project Page]

We present a probabilistic approach for building, on the fly, 3-D models of unknown objects while being manipulated by a robot. We specifically consider manipulation tasks in piles of clutter that contain previously unseen objects. Most manipulation algorithms for performing such tasks require known geometric models of the objects in order to grasp or rearrange them robustly. One of the novel aspects of this work is the utilization of a physics engine for verifying hypothesized geometries in simulation. The evidence provided by physics simulations is used in a probabilistic framework that accounts for the fact that mechanical properties of the objects are uncertain. We present an efficient algorithm for inferring occluded parts of objects based on their observed motions and mutual interactions. Experiments using a robot show that this approach is efficient for constructing physically realistic 3-D models, which can be useful for manipulation planning. Experiments also show that the proposed approach significantly outperforms alternative approaches in terms of shape accuracy.

Towards Robust Product Packing with a Minimalistic End-Effector

Rahul Shome, Wei N. Tang, Changkyu Song, Chaitanya Mitash, Chris Kourtev, Jingjin Yu, Abdeslam Boularias, and Kostas E. Bekris, Towards Robust Product Packing with a Minimalistic End-Effector, IEEE International Conference on Robotics and Automation (ICRA 2019), Montreal, Canada. Finalist for Best Paper in Automation [PDF] [Video] [Webpage] [Article in Tech Xplore] [Project Page]

Advances in sensor technologies, object detection algorithms, planning frameworks and hardware designs have motivated the deployment of robots in warehouse automation. A variety of such applications, like order fulfillment or packing tasks, require picking objects from unstructured piles and carefully arranging them in bins or containers. Desirable solutions need to be low-cost, easily deployable and controllable, making minimalistic hardware choices desirable. The challenge in designing an effective solution to this problem relates to appropriately integrating multiple components, so as to achieve a robust pipeline that minimizes failure conditions. The current work proposes a complete pipeline for solving such packing tasks, given access only to RGB-D data and a single robot arm with a minimalistic, vacuum-based end-effector. To achieve the desired level of robustness, three key manipulation primitives are identified, which take advantage of the environment and simple operations to successfully pack multiple cubic objects. The overall approach is demonstrated to be robust to execution and perception errors. The impact of each manipulation primitive is evaluated by considering different versions of the proposed pipeline that incrementally introduce reasoning about object poses and corrective manipulation actions.

Fast ADMM Algorithm for Distributed Optimization with Adaptive Penalty

Changkyu Song, Sejong Yoon and Vladimir Pavlovic, Fast ADMM Algorithm for Distributed Optimization with Adaptive Penalty, The 30th AAAI Conference on Artificial Intelligence (AAAI 2016), Phoenix, Arizona, USA, 2016. [PDF] [Slide] [Project Page]

We propose new methods to speed up convergence of the Alternating Direction Method of Multipliers (ADMM), a common optimization tool in the context of large scale and distributed learning. The proposed method accelerates the speed of convergence by automatically deciding the constraint penalty needed for parameter consensus in each iteration. In addition, we also propose an extension of the method that adaptively determines the maximum number of iterations to update the penalty. We show that this approach effectively leads to an adaptive, dynamic network topology underlying the distributed optimization. The utility of the new penalty update schemes is demonstrated on both synthetic and real data, including an instance of the probabilistic matrix factorization task known as the structure-from-motion problem.

Results

3D Monocular Multiview Tracker with 3D Aspect Parts

Yu Xiang*, Changkyu Song*, Roozbeh Mottaghi, and Silvio Savarese (*equal contribution), Monocular Multiview Object Tracking with 3D Aspect Parts, in European Conference on Computer Vision 2014 (ECCV 2014) [PDF] [Video] [Project Page]

In this work, we focus on the problem of tracking objects under significant viewpoint variations, which poses a big challenge to traditional object tracking methods. We propose a novel method to track an object and estimate its continuous pose and part locations under severe viewpoint change. In order to handle the change in topological appearance introduced by viewpoint transformations, we represent objects with 3D aspect parts [1] and model the relationship between viewpoint and 3D aspect parts in a part-based particle filtering framework. Moreover, we show that instance-level online-learned part appearance can be incorporated into our model, which makes it more robust in difficult scenarios with occlusions. Experiments are conducted on a new dataset of challenging YouTube videos and a subset of the KITTI dataset [2] that include significant viewpoint variations, as well as a standard sequence for car tracking. We demonstrate that our method is able to track the 3D aspect parts and the viewpoint of objects accurately despite significant changes in viewpoint. (a) An example output of our tracking framework. Our multiview tracker provides the estimates for continuous pose and 3D aspect parts of the object. (b) An example of the 3D aspect part representation of a 3D object (car) and the projections of the object from different viewpoints.

Results

Dataset

Multiview Tracking Dataset ~650MB (uploaded on stanford)

Reference

Y. Xiang and S. Savarese. Estimating the Aspect Layout of Object Categories. In CVPR, 2012.

A. Geiger, P. Lenz and R. Urtasun. Are we ready for Autonomous Driving? The KITTI Vision Benchmark Suite. In CVPR, 2012.

Acknowledgement

We acknowledge the support of DARPA UPSIDE grant A13-0895-S002 and NSF CAREER grant N.1054127.

Link

This is one of Stanford CVGL Lab. Projects.

Human Intuitive Hierarchical Model for Fine-Grained Categorization

(Machine Learning Class Project, 2013.Jan ~ 2013.Apr) [PDF]

This project focuses on the fine-grained categorization problem, such as Caltech-UCSD Birds, which gives very low performance with the-state-of-art methods. This method categorizes images according to its hierarchical model, and the hierarchical model is built from categorization examples by human. We collected enough number of human-examples by asking human to classify a small set of training data. After that, we transform each classification result to a hierarchical tree, and synthesize all the trees into a hierarchical tree. For synthesizing trees into one, we transform the binary tree to a matrix form and find a new matrix which has minimum norm error with those matrices and represents a new binary tree. We proposed a categorization method based on human intuition and proposed a novel method to find a new binary tree which contains common hierarchical characteristics of a number of binary trees.

Dynamic Resource Allocation by Particle Filter Tracking

Changkyu Song, Jeany Son, Suha Kwak, and Bohyung Han, Dynamic Resource Allocation by Ranking SVM for Particle Filter Tracking, in British Machine Vision Conference 2011 (BMVC 2011) [PDF] [Poster] [Project Page]

We propose a dynamic resource allocation algorithm based on Ranking Support Vector Machine (R-SVM) [1] for particle filter tracking. We adjust the number of observations in each frame adaptively and automatically, where tracker performs measurement for a subset of highly ranked particles in likelihood to preserve mode locations in the posterior and allocates the rest of particles to maintain the diversity of the posterior without actual measurements.

.