Robotic Manipulation

Next-Best-View (NBV) Prediction for Highly Reflective Objects


Jun Yang and Steven L. Waslander. 

International Conference on Robotics and Automation, 2022 (Submitted)

Depth acquisition with the active stereo camera is a challenging task for highly reflective objects. When setup permits, multi-view fusion can provide increased levels of depth completion. However, due to the slow acquisition speed of high-end active stereo cameras, collecting a large number of viewpoints for a single scene is generally not practical. In this work, we propose a next-best-view framework to strategically select camera viewpoints for completing depth data on reflective objects.

Probabilistic Multi-View Fusion  of Active Stereo Depth Maps


Jun Yang, Dong Li and Steven L. Waslander. 

IEEE Robotics and Automation Letters (RA-L), 2021

The reliable fusion of depth maps from multiple viewpoints has become an important problem in many 3D reconstruction pipelines. In this work, we investigate its impact on robotic bin-picking tasks such as 6D object pose estimation. In this work, we propose a novel probabilistic framework for scene reconstruction in robotic bin-picking. Based on active stereo camera data, we first explicitly estimate the uncertainty of depth measurements, and incorporated into a probabilistic model for incrementally updating the scene.

ROBI: Reflective Object In Bins Dataset


Jun Yang, Yizhou Gao, Dong Li and Steven L. Waslander. 

IEEE/RSJ International Conference on Intelligent Robots and Systems, 2021. 

In robotic bin-picking applications, the perception of texture-less, highly reflective parts is a valuable but challenging task. The high glossiness can introduce fake edges in RGB images and inaccurate depth measurements especially in heavily cluttered bin scenario. In this paper, we present the ROBI (Reflective Objects in BIns) dataset, a public dataset for 6D object pose estimation and multi-view depth fusion in robotic bin-picking scenarios. The ROBI dataset includes a total of 63 bin-picking scenes captured with two active stereo camera: a high-cost Ensenso sensor and a low-cost RealSense sensor.​The ROBI dataset is available here.