BiKC: Keypose-Conditioned Consistency Policy for Bimanual Robotic Manipulation
Speaker
Yizhou Chen, TransGP
Co-author
Dongjie Yu, Hang Xu, Jia Pan
Abstract
Bimanual manipulation tasks typically involve multiple stages which require efficient interactions between two arms, posing step-wise and stage-wise challenges for imitation learning systems. Specifically, failure and delay of one step will broadcast through time, hinder success and efficiency of each sub-stage task, and thereby overall task performance. Although recent works have made strides in addressing certain challenges, few approaches explicitly consider the multi-stage nature of bimanual tasks while simultaneously emphasizing the importance of inference speed. In this paper, we introduce a novel keypose-conditioned consistency policy tailored for bimanual manipulation. It is a hierarchical imitation learning framework that consists of a high-level keypose predictor and a low-level trajectory generator. The predicted keyposes provide guidance for trajectory generation and also mark the completion of one sub-stage task. The trajectory generator is designed as a consistency model trained from scratch without distillation, which generates action sequences conditioning on current observations and predicted keyposes with fast inference speed. Simulated and real-world experimental results demonstrate that the proposed approach surpasses baseline methods in terms of success rate and operational efficiency.
Speaker Bio
Dr. Yizhou Chen is currently a postdoctoral fellow at the Centre for Transformative Garment Production (TransGP), HKU. He received his Ph.D. in Mechanical and Automation Engineering from The Chinese University of Hong Kong (CUHK) in 2023. Dr. Chen's research focuses on advancing robotics technology, with particular emphasis on task and motion planning, formal methods, and parallel computing for robot mapping.