Open-Vocabulary Affordance Detection in 3D Point Clouds

BEST PAPER AWARD FINALIST

Toan Nguyen1,2       Minh Nhat Vu3,4       An Vuong1       Dzung Nguyen1
Thieu Vo5               Ngan Le6               Anh Nguyen7

1FPT Software AI Center   2VNUHCM - University of Science   3ACIN - TU Wien   4Austrian Institute of Technology  
5Ton Duc Thang University   6University of Arkansas   7University of Liverpool

Abstract

Affordance detection is a challenging problem with a wide variety of robotic applications. Traditional affordance detection methods are limited to a predefined set of affordance labels, hence potentially restricting the adaptability of intelligent robots in complex and dynamic environments. In this paper, we present the Open-Vocabulary Affordance Detection (OpenAD) method, which is capable of detecting an unbounded number of affordances in 3D point clouds. By simultaneously learning the affordance text and the point feature, OpenAD successfully exploits the semantic relationships between affordances. Therefore, our proposed method enables zero-shot detection and can be able to detect previously unseen affordances without a single annotation example. Intensive experimental results show that OpenAD works effectively on a wide range of affordance detection setups and outperforms other baselines by a large margin. Additionally, we demonstrate the practicality of the proposed OpenAD in real-world robotic applications with a fast inference speed (≈ 100 ms).

BibTeX

@inproceedings{Nguyen2023open,
      title={Open-vocabulary affordance detection in 3d point clouds},
      author={Nguyen, Toan and Vu, Minh Nhat and Vuong, An and Nguyen, Dzung and Vo, Thieu and Le, Ngan and Nguyen, Anh},
      booktitle = IROS,
      year      = {2023}
  }

Acknowledgements

We borrow the page template from HyperNeRF. Special thanks to them!