
dc.contributor.authorPan, Haoranen_US
dc.contributor.authorZhou, Junen_US
dc.contributor.authorLiu, Yuanpengen_US
dc.contributor.authorLu, Xuequanen_US
dc.contributor.authorWang, Weimingen_US
dc.contributor.authorYan, Xuefengen_US
dc.contributor.authorWei, Mingqiangen_US
dc.contributor.editorUmetani, Nobuyukien_US
dc.contributor.editorWojtan, Chrisen_US
dc.contributor.editorVouga, Etienneen_US
dc.date.accessioned2022-10-04T06:41:23Z
dc.date.available2022-10-04T06:41:23Z
dc.date.issued2022
dc.identifier.issn1467-8659
dc.identifier.urihttps://doi.org/10.1111/cgf.14684
dc.identifier.urihttps://diglib.eg.org:443/handle/10.1111/cgf14684
dc.description.abstract6D pose estimation of rigid objects from RGB-D images is crucial for object grasping and manipulation in robotics. Although the RGB channels and the depth (D) channel are often complementary, providing appearance and geometry information respectively, it is still non-trivial to fully benefit from the two cross-modal data. We start from a simple yet new observation: when an object rotates, its semantic label is invariant to the pose while its keypoint offset direction varies with the pose. Motivated by this observation, we present SO(3)-Pose, a new representation learning network that explores SO(3)-equivariant and SO(3)-invariant features from the depth channel for pose estimation. The SO(3)-invariant features facilitate learning more distinctive representations for segmenting objects with similar appearance from the RGB channels. The SO(3)-equivariant features communicate with the RGB features to deduce the (missing) geometry for detecting keypoints of objects with reflective surfaces from the depth channel. Unlike most existing pose estimation methods, our SO(3)-Pose not only implements information communication between the RGB and depth channels, but also naturally absorbs SO(3)-equivariance geometry knowledge from depth images, leading to better appearance and geometry representation learning. Comprehensive experiments show that our method achieves state-of-the-art performance on three benchmarks. Code is available at https://github.com/phaoran9999/SO3-Pose.en_US
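
The abstract's central observation (pose-invariant semantic labels versus pose-equivariant keypoint offsets) can be illustrated with a small numeric sketch. This is not the authors' code; the toy point cloud, labels, and keypoint below are hypothetical and only serve to show how the two kinds of quantities transform under a rotation in SO(3).

# Minimal sketch (not the authors' code): under a rotation R in SO(3),
# per-point semantic labels stay the same, while keypoint-offset vectors
# rotate together with the object.
import numpy as np

def random_rotation(rng):
    # Random rotation matrix via QR decomposition of a Gaussian matrix.
    q, r = np.linalg.qr(rng.normal(size=(3, 3)))
    q = q * np.sign(np.diag(r))        # fix column signs
    if np.linalg.det(q) < 0:
        q[:, 0] = -q[:, 0]             # ensure det(R) = +1 (proper rotation)
    return q

rng = np.random.default_rng(0)
points = rng.normal(size=(100, 3))           # toy object point cloud
labels = rng.integers(0, 3, size=100)        # toy per-point semantic labels
keypoint = points[labels == 0].mean(axis=0)  # toy object keypoint
offsets = keypoint - points                  # keypoint-offset vector per point

R = random_rotation(rng)
points_rot = points @ R.T                    # rotate the object
offsets_rot = (keypoint @ R.T) - points_rot

# Labels ride along with the points, so they are unchanged (SO(3)-invariant);
# offsets transform by the same R (SO(3)-equivariant).
assert np.allclose(offsets_rot, offsets @ R.T)
print("labels: invariant; keypoint offsets: equivariant under rotation")

Per the abstract, SO(3)-Pose exploits exactly this asymmetry: SO(3)-invariant depth features support segmentation, while SO(3)-equivariant depth features communicate with RGB features for keypoint detection.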
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.subjectCCS Concepts: Computing methodologies → Point-based models
dc.subjectComputing methodologies → Point-based models
dc.titleSO(3)-Pose: SO(3)-Equivariance Learning for 6D Object Pose Estimationen_US
dc.description.seriesinformationComputer Graphics Forum
dc.description.sectionheadersImage Detection and Understanding
dc.description.volume41
dc.description.number7
dc.identifier.doi10.1111/cgf.14684
dc.identifier.pages371-381
dc.identifier.pages11 pages


This item appears in the following Collection(s)

  • 41-Issue 7
    Pacific Graphics 2022 - Symposium Proceedings
