E2PNet: Event to Point Cloud Registration with Spatio-Temporal Representation Learning

Lin, Xiuhong; Qiu, Changjie; Cai, Zhipeng; Shen, Siqi; Zang, Yu; Liu, Weiquan; Bian, Xuesheng; Müller, Matthias; Wang, Cheng

Computer Science > Computer Vision and Pattern Recognition

arXiv:2311.18433 (cs)

[Submitted on 30 Nov 2023 (v1), last revised 27 Dec 2023 (this version, v2)]

Title:E2PNet: Event to Point Cloud Registration with Spatio-Temporal Representation Learning

Authors:Xiuhong Lin, Changjie Qiu, Zhipeng Cai, Siqi Shen, Yu Zang, Weiquan Liu, Xuesheng Bian, Matthias Müller, Cheng Wang

View PDF HTML (experimental)

Abstract:Event cameras have emerged as a promising vision sensor in recent years due to their unparalleled temporal resolution and dynamic range. While registration of 2D RGB images to 3D point clouds is a long-standing problem in computer vision, no prior work studies 2D-3D registration for event cameras. To this end, we propose E2PNet, the first learning-based method for event-to-point cloud registration. The core of E2PNet is a novel feature representation network called Event-Points-to-Tensor (EP2T), which encodes event data into a 2D grid-shaped feature tensor. This grid-shaped feature enables matured RGB-based frameworks to be easily used for event-to-point cloud registration, without changing hyper-parameters and the training procedure. EP2T treats the event input as spatio-temporal point clouds. Unlike standard 3D learning architectures that treat all dimensions of point clouds equally, the novel sampling and information aggregation modules in EP2T are designed to handle the inhomogeneity of the spatial and temporal dimensions. Experiments on the MVSEC and VECtor datasets demonstrate the superiority of E2PNet over hand-crafted and other learning-based methods. Compared to RGB-based registration, E2PNet is more robust to extreme illumination or fast motion due to the use of event data. Beyond 2D-3D registration, we also show the potential of EP2T for other vision tasks such as flow estimation, event-to-image reconstruction and object recognition. The source code can be found at: this https URL.

Comments:	10 pages, 4 figures, accepted by Thirty-seventh Conference on Neural Information Processing Systems(NeurIPS 2023)
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2311.18433 [cs.CV]
	(or arXiv:2311.18433v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2311.18433

Submission history

From: Xiuhong Lin [view email]
[v1] Thu, 30 Nov 2023 10:33:49 UTC (19,192 KB)
[v2] Wed, 27 Dec 2023 13:44:08 UTC (18,736 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:E2PNet: Event to Point Cloud Registration with Spatio-Temporal Representation Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:E2PNet: Event to Point Cloud Registration with Spatio-Temporal Representation Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators