EPR-Net: Enhanced patch representation network for point cloud normal estimation

•A novel GraphFormer module is designed for learning feature embeddings to effectively represent the geometric structure of local patches. The module employs the PoolFormer architecture and incorporates graph convolution with adaptive kernels, enabling the model to extract discriminative features in...

Full description

Saved in:
Bibliographic Details
Published inComputer aided design Vol. 189; p. 103944
Main Authors Wu, Min, Wang, Yinghui, Huang, Liangyi, Yang, Jinlong, Li, Wei, Shen, Jiaxing, Ning, Xiaojuan
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 01.12.2025
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:•A novel GraphFormer module is designed for learning feature embeddings to effectively represent the geometric structure of local patches. The module employs the PoolFormer architecture and incorporates graph convolution with adaptive kernels, enabling the model to extract discriminative features in sharp feature regions (e.g., edges and corners).•A pyramid dynamic graph update strategy is introduced to further enhance feature integration. This strategy performs multi-scale geometric information fusion, effectively capturing both global structure and local geometry while alleviating the scale ambiguity in determining the optimal neighborhood. Meanwhile, at each network layer, this strategy expands the model’s receptive field by recomputing the k-nearest neighbors for each candidate point, enabling the model to capture potential long-range semantic characteristics in local patches.•Experiments on various datasets demonstrate the superiority of the proposed EPR-Net in point cloud normal estimation. Normal estimation for point clouds is fundamental to 3D geometric processing and applications. Despite recent advances by deep learning-based methods, effectively representing geometric structures in regions with sharp features and complex geometries remains challenging. This limitation primarily arises from the use of general architectures (e.g., CNNs, PointNet) or conventional graph convolutions, which limits the ability to capture fine geometric details in local point cloud patches. Moreover, the persistent issue of scale ambiguity in selecting optimal neighborhoods further hinders precise encoding of local structures. To address these challenges, we propose EPR-Net, a novel framework that enhances local patch representation learning for normal estimation in point clouds. Specifically, we introduce the GraphFormer module, which builds on the PoolFormer architecture to improve feature learning and incorporates graph convolution with adaptive kernels to capture geometric details across different semantic regions, thereby enabling more discriminative feature encodings. Additionally, we design the pyramid dynamic graph update (PDGU) strategy, which guides multi-scale feature aggregation through geometric weights to alleviate the scale ambiguity in neighborhood selection. PDGU also dynamically updates the local k-nearest neighbor (kNN) graph to expand the receptive field, thereby enhancing the ability of the model to extract long-range semantic information from point cloud patches. Extensive experiments are conducted on both synthetic and real-world datasets, and the qualitative and quantitative evaluations demonstrate the superiority of our method in point cloud normal estimation.
ISSN:0010-4485
DOI:10.1016/j.cad.2025.103944