Homepage - Haiping Wang

Selected Publications (*Co-First, †Corresponding) (view all )

GAGS':' Granularity-Aware 3D Feature Distillation for Gaussian Splatting

Yuning Pang*, Haiping Wang*, Yuan Liu†, Chenglu Wen, Zhen Dong†, Bisheng Yang

3D Open-vocabulary Understanding

AAAI 2026

GAGS learns a 3D Gaussian field associated with semantic features, which enables accurate open-vocabulary 3D visual grounding in the scene.

[Paper] [Code] [Project Page]

GAGS':' Granularity-Aware 3D Feature Distillation for Gaussian Splatting

Yuning Pang*, Haiping Wang*, Yuan Liu†, Chenglu Wen, Zhen Dong†, Bisheng Yang

AAAI 2026 3D Open-vocabulary Understanding

GAGS learns a 3D Gaussian field associated with semantic features, which enables accurate open-vocabulary 3D visual grounding in the scene.

[Paper] [Code] [Project Page]

The Neural City':' A Next-generation Spatio-Temporal Intelligence Paradigm for Urban Holistic Governance

Zhen Dong (Supervisor), Haiping Wang*, Zhe Chen, Chen Long, Yuning Peng, Yuan Liu, Fuxun Liang, Jian Zhou, Yiping Chen, Fan Zhang, Bisheng Yang†, Deren Li

Spatial Intelligence System

The Innovation (IF: 25.7) 2025

We outline Neural City to enable end-to-end urban governance, seamlessly linking raw urban data to holistic urban governance, achieving "6W+4R" governance.

[Paper]

The Neural City':' A Next-generation Spatio-Temporal Intelligence Paradigm for Urban Holistic Governance

Zhen Dong (Supervisor), Haiping Wang*, Zhe Chen, Chen Long, Yuning Peng, Yuan Liu, Fuxun Liang, Jian Zhou, Yiping Chen, Fan Zhang, Bisheng Yang†, Deren Li

The Innovation (IF: 25.7) 2025 Spatial Intelligence System

We outline Neural City to enable end-to-end urban governance, seamlessly linking raw urban data to holistic urban governance, achieving "6W+4R" governance.

[Paper]

VistaDream':' Sampling multiview consistent images for single-view scene reconstruction

Haiping Wang, Yuan Liu†, Ziwei Liu, Zhen Dong†, Wenping Wang, Bisheng Yang

Text/Image-to-3D Scene Generation

International Conference on Computer Vision (ICCV) 2025

VistaDream is a training-free framework to reconstruct a high-quality 3D scene from a single-view image. The key idea is to sample multi-view consistent high-quality images from pre-trained single-view diffusion models.

[Paper] [Code] [Project Page]

VistaDream':' Sampling multiview consistent images for single-view scene reconstruction

Haiping Wang, Yuan Liu†, Ziwei Liu, Zhen Dong†, Wenping Wang, Bisheng Yang

International Conference on Computer Vision (ICCV) 2025 Text/Image-to-3D Scene Generation

[Paper] [Code] [Project Page]

SpatialLLM':' From Multi-modality Data to Urban Spatial Intelligence

Jiabin Chen, Haiping Wang*, Jinpeng Li, Yuan Liu†, Zhen Dong†, Bisheng Yang

3D-Large Language Model

arXiv 2025

Structured descriptions of raw spatial data equip LLM with zero-shot execution of advanced spatial intelligence tasks, including urban planning, ecological analysis, traffic management, etc.. Multi-field knowledge, context length, and reasoning ability are key factors influencing LLM performances in urban analysis.

[Paper] [Code]

SpatialLLM':' From Multi-modality Data to Urban Spatial Intelligence

Jiabin Chen, Haiping Wang*, Jinpeng Li, Yuan Liu†, Zhen Dong†, Bisheng Yang

arXiv 2025 3D-Large Language Model

[Paper] [Code]

CityAnchor':' City-scale 3D Visual Grounding with Multi-modality LLMs

Jinpeng Li, Haiping Wang*, Jiabin Chen, Yuan Liu†, Zhiyang Dou, Yuexin Ma, Sibei Yang, Yuan Li, Wenping Wang, Zhen Dong, Bisheng Yang†

3D-Large Language Model

International Conference on Learning Representations (ICLR) 2025

We present a two-stage (coarse-to-fine) 3D visual grounding system by tuning Large Vision Language Model (LVLM) to accurately find targets in city-scale point clouds from text descriptions.

[Paper] [Code]

CityAnchor':' City-scale 3D Visual Grounding with Multi-modality LLMs

Jinpeng Li, Haiping Wang*, Jiabin Chen, Yuan Liu†, Zhiyang Dou, Yuexin Ma, Sibei Yang, Yuan Li, Wenping Wang, Zhen Dong, Bisheng Yang†

International Conference on Learning Representations (ICLR) 2025 3D-Large Language Model

We present a two-stage (coarse-to-fine) 3D visual grounding system by tuning Large Vision Language Model (LVLM) to accurately find targets in city-scale point clouds from text descriptions.

[Paper] [Code]

FreeReg':' Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators

Haiping Wang*, Yuan Liu*, Bing Wang, Yujing Sun, Zhen Dong†, Wenping Wang, Bisheng Yang†

Image-to-Point Cloud Registration

International Conference on Learning Representations (ICLR) 2024

FreeReg extracts cross-modality features from pretrained diffusion models and monocular depth estimators for accurate zero-shot image-to-point cloud registration.

[Paper] [Code] [Project Page] [Video]

FreeReg':' Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators

Haiping Wang*, Yuan Liu*, Bing Wang, Yujing Sun, Zhen Dong†, Wenping Wang, Bisheng Yang†

International Conference on Learning Representations (ICLR) 2024 Image-to-Point Cloud Registration

FreeReg extracts cross-modality features from pretrained diffusion models and monocular depth estimators for accurate zero-shot image-to-point cloud registration.

[Paper] [Code] [Project Page] [Video]

A novel method for registration of MLS and stereo reconstructed point clouds

Xiaochen Yang*, Haiping Wang*, Zhen Dong†, Yuan Liu, Yuhan Li, Bisheng Yang†

Point Cloud Pariwise Registration

IEEE Transactions on Geoscience and Remote Sensing (T-GRS, IF:8.2) 2024

A fast and robust SO(2)-equivariant point cloud descriptor designed for aligning point clouds confirming 4DoF rigid-transformation such as MLS and TLS data.

[Paper] [Code]

A novel method for registration of MLS and stereo reconstructed point clouds

Xiaochen Yang*, Haiping Wang*, Zhen Dong†, Yuan Liu, Yuhan Li, Bisheng Yang†

IEEE Transactions on Geoscience and Remote Sensing (T-GRS, IF:8.2) 2024 Point Cloud Pariwise Registration

A fast and robust SO(2)-equivariant point cloud descriptor designed for aligning point clouds confirming 4DoF rigid-transformation such as MLS and TLS data.

[Paper] [Code]

Robust Multiview Point Cloud Registration with Reliable Pose Graph Initialization and History Reweighting

Haiping Wang*, Yuan Liu*, Zhen Dong†, Yulan Guo, Yu-Shen Liu, Wenping Wang, Bisheng Yang†

Point Cloud Multiview Registration

The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2023

A simple and effective multiview point cloud registration method containing a sparse pose graph construction and a robust IRLS method, achieving SoTA registration performances on the 3D(Lo)Match, ScanNet, and ETH datasets (2023).

[Paper] [Code] [Video]

Robust Multiview Point Cloud Registration with Reliable Pose Graph Initialization and History Reweighting

Haiping Wang*, Yuan Liu*, Zhen Dong†, Yulan Guo, Yu-Shen Liu, Wenping Wang, Bisheng Yang†

The IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR) 2023 Point Cloud Multiview Registration

[Paper] [Code] [Video]

RoReg':' Pairwise Point Cloud Registration with Oriented Descriptors and Local Rotations

Haiping Wang*, Yuan Liu*, Qingyong Hu, Bing Wang, Jianguo Chen, Zhen Dong†, Yulan Guo, Wenping Wang, Bisheng Yang†

Point Cloud Pariwise Registration

IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI, IF:24.3) 2023

Group-based rotation-equivariance can benefit each components of point cloud registration, including feature extraction, feature detection, feature matching, and transformation estimation. RoReg achieves SoTA registration performances on the 3D(Lo)Match and ETH datasets (2023).

[Paper] [Code] [Project Page]

RoReg':' Pairwise Point Cloud Registration with Oriented Descriptors and Local Rotations

Haiping Wang*, Yuan Liu*, Qingyong Hu, Bing Wang, Jianguo Chen, Zhen Dong†, Yulan Guo, Wenping Wang, Bisheng Yang†

IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI, IF:24.3) 2023 Point Cloud Pariwise Registration

[Paper] [Code] [Project Page]

You Only Hypothesize Once':' Point Cloud Registration with Rotation-equivariant Descriptors

Haiping Wang*, Yuan Liu*, Zhen Dong†, Wenping Wang

Point Cloud Pariwise Registration

ACM Multimedia (MM) 2022

Endow local descriptors of point clouds with rotation equivariance based on the icosahedral group learning, achieving SoTA registration performances on the 3D(Lo)Match, ETH, and WHU-TLS datasets (2022).

[Paper] [Code] [Project Page] [Video]

You Only Hypothesize Once':' Point Cloud Registration with Rotation-equivariant Descriptors

Haiping Wang*, Yuan Liu*, Zhen Dong†, Wenping Wang

ACM Multimedia (MM) 2022 Point Cloud Pariwise Registration

[Paper] [Code] [Project Page] [Video]

News

Honors & Awards

Selected Publications (*Co-First, †Corresponding) (view all )

GAGS':' Granularity-Aware 3D Feature Distillation for Gaussian Splatting

GAGS':' Granularity-Aware 3D Feature Distillation for Gaussian Splatting

The Neural City':' A Next-generation Spatio-Temporal Intelligence Paradigm for Urban Holistic Governance

The Neural City':' A Next-generation Spatio-Temporal Intelligence Paradigm for Urban Holistic Governance

VistaDream':' Sampling multiview consistent images for single-view scene reconstruction

VistaDream':' Sampling multiview consistent images for single-view scene reconstruction

SpatialLLM':' From Multi-modality Data to Urban Spatial Intelligence

SpatialLLM':' From Multi-modality Data to Urban Spatial Intelligence

CityAnchor':' City-scale 3D Visual Grounding with Multi-modality LLMs

CityAnchor':' City-scale 3D Visual Grounding with Multi-modality LLMs

FreeReg':' Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators

FreeReg':' Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators

A novel method for registration of MLS and stereo reconstructed point clouds

A novel method for registration of MLS and stereo reconstructed point clouds

Robust Multiview Point Cloud Registration with Reliable Pose Graph Initialization and History Reweighting

Robust Multiview Point Cloud Registration with Reliable Pose Graph Initialization and History Reweighting

RoReg':' Pairwise Point Cloud Registration with Oriented Descriptors and Local Rotations

RoReg':' Pairwise Point Cloud Registration with Oriented Descriptors and Local Rotations

You Only Hypothesize Once':' Point Cloud Registration with Rotation-equivariant Descriptors

You Only Hypothesize Once':' Point Cloud Registration with Rotation-equivariant Descriptors

All publications