BuildingGPT: Auto-Regressive Building Wireframe Reconstruction Model with Reinforcement Learning

摘要

In this paper, we propose BuildingGPT, a novel auto-regressive model for building wireframe reconstruction from point clouds with reinforcement learning. Unlike prior works based on detection or diffusion models, BuildingGPT reformulates the building wireframe reconstruction task into a sequence prediction problem. Based on the hierarchical building wireframe tokenization, the wireframe sequences are organized in a structurally- and semantically-aware order for the next-token prediction. The point cloud encoder first transforms the input point cloud into a fixed-length latent code that serves as the starting of the sequence. Then, BuildingGPT auto-regressively predicts tokens conditioned on the latent code and previously generated tokens. With token sequence predicted, the building wireframe is obtained through detokenization. To enhance the model performance, we adopt a two-stage training paradigm including the pre-training and post-training. After the auto-regressive pre-training, Direct Preference Optimization (DPO) is employed as a post-training strategy to align reconstruction results with human preferences. Extensive experiments on the large-scale MunichWF dataset show that BuildingGPT outperforms existing state-of-the-art methods. We commit to release the code and dataset.

出版物
IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2026
Yuzhou Liu
刘昱州
博士研究生 (2021-至今)
Lingjie Zhu
朱灵杰
博士研究生 (2014-2020)
Hanqiao Ye
叶翰樵
博士研究生 (2022-至今)
Xiang Gao
高翔
副研究员, 硕导
Shuhan Shen
申抒含
研究员, 博导