Identification of representative buildings and building groups in urban datasets using a novel pre-processing, classification, clustering and predictive modelling approach
The formulation of energy policies for urban building stock frequently requires the evaluation of the energy use of large numbers of buildings. When urban energy modelling is utilised as part of this process, the identification of building groups and associated representative buildings can play a cr...
Saved in:
Published in | Building and environment Vol. 140; pp. 90 - 106 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
Oxford
Elsevier Ltd
01.08.2018
Elsevier BV |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | The formulation of energy policies for urban building stock frequently requires the evaluation of the energy use of large numbers of buildings. When urban energy modelling is utilised as part of this process, the identification of building groups and associated representative buildings can play a critical role. This paper outlines a novel methodology for identifying building groups and associated representative buildings in urban datasets. The methodology utilizes a combination of building classification, building clustering and predictive modelling. First, multiple urban-scale datasets are collected, and then, classification techniques and clustering algorithms are applied to identify building clusters. Next, representative buildings (medoids) in each cluster are identified. Predictive modelling is used to expand cluster membership in the case where some buildings were excluded from the previous analysis. A number of different clustering algorithms are assessed, including K-means and hierarchical (agglomerative and divisive) and partitioning around medoids. The methodology is applied to a large dataset of mixed-use buildings in the city of Geneva, Switzerland. The results, assessed by nine validation indices, indicate the capacity of the decision support framework to identify clusters and associated representative buildings. Furthermore, post-application of predictive modelling, using a random forest approach, facilitates the incorporation of a larger portion of the building stock within the established clusters with an overall average classification accuracy of 89%. A total of 67 representative buildings were identified in the urban dataset, which consisted of 13614 mixed-use buildings in the city of Geneva.
•A novel methodology based on predictive classification and clustering is presented.•Representative buildings and groups of similar buildings are identified.•Different data normalisation procedures, clustering algorithms and building features are tested.•The novel predictive classification approach performs with average accuracy of 85%.•The method allows to partially overcome problems of data scarcity in urban datasets. |
---|---|
ISSN: | 0360-1323 1873-684X |
DOI: | 10.1016/j.buildenv.2018.05.035 |