Identification of representative buildings and building groups in urban datasets using a novel pre-processing, classification, clustering and predictive modelling approach

The formulation of energy policies for urban building stock frequently requires the evaluation of the energy use of large numbers of buildings. When urban energy modelling is utilised as part of this process, the identification of building groups and associated representative buildings can play a cr...

Full description

Saved in:
Bibliographic Details
Published inBuilding and environment Vol. 140; pp. 90 - 106
Main Authors Tardioli, Giovanni, Kerrigan, Ruth, Oates, Mike, O'Donnell, James, Finn, Donal P.
Format Journal Article
LanguageEnglish
Published Oxford Elsevier Ltd 01.08.2018
Elsevier BV
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The formulation of energy policies for urban building stock frequently requires the evaluation of the energy use of large numbers of buildings. When urban energy modelling is utilised as part of this process, the identification of building groups and associated representative buildings can play a critical role. This paper outlines a novel methodology for identifying building groups and associated representative buildings in urban datasets. The methodology utilizes a combination of building classification, building clustering and predictive modelling. First, multiple urban-scale datasets are collected, and then, classification techniques and clustering algorithms are applied to identify building clusters. Next, representative buildings (medoids) in each cluster are identified. Predictive modelling is used to expand cluster membership in the case where some buildings were excluded from the previous analysis. A number of different clustering algorithms are assessed, including K-means and hierarchical (agglomerative and divisive) and partitioning around medoids. The methodology is applied to a large dataset of mixed-use buildings in the city of Geneva, Switzerland. The results, assessed by nine validation indices, indicate the capacity of the decision support framework to identify clusters and associated representative buildings. Furthermore, post-application of predictive modelling, using a random forest approach, facilitates the incorporation of a larger portion of the building stock within the established clusters with an overall average classification accuracy of 89%. A total of 67 representative buildings were identified in the urban dataset, which consisted of 13614 mixed-use buildings in the city of Geneva. •A novel methodology based on predictive classification and clustering is presented.•Representative buildings and groups of similar buildings are identified.•Different data normalisation procedures, clustering algorithms and building features are tested.•The novel predictive classification approach performs with average accuracy of 85%.•The method allows to partially overcome problems of data scarcity in urban datasets.
ISSN:0360-1323
1873-684X
DOI:10.1016/j.buildenv.2018.05.035