Identification of representative buildings and building groups in urban datasets using a novel pre-processing, classification, clustering and predictive modelling approach

The formulation of energy policies for urban building stock frequently requires the evaluation of the energy use of large numbers of buildings. When urban energy modelling is utilised as part of this process, the identification of building groups and associated representative buildings can play a cr...

Full description

Saved in:

Bibliographic Details
Published in	Building and environment Vol. 140; pp. 90 - 106
Main Authors	Tardioli, Giovanni, Kerrigan, Ruth, Oates, Mike, O'Donnell, James, Finn, Donal P.
Format	Journal Article
Language	English
Published	Oxford Elsevier Ltd 01.08.2018 Elsevier BV
Subjects	Algorithms Building clusters Building grouping Buildings Classification Clustering Datasets Decision trees Energy Energy consumption Energy modeling Energy policy Mathematical models Methodology Modelling Predictive classification Representative buildings Urban areas Urban development Urban energy modelling Building grouping Predictive classification Building clusters Clustering Urban energy modelling Representative buildings
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The formulation of energy policies for urban building stock frequently requires the evaluation of the energy use of large numbers of buildings. When urban energy modelling is utilised as part of this process, the identification of building groups and associated representative buildings can play a critical role. This paper outlines a novel methodology for identifying building groups and associated representative buildings in urban datasets. The methodology utilizes a combination of building classification, building clustering and predictive modelling. First, multiple urban-scale datasets are collected, and then, classification techniques and clustering algorithms are applied to identify building clusters. Next, representative buildings (medoids) in each cluster are identified. Predictive modelling is used to expand cluster membership in the case where some buildings were excluded from the previous analysis. A number of different clustering algorithms are assessed, including K-means and hierarchical (agglomerative and divisive) and partitioning around medoids. The methodology is applied to a large dataset of mixed-use buildings in the city of Geneva, Switzerland. The results, assessed by nine validation indices, indicate the capacity of the decision support framework to identify clusters and associated representative buildings. Furthermore, post-application of predictive modelling, using a random forest approach, facilitates the incorporation of a larger portion of the building stock within the established clusters with an overall average classification accuracy of 89%. A total of 67 representative buildings were identified in the urban dataset, which consisted of 13614 mixed-use buildings in the city of Geneva. •A novel methodology based on predictive classification and clustering is presented.•Representative buildings and groups of similar buildings are identified.•Different data normalisation procedures, clustering algorithms and building features are tested.•The novel predictive classification approach performs with average accuracy of 85%.•The method allows to partially overcome problems of data scarcity in urban datasets.
ISSN:	0360-1323 1873-684X
DOI:	10.1016/j.buildenv.2018.05.035