Efficient learning of personalized visual preferences in daylit offices: An online elicitation framework

Human preference-based control in buildings may achieve maximum occupant satisfaction as well as energy savings. Adaptive and online learning methods are needed for learning human preferences, using the minimum amount of data possible with quantified uncertainty. This paper presents an online visual...

Full description

Saved in:

Bibliographic Details
Published in	Building and environment Vol. 181; p. 107013
Main Authors	Xiong, Jie, Awalgaonkar, Nimish M., Tzempelikos, Athanasios, Bilionis, Ilias, Karava, Panagiota
Format	Journal Article
Language	English
Published	Oxford Elsevier Ltd 15.08.2020 Elsevier BV
Subjects	Algorithms Daylighting Distance learning Elicitation framework Energy conservation Entropy Exploration Illuminance Indoor environments Machine learning Offices Online learning Personalized control Preferences Switching theory Teaching methods Uncertainty Visual discrimination learning Visual preferences Visual preferences Daylighting Online learning Elicitation framework Personalized control
Online Access	Get full text
ISSN	0360-1323 1873-684X
DOI	10.1016/j.buildenv.2020.107013

Cover

More Information
Summary:	Human preference-based control in buildings may achieve maximum occupant satisfaction as well as energy savings. Adaptive and online learning methods are needed for learning human preferences, using the minimum amount of data possible with quantified uncertainty. This paper presents an online visual preference elicitation learning framework, developed for efficiently learning occupants' visual preference profiles and hidden satisfaction utility functions in daylit offices. A combination of Thompson sampling and pure exploration (uncertainty learning) methods was used in the sequential elicitation framework, to determine the set of successive visual preference queries (for visual conditions) with most information gain. In this way, a balance between exploration and exploitation is realized and the area around the satisfaction utility maximum is learned with minimum number of steps. Experiments with human subjects were conducted in private sidelit offices to demonstrate the feasibility and performance of the learning framework. A single-variable model (vertical illuminance) was used to demonstrate the method and visualize results. The entropy of the distribution of the most preferred visual condition is computed for each learned preference profile to quantify the certainty and evaluate the learning efficiency. Our method learns most individual visual preferences to an acceptable certainty level within one day, and indicates the need for personalization. Finally, we discuss the integration of visual preferences into control applications. A switching algorithm is proposed, shifting iteratively between the learning and control modes depending on the certainty of the learned preference model. This work contributes to developing comprehensive online learning methods towards preference-based tuned indoor environments. •Online visual preference elicitation learning framework for daylit offices.•Combination of Thompson sampling and pure exploration (uncertainty learning) methods.•Integration of visual preferences into control applications.•Contributions towards preference-based tuned indoor environments.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14
ISSN:	0360-1323 1873-684X
DOI:	10.1016/j.buildenv.2020.107013