Multi-sensor Based Occupancy Prediction in a Multi-zone Office Building with Transformer

Buildings are responsible for approximately 40% of the world’s energy consumption and 36% of the total carbon dioxide emissions. Building occupancy is essential, enabling Occupant-Centric Control for zero emissions and decarbonization. Although existing machine learning and deep learning methods for building occupancy prediction have achieved remarked progress, their analyses remain limited when applied to complex real-world scenarios. Besides, there is a high expectation for Transformer algorithms to predict building occupancy accurately. Therefore, this paper presents an Occupancy Prediction Transformer network (OPTnet). We fuse and feed multi-sensor data (building occupancy, indoor environmental conditions, HVAC operations) into a Transformer model to forecast the future occupancy presence in multiple zones. We perform experimental analysis and compare it to different occupancy prediction methods (e.g., Decision Tree, Long Short-Term Memory networks, Multi-layer perceptron) and diverse time horizons (1,2,3,5,10,20,30mins). The performance metrics (e.g., Accuracy and Mean Squared Error) are employed to evaluate the effectiveness of prediction algorithms. Our OPTnet method achieves superior performance on our experimental two-week data compared to existing methods. The improved performance signifies its potential to enhance HVAC control systems and energy optimization strategies. We will make the code publicly available at https://github.com/kailaisun/occupancy-prediction-binary to promote transparency and reproducibility.

Keywords:

Subject: Engineering - Architecture, Building and Construction

1. Introduction

In recent years, the world has witnessed a significant surge in demand for energy, which can be attributed to a combination of factors, including a growing global population and rising living standards [1]. While these advancements have undoubtedly enhanced the quality of life, they have also substantially increased carbon emissions. This rise in emissions is a significant contributor to climate change, resulting in severe consequences like extreme weather events, alterations in weather patterns, and global warming [2]. As the global population continues to grow, the energy demand is expected to escalate further, posing a considerable challenge in balancing energy requirements with environmental sustainability. Therefore, enhancing energy efficiency is a widely discussed topic in energy. The focus is discovering methods to use less energy while still maintaining or improving the quality of the services provided. This concept applies to various sectors, including buildings, transportation, and industry, where systems are designed with energy efficiency in mind [3]. By striving for optimal energy efficiency in these sectors, significant economic benefits can be achieved by reducing the expenses associated with energy consumption.

Buildings are responsible for approximately 40% of the world’s energy consumption and release around 36% of the total carbon dioxide emissions [4]. Within building systems, the heating, ventilation, and air conditioning (HVAC) system is the most considerable portion, accounting for 40% of the energy usage [5]. Notably, the HVAC system plays a vital role in regulating indoor temperature [6,7], ensuring thermal comfort [8], and enhancing indoor air quality (IAQ) [9]. Buildings are carefully designed to provide occupants with a pleasant and comfortable environment. However, it is essential to recognize that the number and presence of occupants can directly impact energy consumption. Various studies have explored the connection between occupants’ behavior and energy consumption, aiming to understand their impact on energy consumption. In work [10], a scientific approach was adopted to quantify occupants’ behavior consistently. This approach encompassed factors such as occupants’ presence, movement, and interactions with the energy systems installed in buildings. The objective was to integrate these behavioral aspects into building performance simulation programs for a comprehensive analysis. A comprehensive review conducted by [11] highlighted that the need for adaptation to occupancy variations and insufficient consideration of irregular or partial occupancy are primary sources of inefficiency in building systems. Prior research has demonstrated significant reductions in building energy usage by aligning HVAC systems with actual occupancy patterns [12,13]. Such as based on the simulation results presented in the research conducted by [14], incorporating occupancy information has the potential to achieve energy savings ranging from 11% to 34% across different climatic regions, all while maintaining occupant comfort levels.

Review article [15] on occupancy prediction research highlights the existence of two distinct categories: "occupancy detection/estimation" [16,17] and "occupancy forecast." To effectively understand these categories, it is essential to consider the concept of the prediction window. In the context of occupancy prediction, "occupancy detection" pertains to predicting the occupancy for the current time step, providing real-time information about the current occupancy state. On the other hand, "occupancy forecast" involves predicting the occupancy for a future time step, enabling insights into future occupancy patterns. The research on occupancy prediction specifically focused on forecasting occupancy for future time windows needs to be improved and often conflated with detection methods. As a result, the significance of occupancy forecasting may need to be noticed or mixed with real-time occupancy detection.

Predicting future occupancy holds promise in facilitating building operations and energy efficiency [18]. Nevertheless, accurately predicting occupancy is complex due to the stochastic nature of occupant presence and the inherent variability in individual behavior [19,20]. The ongoing advancements in sensor technologies, data analytics, and prediction algorithms offer promising avenues for enhancing the accuracy and reliability of future occupancy predictions. In the past decade, there have been substantial advancements in forecast algorithms, leading to significant improvements in the accuracy of occupancy predictions. These forecast methods can be categorized into four main groups [15]: conventional statistical approaches (i.e., Markov chain-based [21] and recursive models [22]), unsupervised machine learning approaches (i.e., k-means, k-nearest neighbor techniques, and support vector clustering [23,24]), supervised machine learning approaches (i.e., gradient boosting [25], support vector regression [26], decision tree [27], random forest [28], and deep neural networks [29]), and hybrid approaches. Each group employs different techniques and methodologies to forecast occupancy patterns and behaviors.

In the realm of literature, predicting occupancy has been an area of significant interest for researchers. However, a common trend in this field is that most researchers rely on single sources to collect input data [15]. While this approach may seem expedient and convenient, it carries inherent risks. If a sensor fails, the collected data may miss out on valuable information, resulting in incomplete and potentially unreliable predictions. On the other hand, every source of data collection has its limitations. For example, PIR occupancy sensors are widely used in lighting controls, but they have limitations in detecting stationary occupants [30]. CO2-based approaches have constraints such as low sensitivity to occupant mobility and slow response to drastic occupancy changes [31]. WiFi-based monitoring systems may also face challenges such as connection problems, limited battery life of connected devices, and poor connections for accurate occupancy detection in large-scale buildings with many occupants [32]. These factors can significantly affect the performance of the data collection approaches. Therefore, a better solution is to utilize multi-source data collection approaches. By adopting this approach, researchers can reduce the risk of missing out on valuable information in case one source fails. This strategy is especially critical in occupancy prediction situations where the presented information carries significant weight and is of utmost importance. Researchers can use multiple sources to ensure that their predictions are accurate, reliable, and comprehensive. However, the multi-source data for occupancy prediction needs to be improved in buildings.

To address the above problems, we introduce an Occupancy Prediction Transformer network (OPTnet) for building occupancy prediction. It can robustly predict occupancy presence in diverse rooms and time horizons. We fuse and feed multi-sensor data into a Transformer model to obtain the future occupancy presence in multiple zones. We provide experimental analysis and comparison between existing occupancy prediction methods and diverse time horizons. The main contributions of this paper are as follows:

We introduce OPTnet, a Transformer-based multi-sensor building occupancy prediction network to learn an effective fused representation.
To predict accurate occupancy, we process two-week real operating sensor data from a multi-zone office building, including building occupancy, indoor environmental conditions, and HVAC operations.
Through experimental analysis and comparison, we found that the OPTnet method outperformed existing algorithms (e.g., Decision Tree (DT), Long Short-Term Memory networks (LSTM), Multi-Layer Perceptron (MLP)).
Considering the long or short occupancy prediction applications, we provide a comprehensive analysis and comparison of diverse time horizons to highlight the importance of choosing the suitable time horizon.

2. Methods

This research aims to predict occupancy using a dataset from our previous work [33]. The dataset was collected in an office building in Hebei Province, China, from August 9th to August 21st, 2021, spanning two weeks. Further details regarding the dataset can be found in Section 3.

The research framework in this study consists of four key steps. Firstly, sensor data and corresponding occupant information were collected from the office building, serving as the ground truth for occupancy. Secondly, data normalization techniques were applied to standardize all features to a single scale or range. Thirdly, occupancy prediction was performed using the OPTnet, and its performance was compared with various machine learning algorithms, including DT, LSTM, and MLP. Lastly, performance evaluation metrics were employed to assess the accuracy and effectiveness of the prediction algorithms. The comprehensive methodology, which included each step, is thoroughly explained in the following subsections and visually illustrated in Figure 1.

2.1. Data collection

The dataset employed in this study was constructed by gathering 829,440 data points derived from the system’s operational data from August 9 to August 21, 2021. To ensure the dataset’s relevance, it was filtered to exclusively include instances when the HVAC system was active, specifically during weekdays (Monday to Friday) from 9:00 am to 7:00 pm. Consequently, the dataset encompasses data collected at a resolution of 1 minute during the operational hours of the HVAC system. The dataset spans two weeks: August 9 to 13th (week 1) and August 16 to 20th (week 2), amounting to 10 days. Each day, 600 data samples are recorded, comprising 54 numerical values (6 zones and 9 features). Therefore, the complete dataset encompasses 324,000 numerical values (10 × 600 × 54).

2.2. Data preparation

The dataset utilized in the system consists of raw data with numerical values spanning diverse ranges. For instance, the indoor temperature across each zone fluctuates between 22-32

^{\circ} C

, the number of occupants varies from 0 to 10, and the control signal for the FCUs (Fan Coil Units) ranges from 0 to 3.

Such disparate distributions in the raw data introduce complexity during the training process. To address this challenge, data normalization techniques are employed. Normalization mitigates the risk of gradient explosion in deep learning, accelerates convergence, stabilizes training, and enhances the model’s overall performance [34]. Prior to inputting the data into the algorithms, all raw data undergo normalization through the following steps:

\begin{matrix} x (t) = \frac{n (t) - n_{m i n}}{n_{m a x} - n_{m i n}}, \end{matrix}

(1)

where

n (t)

denote the true number of occupants at time t, while

n_{m i n}

and

n_{m a x}

respectively denote the minimum and maximum number of occupants.

Furthermore, when prediction algorithms generate the predicted occupancy, the output value is converted to the corresponding number of occupants through the following transformation:

\begin{matrix} \bar{n} (t) = \bar{x} (t) * (n_{m a x} - n_{m i n}) + n_{m i n}, \end{matrix}

(2)

where the predicted value of algorithms is denoted as

\bar{x} (t)

, while the predicted number of occupants is represented by

\bar{n} (t)

2.3. Algorithms

This research paper presents OPTnet, which is designed explicitly for building occupancy prediction. Furthermore, a comprehensive comparative analysis is conducted to assess the performance of this algorithm in comparison to established machine learning techniques, including DT, LSTM, and MLP. The subsequent sections of the paper delve into detailed explanations and insights into these algorithms’ underlying principles and operational mechanisms.

2.3.1. Decision Tree

The DT models categorize and generalize datasets into predefined classes in data analysis and machine learning. The primary objective of a decision tree is to construct a classification model that can predict the value of a target attribute (response) based on multiple input attributes (predictors). Each internal node or leaf node within the decision tree corresponds to one of the predictors, and the number of branches emerging from a categorical internal node (leaf node) is equivalent to the possible values of the associated predictor. The leaf nodes represent specific values of the response variable and are reached by traversing the path from the root node, which is the starting point of the tree, to the final leaf (possible answers) [35].

2.3.2. Long Short-Term Memory networks

The LSTM network is a specialized variant of recurrent neural networks (RNNs) developed in 1997 [36]. Designed to address the challenges posed by vanishing and exploding gradients in standard RNNs, LSTM networks leverage the Back-propagation Through Time (BPTT) algorithm to train and excel in tasks involving long-term dependencies. In contrast to conventional neuron-based architectures, LSTM networks feature memory blocks consisting of memory cell units capable of retaining state values over extended periods. Moreover, these memory blocks incorporate three distinct gate units responsible for learning how to preserve, utilize, or discard states as needed. The connectivity between memory blocks is established through layers, facilitating the overall functionality and effectiveness of LSTM networks [37].

2.3.3. Multi-Layer Perceptron

The MLP is a feed-forward artificial neural network (ANN) that draws inspiration from the functioning of the human brain [38]. This network comprises at least three layers of neurons, specifically the input, hidden, and output layers. MLP can effectively capture non-linear relationships between predictor variables and labels by employing activation functions, except for the input layer. In this study, the rectified linear unit (ReLU) activation function is implemented in the hidden layers, as it is commonly recommended for developing neural networks [39]. Additionally, linear and sigmoid functions are adopted in the output layers for regression and classification models. To facilitate the learning process, backpropagation, a supervised learning technique, determines optimal weights and bias values for each neuron.

2.3.4. Occupancy Prediction Transformer network

With the rapid development of Chatgpt and visual foundation models, the Transformer has become a state-of-the-art (SOTA) deep learning method. Transformer, a neural network, is adequate for dealing with sequence-to-sequence (seq2seq) tasks and learning a deep understanding of sequential data. Inspired by RNNs, the Transformer follows the encoder-decoder architecture to learn aggregated hidden-layer features. Unlike RNNs, the Transformer does not perform data processing in sequential order but processes the sequential input data in parallel. In particular, encoders and decoders, composed of multiple self-attention layers, are stacked to extract multi-layer features. Multi-head attention mechanisms are applied to learn the correlation between tokens.

\begin{matrix} \begin{matrix} A t t e n t i o n (Q, K, V) = s o f t m a x (\frac{Q K^{T}}{\sqrt{d_{k}}}) V \end{matrix} \end{matrix}

(3)

In this paper, we develop an OPTnet for the occupancy prediction model. We formulate the occupancy prediction as a sequence prediction problem. In Figure 2, we treat the history occupancy and environmental factors as the OPTnet’s inputs while treating future occupancy information (presence or number) as the OPTnet’s outputs. The structure OPTnet is shown in Figure 2.

2.4. Performance Evaluation

We define two evaluation indicators for occupancy prediction as follows:

\begin{matrix} \begin{matrix} MSE & = \frac{1}{N} \sum_{f = 1}^{N} {(n (t) - \bar{n} (t))}^{2}, \\ Accuracy & = \frac{1}{N} \sum_{t = 1}^{N} (1 - sign | n (t) - \bar{n} (t) |), \end{matrix} \end{matrix}

(4)

where

n (t)

is the true number of occupants in the time t, and

\bar{n} (t)

represents the predicted number of occupants. N is the time length. The Mean Squared Error (MSE) indicates the difference between the predicted value and the ground truth, while Accuracy indicates the hit rate of occupancy prediction. The performance is better when the MSE is smaller and the Accuracy is bigger.

3. Experiments

3.1. Experimental environment

Our experimental system in Hebei, China, represents a multi-zone office environment. It comprises seven distinct working zones, a refrigeration station, and an activity room. Within the system, we employ various components to facilitate efficient operation. These include an air source heat pump (HP) for cooling purposes, variable frequency water pumps for circulation, Fan Coil Unit (FCU) for indoor HVAC control, and cameras for video capture to monitor the environment.

The building is divided into nine regions: an activity room, a refrigeration station, and seven working zones. Zones 4 to 6 are virtually separated from a more extensive zone, following the specifications of the IoT system deployment. Figure 3 provides an overview of the entire office building system, showcasing the arrangement and functionalities of each zone. It is important to note that data from zone 3, which is the financial office, is not publicly accessible. Therefore, our experiments focus on the remaining zones (1, 2, and 4-7).

Each zone has an indoor temperature sensor, a device management panel for controlling the FCU, a video recording camera, and a Jetson Nano for video processing and zone occupancy estimation. Table 1 lists relevant sensors and actuators. Please refer to our previous work in [33] for more detailed information on the experimental system and IoT architectures.

3.2. Experimental data

Given the substantial influence of office occupants on the energy performance of an office, the experimental data for the occupancy prediction model is based on the routines and behaviors of these occupants. These data are categorized into four groups: calendar, occupancy, indoor environment, and HVAC control.

The calendar information: We collected the sensor data from 9:00 to 19:00 during the five working days (from Monday to Friday) and weekends (Saturday and Sunday).
The occupancy information: We captured videos from our cameras. Then, we analyzed and estimated occupancy presence (1 or 0) in each room using advanced artificial intelligence technologies. The time resolution is 1 min. The duty ratios of occupancy in multi-zones are shown in Table 2. The duty ratios are various, showing that the practical dataset is diverse and complete.
The indoor environment information: We collected the indoor temperature and relative humidity data, directly affecting the occupants’ thermal comfort. The temperature and relative humidity data can be used to predict future occupancy.
The HVAC control information: Our HVAC system employs FCUs for control. The control signs (FCU temperature feedback, FCU control mode, FCU on/off feedback, and FCU fan feedback) are considered for occupancy prediction.

3.3. Experimental parameters

A recommendation in [40] emphasizes the importance and effectiveness of occupancy prediction models from occupancy-based HVAC control systems.

In our experiments, We chose the historical multi-sensor data (including occupancy presence, HVAC control, FCU temp feedback, FCU control mode, FCU on-off feedback, FCU fan feedback, room temperature 1 and 2, room relative humidity 1 and 2) as methods’ inputs. We chose occupancy presence (1 or 0) as the methods’ outputs. We compare famous occupancy prediction methods (DT, LSTM, MLP) and OPTnet. To compare LSTM and OPTnet reasonably, we fixed the hyperparameters:

The Adam optimizer trained the LSTM and OPTnet model for 20 epochs.
The learning rate is $10^{- 4}$ .
The batch size is 4.
The numbers of LSTM and TOPTnet layers are 6.
The number of fully-connected layers is 5.
The dropout of the last layer is 0.5.
MSE loss function.

Besides, considering the long or short prediction applications, we used the 30-min historical multi-sensor data to predict occupancy presence in diverse time horizons. In other words, we compared the occupancy prediction performance with 1-min, 2-min, 5-min, 10-min, 20-min, and 30-min horizons.

4. Results and Discussion

The DT, LSTM, MLP, and OPTnet algorithms were implemented on a dataset obtained from a multi-zone office building. The performance of these algorithms with different time horizons was evaluated by measuring their accuracy and MSE values for each zone. The results are presented in Table 3 and Table 4, respectively, showcasing the accuracy and the corresponding MSE.

4.1. OPTnet V.S. (LSTM,MLP,DT)

In Table 3, it is evident that the OPTnet showcases exceptional accuracy values for both weeks in Zones 1 and 4. Across various time horizons, the OPTnet consistently outperforms other machine learning algorithms, demonstrating its superior predictive capabilities. Moving to Zone 2, the accuracy values of the DT algorithm are consistently perfect (the accuracy is 1) for week 1 but comparatively lower for week 2 compared to other machine learning algorithms. However, the Transformer algorithm shows high and consistent accuracy values for week 1 across different time horizons, and even higher accuracy values for week 2, surpassing the performance of other machine learning algorithms.

In Zone 5, the OPTnet achieves high accuracy values for both weeks, particularly for smaller time horizons such as 1 minute and 2 minutes. However, as the time horizon increases, the accuracy of the OPTnet decreases. Shifting to Zone 6, the accuracy values of the MLP algorithm are consistently higher than other machine learning algorithms for different time horizons in both weeks. Lastly, in zone 7, the OPTnet demonstrates higher accuracy values than other machine learning algorithms for week 1, while the MLP algorithm outperforms other algorithms for week 2.

Upon careful analysis of the results, specifically the accuracy and MSE values for each zone, it becomes evident that the OPTnet mostly outperforms other machine learning algorithms among the diverse duty ratios in Table 2. Here are some reasons why the OPTnet outperform these methods:

Occupancy patterns in buildings can exhibit long-range dependencies, where the presence or absence of occupants in one area can impact the occupancy in other areas. The self-attention mechanism in the OPTnet allows it to capture such long-range dependencies effectively. In contrast, DT, LSTM, and MLP struggle to model these dependencies explicitly.
Occupancy patterns often have temporal dynamics, where the presence or absence of occupants at one time influences the future occupancy. The OPTnet, with its self-attention mechanism, can capture these temporal dynamics by attending to relevant past occupancy information at each time step. On the other hand, DT typically considers each time step independently, LSTM focuses on short-term dependencies, and MLP lacks inherent mechanisms for capturing temporal dynamics.
OPTnet can use parallel computation, making them highly scalable and efficient, especially when dealing with large datasets. This scalability allows the Transformer model to handle complex occupancy prediction tasks efficiently. In comparison, DT, LSTM, and MLP may need to improve scalability and computational efficiency, mainly when dealing with longer sequences or large datasets.
OPTnet has shown robustness to noisy data due to its ability to attend to relevant information and suppress noise during the attention mechanism. This robustness can benefit occupancy prediction tasks, where the data may contain missing or noisy observations. DT, LSTM, and MLP are more sensitive to noisy data and require additional preprocessing or regularization techniques to handle such scenarios.

The superior performance of the OPTnet highlights its effectiveness in accurately predicting occupancy patterns within different building zones. This outcome underscores the significance of utilizing the OPTnet as a reliable and robust approach for occupancy prediction in diverse environments. The improved performance of the OPTnet signifies its potential to enhance the efficiency and effectiveness of various applications that rely on accurate occupancy forecasts, such as HVAC control systems, energy optimization strategies, and overall building management.

4.2. Time horizons V.S. Performance

We noticed a clear pattern after analyzing the performance of the OPTnet and other machine learning algorithms across different time horizons. As the time horizon gets longer, the accuracy of each algorithm tends to decrease while the MSE value tends to increase. This finding highlights the importance of selecting an appropriate time horizon based on the specific application requirements. The effect of time horizon on algorithm performance highlights the importance of choosing the right time window for different purposes. Short-term time horizons are beneficial for applications needing instant occupancy predictions or real-time monitoring. This allows better capture and response to short-term occupancy changes with higher accuracy. On the other hand, longer time horizons are better for applications that focus on long-term occupancy forecasting and trend analysis. Even though the accuracy may be slightly lower, having a broader view of occupancy patterns and trends over a more extended period is valuable for tasks like energy planning, resource allocation, and managing occupancy in the long run.

There are two exceptions against the point that the accuracy decreases with the increasing horizons. In Zone 4 and 6, we found the accuracy is increasing. We noticed that in Table. Table 2, the duty ratios of occupancy in Zone 4 and 6 are very high (0.919,0.816, 0.898, 0.898). With the increasing horizons (1,2,5,10,20,30 mins), in the historical occupancy data, the weight of occupancy presence is becoming bigger while the weight of occupancy absence is becoming smaller. Thus, the OPTnet and machine learning algorithms will become more conservative to achieve high accuracy, even though we use balanced class weights to train our models. When the time horizon grows, conservative models converge to the class with the most duty ratios and predict occupancy presence. It is a limitation of our proposed framework.

5. Conclusion

Buildings are responsible for a significant portion of global energy consumption, accounting for around 40% of the total. Additionally, they contribute to approximately 36% of the total carbon dioxide emissions. The occupancy of buildings plays a crucial role in achieving Occupant-Centric Control for zero emissions and decarbonization. This paper introduced an OPTnet for occupancy prediction to address these challenges. The framework integrated data from multiple sensors, including building occupancy, indoor environmental conditions, and HVAC operations, into an OPTnet to forecast future occupancy presence in multiple zones. Experimental analysis and comparisons are conducted among various occupancy prediction methods, including OPTnet, DT, LSTM, and MLP, across different time horizons (1, 2, 3, 5, 10, 20, and 30 minutes). Our OPTnet method performs better than existing methods when applied to our practical two-week dataset. This improved performance highlights its potential to enhance HVAC control systems and energy optimization strategies. By accurately predicting occupancy patterns, the OPTnet-based approach can contribute to more efficient building management and ultimately facilitate the reduction of energy consumption and environmental impact.

Funding

This work is supported by National Natural Science Foundation of China under Grant No. 62192751 and 61425027, in part by Key R&D Project of China under Grant No. 2017YFC0704100, 2016YFB0901900, the 111 International Collaboration Program of China under Grant No. BP2018006, 2019 Major Science and Technology Program for the Strategic Emerging Industries of Fuzhou under Grant No. 2019-Z-1, and in part by the BNRist Program under Grant No. BNR2019TD01009, the National Innovation Center of High Speed Train R&D project (CX/KJ-2020-0006).

Conflicts of Interest

The authors declare no conflict of interest.

References

Bazazzadeh, H.; Pilechiha, P.; Nadolny, A.; Mahdavinejad, M.; Hashemi safaei, S.s. The impact assessment of climate change on building energy consumption in Poland. Energies 2021, 14, 4084. [CrossRef]
Kavari, G.; Tahani, M.; Mirhosseini, M. Wind shear effect on aerodynamic performance and energy production of horizontal axis wind turbines with developing blade element momentum theory. Journal of Cleaner Production 2019, 219, 368–376. [CrossRef]
Kabbaj, O.A.; Péan, L.M.; Masson, J.B.; Marhic, B.; Delahoche, L. Occupancy states forecasting with a hidden Markov model for incomplete data, exploiting daily periodicity. Energy and Buildings 2023, 287, 112985. [CrossRef]
Li, Z.; Wang, P.; Zhang, J.; Mu, S. A strategy of improving indoor air temperature prediction in HVAC system based on multivariate transfer entropy. Building and Environment 2022, 219, 109164. [CrossRef]
Idahosa, L.O.; Akotey, J.O. A social constructionist approach to managing HVAC energy consumption using social norms–A randomised field experiment. Energy Policy 2021, 154, 112293. [CrossRef]
Jung, S.; Jeoung, J.; Hong, T. Occupant-centered real-time control of indoor temperature using deep learning algorithms. Building and Environment 2022, 208, 108633. [CrossRef]
Xing, T.; Sun, K.; Zhao, Q. MITP-Net: A deep-learning framework for short-term indoor temperature predictions in multi-zone buildings. Building and Environment 2023, 239, 110388. doi:https://doi.org/10.1016/j.buildenv.2023.110388. [CrossRef]
Li, W.; Zhang, J.; Zhao, T.; Ren, J. Experimental study of an indoor temperature fuzzy control method for thermal comfort and energy saving using wristband device. Building and Environment 2021, 187, 107432. [CrossRef]
Che, W.W.; Tso, C.Y.; Sun, L.; Ip, D.Y.; Lee, H.; Chao, C.Y.; Lau, A.K. Energy consumption, indoor thermal comfort and air quality in a commercial office with retrofitted heat, ventilation and air conditioning (HVAC) system. Energy and Buildings 2019, 201, 202–215. [CrossRef]
Yan, D.; Hong, T.; Dong, B.; Mahdavi, A.; D’Oca, S.; Gaetani, I.; Feng, X. IEA EBC Annex 66: Definition and simulation of occupant behavior in buildings. Energy and Buildings 2017, 156, 258–270. [CrossRef]
Park, J.Y.; Ouf, M.M.; Gunay, B.; Peng, Y.; O’Brien, W.; Kjærgaard, M.B.; Nagy, Z. A critical review of field implementations of occupant-centric building controls. Building and Environment 2019, 165, 106351. [CrossRef]
Dai, X.; Liu, J.; Zhang, X. A review of studies applying machine learning models to predict occupancy and window-opening behaviours in smart buildings. Energy and Buildings 2020, 223, 110159. [CrossRef]
Wang, W.; Wang, J.; Chen, J.; Huang, G.; Guo, X. Multi-zone outdoor air coordination through Wi-Fi probe-based occupancy sensing. Energy and Buildings 2018, 159, 495–507. [CrossRef]
Wang, C.; Pattawi, K.; Lee, H. Energy saving impact of occupancy-driven thermostat for residential buildings. Energy and Buildings 2020, 211, 109791. [CrossRef]
Jin, Y.; Yan, D.; Chong, A.; Dong, B.; An, J. Building occupancy forecasting: A systematical and critical review. Energy and Buildings 2021, 251, 111345. [CrossRef]
Sun, K.; Ma, X.; Liu, P.; Zhao, Q. MPSN: Motion-aware Pseudo-Siamese Network for indoor video head detection in buildings. Building and Environment 2022, 222, 109354. doi:https://doi.org/10.1016/j.buildenv.2022.109354. [CrossRef]
Choi, H.; Um, C.Y.; Kang, K.; Kim, H.; Kim, T. Review of vision-based occupant information sensing systems for occupant-centric control. Building and Environment 2021, 203, 108064. doi:https://doi.org/10.1016/j.buildenv.2021.108064. [CrossRef]
Esrafilian-Najafabadi, M.; Haghighat, F. Impact of occupancy prediction models on building HVAC control system performance: Application of machine learning techniques. Energy and Buildings 2022, 257, 111808. [CrossRef]
Esrafilian-Najafabadi, M.; Haghighat, F. Impact of predictor variables on the performance of future occupancy prediction: Feature selection using genetic algorithms and machine learning. Building and Environment 2022, 219, 109152. [CrossRef]
Sun, K.; Zhao, Q.; Zou, J. A review of building occupancy measurement systems. Energy and Buildings 2020, 216, 109965. doi:https://doi.org/10.1016/j.enbuild.2020.109965. [CrossRef]
Candanedo, L.M.; Feldheim, V.; Deramaix, D. A methodology based on Hidden Markov Models for occupancy detection and a case study in a low energy residential building. Energy and Buildings 2017, 148, 327–341. [CrossRef]
Gunay, H.B.; O’Brien, W.; Beausoleil-Morrison, I. Development of an occupancy learning algorithm for terminal heating and cooling units. Building and Environment 2015, 93, 71–85. [CrossRef]
Sun, Y.; Hao, W.; Chen, Y.; Liu, B. Data-driven occupant-behavior analytics for residential buildings. Energy 2020, 206, 118100. [CrossRef]
Jiefan, G.; Peng, X.; Zhihong, P.; Yongbao, C.; Ying, J.; Zhe, C. Extracting typical occupancy data of different buildings from mobile positioning data. Energy and buildings 2018, 180, 135–145. [CrossRef]
Razavi, R.; Gharipour, A.; Fleury, M.; Akpan, I.J. Occupancy detection of residential buildings using smart meter data: A large-scale study. Energy and Buildings 2019, 183, 195–208. [CrossRef]
Jain, R.K.; Smith, K.M.; Culligan, P.J.; Taylor, J.E. Forecasting energy consumption of multi-family residential buildings using support vector regression: Investigating the impact of temporal and spatial monitoring granularity on performance accuracy. Applied Energy 2014, 123, 168–178. [CrossRef]
D’Oca, S.; Hong, T. Occupancy schedules learning process through a data mining framework. Energy and Buildings 2015, 88, 395–408. [CrossRef]
Chen, Y.T.; Piedad Jr, E.; Kuo, C.C. Energy consumption load forecasting using a level-based random forest classifier. Symmetry 2019, 11, 956. [CrossRef]
Chen, Y.T.; Piedad Jr, E.; Kuo, C.C. Energy consumption load forecasting using a level-based random forest classifier. Symmetry 2019, 11, 956. [CrossRef]
Benezeth, Y.; Laurent, H.; Emile, B.; Rosenberger, C. Towards a sensor for detecting human presence and characterizing activity. Energy and Buildings 2011, 43, 305–314. [CrossRef]
Yang, J.; Santamouris, M.; Lee, S.E. Review of occupancy sensing systems and occupancy modeling methodologies for the application in institutional buildings. Energy and Buildings 2016, 121, 344–349. [CrossRef]
Wang, W.; Chen, J.; Song, X. Modeling and predicting occupancy profile in office space with a Wi-Fi probe-based Dynamic Markov Time-Window Inference approach. Building and Environment 2017, 124, 130–142. [CrossRef]
Xing, T.; Yan, H.; Sun, K.; Wang, Y.; Wang, X.; Zhao, Q. Honeycomb: An open-source distributed system for smart buildings. Patterns 2022, 3, 100605. [CrossRef]
Xing, T.; Sun, K.; Zhao, Q. MITP-Net: A deep-learning framework for short-term indoor temperature predictions in multi-zone buildings. Building and Environment 2023, 239, 110388. [CrossRef]
D’Oca, S.; Hong, T. Occupancy schedules learning process through a data mining framework. Energy and Buildings 2015, 88, 395–408. [CrossRef]
Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural computation 1997, 9, 1735–1780.
Amadou Boukary, N.; others. A comparison of time series forecasting learning algorithms on the task of predicting event timing. PhD thesis, 2016.
Noriega, L. Multilayer perceptron tutorial. School of Computing. Staffordshire University 2005, 4, 5.
Reed, R.; MarksII, R.J. Neural smithing: supervised learning in feedforward artificial neural networks; Mit Press, 1999.
Esrafilian-Najafabadi, M.; Haghighat, F. Impact of occupancy prediction models on building HVAC control system performance: Application of machine learning techniques. Energy and Buildings 2022, 257, 111808. [CrossRef]

Figure 1. Illustration of the step-by-step methodology for occupancy prediction in buildings.

Figure 2. The structure of OPTnet.

Figure 3. The experimental multi-zone office [33].

Table 1. Parameter details of Equipments.

Device	Brand/Model	Parameter details
FCU	FP-102 and FP-136	Low: $510 / 680 m^{3} / h$
		Medium: $765 / 1020 m^{3} / h$
		High: $1020 / 1360 m^{3} / h$
ITS	HL(HaiLin)-SRT2R1	Range: $0^{\circ} C$ to $50^{\circ} C$
		Accuracy: $\pm 0 . 3^{\circ} C$
OTS	HL(HaiLin)-SOT6R4	Range: $- 20^{\circ} C$ to $80^{\circ} C$
		Accuracy: $\pm 0 . 5^{\circ} C$
Camera	TC471-HD	Range: $> 30 c m$
		Resolution: $1920 \times 1080$
		30fps
Nano	Nvidia Jetson Nano-B01	GPU: 128 NVIDIA CUDA cores
		RAM: 4G

Table 2. Duty ratios of occupancy in multi-zones.

Weeks	Zone 1	Zone 2	Zone 4	Zone 5	Zone 6	Zone 7
Week 1	0.314	0.504	0.919	0.405	0.898	0.611
Weeke2	0.084	0.854	0.816	0.353	0.898	0.256

Table 3. The accuracy evaluation for zone 1,2,4-7

Time horizon		Week-1				Week-2
				Zone 1
	DT	LSTM	MLP	OPTnet	DT	LSTM	MLP	OPTnet
1 min	$0.818$	$0.949$	$0.686$	$0.951$	$0.854$	$0.961$	$0.916$	$0.967$
2 mins	$0.782$	$0.923$	$0.909$	$0.915$	$0.928$	$0.942$	$0.945$	$0.931$
5 mins	$0.793$	$0.821$	$0.812$	$0.82$	$0.861$	$0.92$	$0.911$	$0.92$
10 mins	$0.772$	$0.682$	$0.596$	$0.774$	$0.798$	$0.901$	$0.912$	$0.922$
20 mins	$0.701$	$0.622$	$0.478$	$0.716$	$0.825$	$0.824$	$0.767$	$0.857$
30 mins	$0.665$	$0.585$	$0.549$	$0.676$	$0.851$	$0.654$	$0.55$	$0.827$
				Zone 2
1 min	$1$	$0.798$	$0.504$	$0.751$	$0.777$	$0.865$	$0.854$	$0.863$
2 mins	$1$	$0.703$	$0.847$	$0.768$	$0.746$	$0.852$	$0.86$	$0.854$
5 mins	$1$	$0.679$	$0.814$	$0.773$	$0.772$	$0.829$	$0.834$	$0.854$
10 mins	$1$	$0.497$	$0.596$	$0.782$	$0.754$	$0.852$	$0.818$	$0.853$
20 mins	$0.984$	$0.498$	$0.787$	$0.714$	$0.736$	$0.849$	$0.839$	$0.849$
30 mins	$0.953$	$0.499$	$0.803$	$0.778$	$0.733$	$0.847$	$0.818$	$0.847$
				Zone 4
1 min	$0.893$	$0.919$	$0.919$	$0.919$	$0.718$	$0.816$	$0.816$	$0.9$
2 mins	$0.912$	$0.92$	$0.92$	$0.952$	$0.774$	$0.815$	$0.815$	$0.837$
5 mins	$0.899$	$0.938$	$0.923$	$0.923$	$0.864$	$0.814$	$0.855$	$0.817$
10 mins	$0.917$	$0.927$	$0.927$	$0.927$	$0.821$	$0.819$	$0.819$	$0.819$
20 mins	$0.909$	$0.935$	$0.935$	$0.935$	$0.836$	$0.824$	$0.824$	$0.824$
30 mins	$0.924$	$0.945$	$0.945$	$0.945$	$0.824$	$0.827$	$0.827$	$0.827$
				Zone 5
1 min	$0.779$	$0.854$	$0.405$	$0.896$	$0.411$	$0.775$	$0.353$	$0.882$
2 mins	$0.788$	$0.798$	$0.839$	$0.842$	$0.414$	$0.699$	$0.75$	$0.352$
5 mins	$0.748$	$0.77$	$0.796$	$0.777$	$0.559$	$0.367$	$0.744$	$0.359$
10 mins	$0.696$	$0.723$	$0.756$	$0.673$	$0.468$	$0.65$	$0.641$	$0.35$
20 mins	$0.631$	$0.426$	$0.61$	$0.629$	$0.474$	$0.348$	$0.343$	$0.368$
30 mins	$0.596$	$0.402$	$0.539$	$0.59$	$0.452$	$0.344$	$0.608$	$0.361$
				Zone 6
1 min	$0.872$	$0.961$	$0.96$	$0.96$	$0.895$	$0.963$	$0.96$	$0.96$
2 mins	$0.837$	$0.953$	$0.946$	$0.947$	$0.907$	$0.951$	$0.946$	$0.95$
5 mins	$0.872$	$0.915$	$0.939$	$0.933$	$0.823$	$0.915$	$0.934$	$0.929$
10 mins	$0.85$	$0.886$	$0.906$	$0.883$	$0.86$	$0.883$	$0.903$	$0.879$
20 mins	$0.72$	$0.863$	$0.875$	$0.815$	$0.707$	$0.848$	$0.873$	$0.864$
30 mins	$0.924$	$0.891$	$0.83$	$0.898$	$0.824$	$0.885$	$0.836$	$0.898$
				Zone 7
1 min	$0.756$	$0.921$	$0.912$	$0.932$	$0.854$	$0.965$	$0.968$	$0.97$
2 mins	$0.824$	$0.888$	$0.87$	$0.889$	$0.899$	$0.898$	$0.95$	$0.93$
5 mins	$0.783$	$0.82$	$0.862$	$0.809$	$0.833$	$0.875$	$0.938$	$0.817$
10 mins	$0.763$	$0.645$	$0.723$	$0.764$	$0.798$	$0.745$	$0.908$	$0.81$
20 mins	$0.667$	$0.608$	$0.695$	$0.724$	$0.834$	$0.775$	$0.877$	$0.779$
30 mins	$0.665$	$0.578$	$0.577$	$0.685$	$0.744$	$0.704$	$0.833$	$0.719$

Table 4. The MSE evaluation for zone 1,2,4-7

Time horizon		Week-1				Week-2
				Zone 1
	DT	LSTM	MLP	OPTnet	DT	LSTM	MLP	OPTnet
1 min	$0.182$	$0.051$	$0.314$	$0.049$	$0.146$	$0.039$	$0.084$	$0.033$
2 mins	$0.218$	$0.077$	$0.091$	$0.085$	$0.092$	$0.058$	$0.055$	$0.069$
5 mins	$0.207$	$0.179$	$0.188$	$0.18$	$0.139$	$0.08$	$0.089$	$0.081$
10 mins	$0.228$	$0.318$	$0.259$	$0.226$	$0.202$	$0.099$	$0.088$	$0.078$
20 mins	$0.299$	$0.372$	$0.378$	$0.284$	$0.175$	$0.176$	$0.233$	$0.143$
30 mins	$0.335$	$0.428$	$0.415$	$0.324$	$0.149$	$0.46$	$0.344$	$0.173$
				Zone 2
1 min	$0.0$	$0.202$	$0.496$	$0.249$	$0.223$	$0.135$	$0.146$	$0.137$
2 mins	$0.0$	$0.297$	$0.153$	$0.085$	$0.266$	$0.148$	$0.14$	$0.146$
5 mins	$0.0$	$0.321$	$0.186$	$0.227$	$0.228$	$0.171$	$0.166$	$0.147$
10 mins	$0.0$	$0.503$	$0.171$	$0.218$	$0.246$	$0.148$	$0.182$	$0.148$
20 mins	$0.016$	$0.502$	$0.213$	$0.286$	$0.264$	$0.151$	$0.161$	$0.151$
30 mins	$0.047$	$0.501$	$0.197$	$0.222$	$0.267$	$0.153$	$0.182$	$0.153$
				Zone 4
1 min	$0.107$	$0.081$	$0.081$	$0.081$	$0.282$	$0.184$	$0.186$	$0.1$
2 mins	$0.088$	$0.08$	$0.08$	$0.048$	$0.226$	$0.185$	$0.185$	$0.163$
5 mins	$0.101$	$0.062$	$0.077$	$0.77$	$0.136$	$0.186$	$0.145$	$0.183$
10 mins	$0.093$	$0.073$	$0.073$	$0.073$	$0.179$	$0.181$	$0.181$	$0.181$
20 mins	$0.091$	$0.065$	$0.065$	$0.065$	$0.164$	$0.176$	$0.176$	$0.176$
30 mins	$0.076$	$0.055$	$0.055$	$0.055$	$0.176$	$0.173$	$0.173$	$0.173$
				Zone 5
1 min	$0.221$	$0.146$	$0.595$	$0.104$	$0.589$	$0.225$	$0.647$	$0.118$
2 mins	$0.212$	$0.202$	$0.161$	$0.158$	$0.586$	$0.301$	$0.25$	$0.648$
5 mins	$0.252$	$0.23$	$0.204$	$0.223$	$0.441$	$0.633$	$0.256$	$0.641$
10 mins	$0.304$	$0.277$	$0.244$	$0.327$	$0.532$	$0.65$	$0.359$	$0.35$
20 mins	$0.369$	$0.574$	$0.39$	$0.371$	$0.526$	$0.652$	$0.657$	$0.632$
30 mins	$0.404$	$0.598$	$0.461$	$0.41$	$0.548$	$0.656$	$0.392$	$0.639$
				Zone 6
1 min	$0.128$	$0.039$	$0.04$	$0.04$	$0.105$	$0.037$	$0.04$	$0.04$
2 mins	$0.163$	$0.047$	$0.054$	$0.053$	$0.093$	$0.049$	$0.054$	$0.05$
5 mins	$0.128$	$0.061$	$0.204$	$0.067$	$0.177$	$0.085$	$0.066$	$0.071$
10 mins	$0.15$	$0.114$	$0.094$	$0.117$	$0.14$	$0.117$	$0.097$	$0.121$
20 mins	$0.255$	$0.137$	$0.125$	$0.185$	$0.216$	$0.152$	$0.127$	$0.136$
30 mins	$0.28$	$0.109$	$0.17$	$0.102$	$0.293$	$0.115$	$0.164$	$0.102$
				Zone 7
1 min	$0.244$	$0.079$	$0.088$	$0.068$	$0.146$	$0.035$	$0.0032$	$0.03$
2 mins	$0.176$	$0.112$	$0.13$	$0.111$	$0.101$	$0.102$	$0.05$	$0.07$
5 mins	$0.217$	$0.18$	$0.138$	$0.191$	$0.167$	$0.125$	$0.062$	$0.183$
10 mins	$0.237$	$0.355$	$0.277$	$0.236$	$0.202$	$0.255$	$0.092$	$0.19$
20 mins	$0.333$	$0.392$	$0.305$	$0.276$	$0.166$	$0.225$	$0.123$	$0.221$
30 mins	$0.335$	$0.422$	$0.423$	$0.315$	$0.256$	$0.296$	$0.167$	$0.281$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

MDPI Initiatives

Important Links

Choose an area of interest and we will send you notifications of new preprints at your preferred frequency.

Disclaimer