Preprint
Article

Walking bout Detection for People Living in Long Residential Care: A Computationally Efficient Algorithm for a 3-Axis Accelerometer on the Lower Back

Altmetrics

Downloads

132

Views

34

Comments

0

A peer-reviewed article of this preprint also exists.

Submitted:

24 August 2023

Posted:

25 August 2023

You are already at the latest version

Alerts
Abstract
Accurate and reliable measurement of real-world walking activity is clinically relevant, particularly for people with mobility difficulties. Insights on walking can help understand mobility function, disease progression, and fall risks. People living in long-term residential care environments have heterogeneous and often pathological walking patterns, making it difficult for conventional algorithms paired with wearable sensors to detect their walking activity. We designed two walking bout detection algorithms for people living in long-term residential care. Both algorithms used thresholds on the magnitude of acceleration from a 3-axis accelerometer on the lower back to classify data as “walking” or “non-walking”. One algorithm had generic thresholds, while the other used personalized thresholds. To validate and evaluate the algorithms, we compared the classifications of walking/non-walking from our algorithms to the real-time research assistant annotated labels and the classification from an algorithm validated on a healthy population. Both the generic and personalized algorithms had acceptable accuracy (0.83 and 0.82 respectively). The personalized algorithm showed the highest specificity (0.84) of all tested algorithms, meaning it was the best suited to determine input data for gait characteristic extraction. The developed algorithms were almost 60% quicker than the previously developed algorithms, suggesting they are adaptable for real-time processing.
Keywords: 
Subject: Engineering  -   Bioengineering

1. Introduction

Measurement of walking activity in long-term residential care environments is important for understanding mobility function, assessing the effects of interventions, tracking disease progression, monitoring risk of falling, and identifying unmet mobility needs [1,2,3]. People living in long-term-care homes often have mobility issues, which are typically the result of ageing, neurological or musculoskeletal disorders. The risk of falling for people living in long term care is substantially higher than for people living in the community, with the average fall rate for people living in long term care found to be between 1.5 and 2.75 times per year [4,5,6]. Falls are a common cause of many problems including bone fractures, loss of confidence in mobility, and high healthcare costs [7,8]. Many countries predict a growing population of older adults, which in turn predicts an increased incidence of mobility impacting diseases and conditions [9,10,11,12]. While we can measure and assess gait in controlled lab environments, there are numerous benefits to measuring walking in habitual environments or the “real-world” [13,14,15,16]. Measures of walking in the real world give a more truthful representation of a person’s walking ability, which usually differs to the walking ability displayed in the lab. Furthermore, real-world walking measurements can be used to evaluate the activity level, physical independence, and fall-risk of a person.
To quantify gait characteristics for medically or socially relevant insights it is imperative to first accurately identify when a person is walking. Algorithms can process walking data to determine macro and micro gait metrics [17]. Macro gait metrics include walking volume, pattern, and variability, while micro-level outcomes include pace, rhythm, variability, asymmetry and postural control. The gait metrics can be used to diagnose disease (and disease progression), evaluate risk, and determine suitable treatments. It is therefore important that the gait characteristics are accurate, which in turn, means that the walking bout data used to extract gait characteristics must be accurate. Attempting to extract gait characteristics from non-walking data will produce nonsensical results which, when included with gait characteristics derived from actual walking, confounds the overall results and findings. As such, high specificity in identifying walking bouts is vital to accurate and reliable outcomes.
Many gait identification algorithms have been developed for wearable sensors such as accelerometers alone and inertial measurement units (IMU) to isolate walking bouts in the real-world [15,18,19]. IMUs consist of a 3-axis linear accelerometer and 3-axis gyroscope. The gyroscope data provides more information on body motion and orientation, which is useful for identifying gait. However, gyroscopes have a substantially higher power consumption than accelerometers, which reduces battery life of the wearable sensor [20]. Furthermore, incorporation of gyroscope data increases the complexity of the algorithm due to sensor fusion. Some walking activity algorithms use IMUs on each of the lower body segments and, in addition to identifying walking, can produce a full kinematic analysis of gait. Algorithms have also been developed for use with a limited sensor set, such as: one IMU on the thigh [21,22]; one IMU on a foot [23,24]; and one 3-axis accelerometer on the lower back [25,26]. Limited sensor sets have quicker set-up times and reduce the burden on the user, which is particularly important for older persons and those who have age-related diseases. A recent study found that wearable sensors on the lower back had high comfort and acceptability for a varied clinical population [27]. Analytical pipelines of multiple algorithms designed for these set-ups can identify multiple gait characteristics like walking speed, gait asymmetry, and stride length. Most of these algorithms have been developed and validated for a specific target population such as healthy young adults, older people, people with Parkinson’s disease, or cerebral palsy [28,29,30,31]. Due to differences in gait characteristics, mobility capacity and performance, algorithms designed for a specific population are not necessarily suitable for a different population due to a unique movement pattern impairment.
Identifying walking bouts for people living in long-term residential care is challenging due to generally low activity, slow walking speeds, halting gait patterns, short walking bouts, and reliance on walking aids which can alter gait characteristics. The population of people living in long term care is heterogeneous, with many different conditions manifesting a variety of mobility impairments [32,33]. In general, most residents’ gait can be characterized by slow walking. No walking detection algorithm has yet been developed or validated on this population.
A variety of approaches are employed in gait detection algorithms, including generic, personalized, and machine learning design. Generic algorithms are “out-of-the-box” or standard designs that can be used immediately within the target population. At most, the height or leg length of the person is needed to derive spatial gait characteristics (i.e., step or stride length). Personalized algorithms incorporate some characteristic of the induvial in the processing which requires a form of calibration [34,35,36]. The most complex variety of walking detection algorithms are those that use machine learning to improve their walking bout detection accuracy [37,38,39]. While the machine learning algorithms can be accurate, they also require a substantial amount of time and training data to be useful. Furthermore, they have comparatively higher computational power requirements when compared to non-machine learning algorithms. For off-line data processing, high computational expense can be managed by powerful computers or cloud-based computing. However, low computational costs are very beneficial for on-line processing, allowing the sensor and processor to be packaged in one wearable device and producing real-time, useable outputs such as predicting and alerting the wearer of an imminent risk of falling.
The aim of this study was to develop and validate a computationally inexpensive algorithm to reliably detect walking in the residential care home environment. We developed both a generic and personalized algorithm to determine if incorporating the individual’s walking data in the processing would improve walking bout identification. To improve robustness of the gait classifier, we made the algorithm independent of the accelerometer orientation, ensuring that improper placement of the accelerometer in the real-world would not lead to significant data loss.

2. Materials and Methods

2.1. Data Collection

We recruited 27 participants from four long-term care facilities as a sub-study of the “Staying UpRight” study [4]. All participants gave informed consent as approved by the New Zealand (NZ) Health and Disability Ethics Committee (Approval number 18/NTB/151/AM03). Inclusion criteria were people living in long term care and over 65 years of age. Participants were excluded if they received psychogeriatric, respite or palliative care, or were acutely unwell or immobile. We made note of any walking aids the participant used during the study. The demographics are presented in Table 1.
At the care home, a trained research assistant attached a 3-axis linear accelerometer (Axivity AX3, York, UK) to the participant using a hydrogel adhesive and covered with a surgical grade adhesive dressing (OPSITE Flexifix™or Hypafix™, Smith+Nephew Ltd., Watford, UK). The wearable accelerometer was placed in-line with the 5th lumbar vertebrae and orientated such that its long axis was parallel with the spine. Accelerometery data were recorded at 100 Hz with a ±8g range.
We first collected a validation dataset with the assistance of two research assistants, familiar with the care home environment and the participants. The protocol is visualized in Figure 1. Data collection began with the participant seated in the care home. The participant was then asked to stand up from their bed, walk along a corridor at their own pace, sit-down in a chair, and rest for as long as they wanted, before standing up and retracing their path to the original starting point where they sat down again. Meanwhile, one research assistant used a digital form (Table S1) to record the timings and type of activity. Activities were categorised according to eight specific labels. For the purposes of this study, the two activity categories of “Moving in corridor” and “Moving in confined space” were labelled as walking and all other categories were considered non-walking. The research assistant recorded the clock time (in hour:minute:second format) when the participant began a new type of activity, such as “transition – sit-to-stand”, and also noted the type of activity. Throughout the protocol, a second research assistant used a video camera (Panasonic model HC-V720, frame rate of 25 Hz) to record the lower-half of the participant. To synchronize the accelerometer, research assistant labels, and video data, at the start of the protocol a research assistant aimed the video camera at the wearable sensor on the participant’s back while a mobile phone with the time displayed was held in view of the camera. The second research assistant then tapped the accelerometer three distinct times with their finger to generate a synchronization signal across all datatypes.
We also collected a “real-world” dataset over 7 days using the same wearable sensor set-up, but with no video or hand annotations. This real-world data was primarily used for the “personalized” algorithm detailed in section 2.3. Before processing, the 7-day data were cleaned using a custom built Matlab (Mathworks, v2021b) graphic user interface. We could identify and correct for periods wherein the wearable sensor was removed, replaced in a new orientation, or suffered data loss.

2.2. Data Annotation

The 3-axis accelerometer data from the validation dataset was aligned with the research assistant labels of “walking” or “non-walking” at every timepoint. We matched the video frames of the wearable sensor being tapped to the corresponding spikes in the sensor’s linear acceleration data. Using the time shown on the mobile phone in the video when the accelerometer was tapped, we aligned the written timings of activity to the accelerometer frames. This 2-step sequence matched the activity labels with the accelerometer frames. We then designated walking activity as binary 1, and all other activity as binary 0.

2.3. Algorithm Design

The algorithm identifies periods of walking using thresholds with the magnitude of the linear acceleration in all 3 axes. We developed two versions of the algorithm: one that used a generic threshold and one that used a personalized threshold. There are only two differences between the two algorithms, so we will therefore start with a description of the generic threshold algorithm, hereby referred to as the generic algorithm.
We described the generic algorithm visually in Figure 2 and provide the code in the supplementary files. We filtered the 3-axes of linear acceleration with a 4th order, zero-lag low pass Butterworth filter with a cut-off frequency of 0.25 Hz, then subtracted the filtered data from the raw data to centre the accelerations around 0 g. We took the magnitude of all three axes to create a single signal indicating overall acceleration, which is compared to the global threshold of 0.05 g. The global threshold is essentially the minimum 3D magnitude at which any type of activity could occur. We determined this threshold through pilot studies with substantial visual inspection on all participants data. Comparison of the 3D signal to the threshold creates a binary data series where 0 indicates datapoints less than the threshold and 1 represents data exceeding the threshold. The binary signal was smoothed with a gaussian filter to provide an estimate of activity likelihood with reference to the previous and future data.
We used the binary signal to identify every period of non-walking or “gap”. We then examined the smoothed “activity likelihood” signal during the gap, and counted the number of datapoints less than 0.2 g which was then compared to the “Minimum gap threshold” of 50 frames. If more than 50 datapoints were below the 0.2 threshold, the gap was retained and considered as “no activity”. If less than 20 of the “activity likelihood” datapoints fell below the 0.2 threshold, the gap was removed and the data labelled as “activity”. At this stage, we had the start and end frames of every gap and therefore bout of activity.
The 2nd last stage of the algorithm is to include or reject each identified bout of activity as walking. In each bout, we examined the gaussian smoothed 3D magnitude of the acceleration data in comparison to a heuristic threshold of 0.4 g. The purpose of this threshold is to remove non-walking activities like sit-stand transitions. If less than 2.5% of the smoothed 3D magnitude data is above the 0.4 g threshold, then the activity is considered walking and the bout is included. Conversely, if more than 2.5% of the data is above the 0.4 g threshold, it is considered non-walking activity and rejected. At the end of this stage, we have the start and end frames for every identified bout of walking.
The final stage of the algorithm rejected walking bouts that were less than the minimum duration. For the purposes of further analysis, we set the minimum walking bout duration to 2 seconds. Walking bouts shorter than 2 seconds were rejected. The output of the algorithm was both the start and end frames of each walking bout and a binary signal wherein 1 represents walking and 0 represents non-walking.
In the personalized algorithm we replaced the “generic threshold” with a threshold value determined by the participant’s data and we also modified the “minimum gap threshold”. The minimum threshold for activity was found as the median of the 3D magnitude in a 24 hour period. Theoretically, we are more often stationary than we are in movement, therefore the median acceleration value over a 24-hour period should indicate the baseline of no movement. In this study, we found the median for each participant from day 3 of the real-world data. This personalized threshold replaced the “generic” threshold value in the algorithm. The “minimum gap threshold” was set to 100 frames, as pilot testing showed the original value of 50 frames was too conservative when combined with the personalized threshold (see Figure 2.)

2.4. Algorithm Validation and Performance Evaluation

To validate and evaluate both versions of the presented algorithm, we included comparison to the research assistant annotations and a previously validated “walking bout detection” algorithm. For the purposes of this manuscript, the research assistant annotated walking bouts were considered as the “ground truth”. The walking bout detection algorithm developed by Hickey et al [37] was validated with a young, able-bodied population through comparison with chest-mounted camera footage.
For a comprehensive and quantitative evaluation of the algorithms, we applied traditional evaluation of binary classifiers matrix to compare each of the algorithms to the ground truth. The evaluation metrics were based on the prediction of the algorithm in comparison to the “ground truth” of the research assistant labelled data. Our evaluation of binary classifiers included the basic ratios: sensitivity (true positive rate), specificity (true negative rate), precision (positive predictive value), and negative predictive value. We also calculated the accuracy and F1 score. Figure 3 presents a confusion matrix to illustrate how we calculated the evaluation metrics. For every participant, we ran analysis on the binary output from each of the algorithms and quantified evaluation metrics. We then averaged the results across participants. We chose to average over participants to ensure that the trial duration was not incorporated in the final evaluation and that the effectiveness of the algorithm was weighted equally for each participant.
Finally, we also compared the run time of each of the algorithms. The run time of the codes were measured with Matlab’s built in Tic and Toc functions. We recorded the run time of each algorithm thrice for each participant’s data and then averaged across the three attempts. We then divided each participants average run time by the number of data points and multiplied by 6000 to determine the processing time per 10 minutes of data. We used 2-sample t-tests to investigate significant differences in processing time between the algorithms. Finally, we averaged the processing time per minute of data over participants to find the average processing time per 10 minutes of data (6000 frames) for each of the three algorithms.

3. Results

The final dataset used for analysis included 20 participants, the demographics of whom are summarized in Table 1. Five participants suffered from data loss and were removed from the analysis. A further participant was excluded due to inadequate attachment of the accelerometer, which resulted in the sensor hanging upside down for a portion of the trial. A final participant exhibited highly unusual accelerometer data which resulted in no walking bouts identified by the developed algorithms. The non-standard accelerations were caused by either inadequate attachment of the sensor or the participant’s additional walking aids: an orthotic shoe and ankle orthosis on the right leg. We included their data in the analysis in Tables S2, S3, & S4, but excluded the participant in the following analysis due to uncertainty in the cause of the unusual accelerometer data. In some trials, the research assistant mistakenly tapped on the phone, rather than the accelerometer. In these trials, we visually inspected the video and accelerometer data to align the activity labels with the frames of the wearable sensor.
We present representative data of a participant in Figure 4. Visual inspection showed both the generic and personalized algorithms were relatively good at identifying the start and end of each walking bout. However, the generic algorithm sometimes incorrectly considered non-walking activity (i.e. transitions between standing and sitting) to be walking. The personalized algorithm was more conservative than the generic algorithm, which was seen by fewer misclassifications of sit-stand transitions, but increased misclassification of slower walking. The reference algorithm was more likely to classify stationary data as walking compared to the generic and personalized algorithms. Overall, the walking bouts identified by the three algorithms were similar to the research assistant labelled walking bouts.
We illustrated the quantitative results for each algorithm across participants as a boxchart (Figure 5) and the means with standard deviation for each algorithm in Table 2. The generic algorithm and the references algorithm had very similar sensitivity values (0.90 and 0.89 respectively). The lower sensitivity of the personalized algorithm indicates the algorithm was a little conservative in classifying walking data, in agreement with the finding drawn from Figure 4. The specificity of the generic algorithm was the lowest of the algorithms (0.74), which supports the earlier statement that the generic algorithm was more likely to classify sit-stand transitions as walking. Similarly, the personalized algorithm had the highest specificity of the algorithms (0.84) indicating it had fewer misclassification of sit-stand transitions. Although the personalized algorithm excelled in precision, we found it had the worst negative predictive value. The generic algorithm was the least precise, although it shared mean negative predictive value with the reference algorithm. The accuracy and F1 score were similar across all algorithms, although the box chart indicated the reference algorithm had the least spread for accuracy.
Both the generic and personalized algorithms had significantly shorter processing times than the references algorithm (Figure 6) and Table 3. There was no significant difference in processing time between the generic and personalized algorithms (p=0.50). We found p<0.01 when independently comparing the reference algorithm to generic and personalized algorithm.

4. Discussion

The primary goal of this study was to determine if we could accurately and reliably measure walking bouts using computationally efficient algorithm in a heterogenous population of people with mobility limitations living in residential care environments. Our results show that both of our developed algorithms were able to reliably detect walking bouts compared to the ground truth of research assistant labels. Unobtrusively and reliably measuring walking activity of people in long-term care is a valuable tool for understanding disease progression and the impact of interventions, as well as guiding treatment and therapy plans.
We found the developed algorithms performed similarly to the reference algorithm, with some differences between all three. The reference algorithm was the most likely to include non-walking data within correctly identified walking bouts. That is to say, the reference algorithm had the poorest performance at detecting short breaks in walking bouts. Low specificity, or labelling non-walking data as walking, causes noise and issues when determining gait outcomes (such as walking speed, initial contact timings, or step length) because the algorithms will attempt to calculate gait characteristics from data that has none of the characteristics of walking data. The algorithms developed and described in this manuscript were generally very good at excluding non-walking data within walking bouts, and accurately identifying the start and end of confirmed walking bouts. The generic algorithm was the least conservative of the algorithms, which was indicated by high sensitivity and low specificity. The personalized algorithm was the most conservative at classifying walking bouts. The high specificity of the personalized algorithm is highly desirable for extracting gait characteristics as it reduces the noise and errors that are introduced by including non-walking data in the analysis. Otherwise, the results of the three algorithms were somewhat comparable and showed acceptable consensus with research assistant labelling.
All algorithms misclassified some transition data (i.e. sit-stand) as walking. Transitioning between standing and sitting incurs high linear accelerations, which causes misclassifications by all algorithms. Visually, there are no repeating cyclic patterns during transitions as there is in walking. However, none of the presented algorithms look for repeating patterns. The developed algorithms could implement an additional threshold or check to reject transition data as walking. A simple maximum magnitude threshold implemented in tandem with the minimum magnitude threshold may improve rejection of transitions. An alternative technique would be inclusion of gyroscope data, which was not measured by our sensors.
Incorporating gyroscope data with the linear acceleration could improve the performance of both developed algorithms. Specifically, the gyroscope data gives information on trunk orientation, which could be used to represent a facet of walking. Older adults exhibit a trunk flexion angle relatively close to upright when walking (average 6.3 degrees for men, and 7.0 degrees for women) [40]. The trunk orientation could be compared to a personalized threshold to classify if the trunk angle is within normal limits for walking. The position classification would then be included in the processing step to reject non-walking activity in the developed algorithms. The trunk angle threshold may also be derived from participant data, improving the individualization of the classifier. However, including gyroscope data would not only increase the processing time, but require different hardware. We think including gyroscope data would be worthwhile as improved accuracy would be worth a small increase in processing time.
The threshold of the personalized algorithm was found from a separate data set, which may have resulted in unsuitable threshold values. The dataset used for determining the personalized threshold was recorded on a different day, with no guarantee the measurement unit was positioned exactly as it was for the validation data collection. Furthermore, the participant may have had different gait patterns between the data collections. If the threshold was derived from a participant’s “bad” day, then the thresholds may not match with data from a “good” day. An adaptive personalized threshold which is determined from the most recent data may overcome this issue.
All thresholds and limits, except the personalized minimum activity threshold, were determined through human reasoning. In piloting, we tested numerous values for each threshold and limit. Through participant-level visual comparison of the data at every step, we evaluated the various threshold values. We chose the thresholds that appeared to work well with all participants, and were neither overly conservative or generous. It is highly likely that the thresholds and limits were suboptimal. Further investigation into the most appropriate thresholds and limits is warranted. In addition to finding more appropriate generic thresholds, we may also find personalized thresholds more suitable than generic thresholds.
The population for which these algorithms are designed is a heterogenous population, which explains the large spread of classification evaluation metrics. People living in long term care will typically share some gait characteristics such as slow walking. However, the variety of medical conditions and mobility issues that are present in the population mean there is a wide range of gait patterns. The majority of participants included in this study used a walking frame to complete the protocol and yet each of their acceleration patterns were visibly different. We found that the developed algorithms were far more suitable for some participants than others. For example, the developed algorithms were unable to identify walking bouts in the data of participant 21, who walked with a walking frame and orthotic shoe with ankle orthosis on the right leg. We cannot be certain if the developed algorithms performed poorly for this participant due to poor sensor attachment or unusual patterns in the acceleration data. If we were to group the participants by gait characteristics, we may discover which gait metrics improve the efficacy of the algorithms and which gait characteristics are difficult for the algorithm to handle. Unfortunately, we did not measure gait characteristics so cannot perform this analysis.
Due to methodical difficulties, there were apparent inaccuracies in the research assistant labelled data that we believe may have impacted the results and subsequent interpretation. We illustrated an example of inaccurate research assistant-labelled walking bouts in Figure 7. From visual inspection of this participant’s data, we found that the research assistant labelled walking bouts do not overlap with actual periods of walking which we identified by the low frequency fluctuations in linear acceleration. We suggest two likely causes of the inaccuracy: synchronization error, and difficulty cataloguing the precise timings of activities. The former is due to difficulty matching the accelerometer frames to the research assistant labels, which had a few contributing factors. The time shown on the mobile phone in the video data varied in precision, sometimes displaying the time to the nearest second and other times to the closest minute. In trials where the time was shown to the nearest minute, it was not trivial to match the seconds of the research assistant labels to the video and wearable sensor data. In trials where the phone was mistakenly tapped instead of the accelerometer, we visually compared accelerometer data with both the video and research assistant labels. The start and stop times of the walking bouts identified from visual inspection of the linear acceleration may not have matched precisely with the start and stop times of the research assistant labels.
The difficulty of cataloguing the precise timings of the activities in real time manifested in a multiple ways. Firstly, the research assistants noted times to the nearest second, while the accelerometer measured time in tenths of a second (0.01 s). Therefore, there is uncertainty as to which activity is being performed for nine frames of the accelerometer data. Furthermore, and perhaps more importantly, it was difficult for the research assistants to notice and record small details in walking bouts due to human reaction times. A participant may take a short break during their walking, which is challenging for the research assistant to note down in real time. In addition, identifying the exact time the participant transitions between activities is very difficult to do visually. For example, the research assistant used their best judgement to determine when the participant started walking from standing. These challenges result in less accurate labelling, which impacts the assessment of the algorithms.
Considering the difficulties in obtaining highly accurate research assistant labelled data, it is possible that we have underestimated the performance of the developed algorithms. The most impactful of these difficulties is the disagreement between the algorithms and labelled data when short breaks in walking occur. Visual inspection of the acceleration and video data shows the algorithms identified true short breaks in some walking bouts that were labelled by the research assistant as walking. As such, a number of true negatives were incorrectly evaluated as false negatives, which primarily implies underestimation of the negative predictive value as well as a slightly underestimated true positive and negative rate. To improve the quality of our evaluation, we could relabel the data using the recorded videos, however such a task is extremely time intensive and subject to human error and was therefore not included in this study. Future work may find it worthwhile to relabel the data.
The processing time and therefore computational cost of the developed algorithms were far less than that of the reference algorithm. The main difference in computational cost was likely due to the different techniques used in the initial estimate of walking bouts. The Hickey et al. algorithm initially assessed the data for activity using small windows, requiring multiple repetitions of the analysis technique. Our developed algorithms initially processed the data as a single vector, requiring no repetition or segmentation of the data. Shorter processing time (and lower computational cost) of an algorithm makes it more suitable for on-board processing as it is less draining on the battery and requires fewer resources. Although this is helpful for a personal device to estimate walking at the end of each day, our algorithms would require some modification to estimate walking bouts in real time.
Adapting our presented algorithms for real-time use is possible, but would require some fundamental changes. It is likely that the data would need to be sectioned for processing. In addition to identifying the optimal window size and style (such as overlapping or adjacent), consideration must be paid to how the gaussian smoothing and filtering impact the data. New thresholds may be needed to adapt for changes in the processed signal.
Finally, our algorithms are not reliant on the orientation of the wearable sensor but they do necessitate good adhesion to the skin. The algorithm combines all three axes of linear acceleration into one vector, which is used for processing. As such, the algorithm results should be unchanged if the sensor was placed in any other orientation. We have not yet tested this theory. However, it is important that the wearable sensor remains in place – if the accelerometer can move relative to the skin, the linear accelerations will be changed and the algorithms are less likely to properly detect walking bouts.
In conclusion, our developed algorithms were able to measure walking bouts in a real-world environment for people living in long-term-care with acceptable accuracy and reliability. Our generic algorithm was less conservative than the personalized algorithm, but both were able to identify short breaks in walking. Both algorithms were able to identify the start and end of the walking bouts with high accuracy, although sit-to-stand and stand-sit-transitions were sometimes mislabelled as walking. Our results showed good agreement between the developed algorithms, reference algorithm, and ground truth (research assistant labelled data), but also highlighted difficulties and inaccuracies of real-time, human, annotation of activities. We were also able to decrease processing time and computational complexity compared to a previously validated algorithm from Hickey et al.. Going forward, we can optimize the generic and personalized thresholds of the algorithms and consider adaptation for real-time implementation.

Supplementary Materials

The following supporting information can be downloaded at: www.mdpi.com/xxx/s1, Table S1: Data collection form used by research assistants to label and time activities during the experimental protocol.; Table S2: Individual results of the developed generic algorithm.; Table S3: Individual results of the developed personalized algorithm.; Table S4: : Individual results of the walking “reference” walking bout detection algorithm from Hickey et al.

Author Contributions

The algorithms presented in this manuscript were developed by M.K.M, with supervision from S.D.D. Conceptualization, M.K.M. and S.D.D.; methodology, M.K.M., R.R. and S.D.D.; software, M.K.M., R.R. and S.D.D.; validation, M.K.M.; formal analysis, M.K.M.; investigation, M.K.M., N.K., L.T., L.R., S.D.D.; resources, N.K., L.T., L.R., S.D.D.; data curation, R.R.; writing—original draft preparation, M.K.M.; writing—review and editing, M.K.M., R.R., N.K., L.T., L.R., S.D.D.; visualization, M.K.M.; supervision, S.D.D.; project administration, L.T.; funding acquisition, N.K., L.T. All authors have read and agreed to the published version of the manuscript.

Funding

The study has been funded by a Health Research Council of New Zealand project grant (reference 18/414). The study funder plays no role in study design, data collection or analyses. SDD and LR were supported by the Innovative Medicines Initiative 2 Joint Undertaking (IMI2 JU) project IDEA-FAST - Grant Agreement 853981. This JU receives support from the European Union’s Horizon 2020 research and innovation program and the European Federation of Pharmaceutical Industries and Associations (EFPIA). SDD and LR were also supported by the Mobilise-D project that has received funding from the Innovative Medicines Initiative 2 Joint Undertaking (JU) under grant agreement No. 820820. SDD and LR were supported by the National Institute for Health Research (NIHR) Newcastle Biomedical Research Centre (BRC) based at The Newcastle upon Tyne Hospital NHS Foundation Trust, Newcastle University and the Cumbria, Northumberland and Tyne and Wear (CNTW) NHS Foundation Trust. SDD and LR were supported by the NIHR/Wellcome Trust Clinical Research Facility (CRF) infrastructure at Newcastle upon Tyne Hospitals NHS Foundation Trust. All opinions are those of the authors and not the funders.

Institutional Review Board Statement

The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the NZ Health and Disability Ethics Committee (protocol code 18/NTB/151/AM03m and date of approval 18 November 2020).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Acknowledgments

The authors would like to acknowledge the care home residents who participated in this study, and the study research assistants for their time in data collection and analysis.

Conflicts of Interest

S. Del Din reports consultancy activity with Hoffmann-La Roche Ltd. outside of this study. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Barber, S.E.; Forster, A.; Birch, K.M. Levels and Patterns of Daily Physical Activity and Sedentary Behavior Measured Objectively in Older Care Home Residents in the United Kingdom. Journal of Aging and Physical Activity 2015, 23, 133–143. [Google Scholar] [CrossRef] [PubMed]
  2. Rochester, L.; Mazzà, C.; Mueller, A.; Caulfield, B.; McCarthy, M.; Becker, C.; Miller, R.; Piraino, P.; Viceconti, M.; Dartee, W.P.; et al. A Roadmap to Inform Development, Validation and Approval of Digital Mobility Outcomes: The Mobilise-D Approach. Digital Biomarkers 2020, 4, 13–27. [Google Scholar] [CrossRef] [PubMed]
  3. Polhemus, A.; Ortiz, L.D.; Brittain, G.; Chynkiamis, N.; Salis, F.; Gaßner, H.; Gross, M.; Kirk, C.; Rossanigo, R.; Taraldsen, K.; et al. Walking on Common Ground: A Cross-Disciplinary Scoping Review on the Clinical Utility of Digital Mobility Outcomes. npj Digit. Med. 2021, 4, 1–14. [Google Scholar] [CrossRef] [PubMed]
  4. Taylor, L.; Parsons, J.; Taylor, D.; Binns, E.; Lord, S.; Edlin, R.; Rochester, L.; Del Din, S.; Klenk, J.; Buckley, C.; et al. Evaluating the Effects of an Exercise Program (Staying UpRight) for Older Adults in Long-Term Care on Rates of Falls: Study Protocol for a Randomised Controlled Trial. Trials 2020, 21, 46. [Google Scholar] [CrossRef]
  5. Rubenstein, L.Z.; Josephson, K.R.; Robbins, A.S. Falls in the Nursing Home. Ann Intern Med 1994, 121, 442–451. [Google Scholar] [CrossRef]
  6. Kerse, N.; Butler, M.; Robinson, E.; Todd, M. Physical Activity: Wearing Slippers, Falls and Injury in Residential Care. Australian and New Zealand Journal of Public Health 2004, 28, 180–187. [Google Scholar] [CrossRef]
  7. Silva, R.B.; Eslick, G.D.; Duque, G. Exercise for Falls and Fracture Prevention in Long Term Care Facilities: A Systematic Review and Meta-Analysis. Journal of the American Medical Directors Association 2013, 14, 685–689.e2. [Google Scholar] [CrossRef]
  8. Heinrich, S.; Rapp, K.; Rissmann, U.; Becker, C.; König, H.-H. Cost of Falls in Old Age: A Systematic Review. Osteoporos Int 2010, 21, 891–902. [Google Scholar] [CrossRef]
  9. Daley, M.J.; Spinks, W.L. Exercise, Mobility and Aging. Sports Med 2000, 29, 1–12. [Google Scholar] [CrossRef]
  10. Kanasi, E.; Ayilavarapu, S.; Jones, J. The Aging Population: Demographics and the Biology of Aging. Periodontology 2000 2016, 72, 13–18. [Google Scholar] [CrossRef]
  11. Atella, V.; Piano Mortari, A.; Kopinska, J.; Belotti, F.; Lapi, F.; Cricelli, C.; Fontana, L. Trends in Age-Related Disease Burden and Healthcare Utilization. Aging Cell 2019, 18, e12861. [Google Scholar] [CrossRef] [PubMed]
  12. Dall, T.M.; Gallo, P.D.; Chakrabarti, R.; West, T.; Semilla, A.P.; Storm, M.V. An Aging Population And Growing Disease Burden Will Require ALarge And Specialized Health Care Workforce By 2025. Health Affairs 2013, 32, 2013–2020. [Google Scholar] [CrossRef] [PubMed]
  13. Mazzà, C.; Alcock, L.; Aminian, K.; Becker, C.; Bertuletti, S.; Bonci, T.; Brown, P.; Brozgol, M.; Buckley, E.; Carsin, A.-E.; et al. Technical Validation of Real-World Monitoring of Gait: A Multicentric Observational Study. BMJ Open 2021, 11, e050785. [Google Scholar] [CrossRef]
  14. Mc Ardle, R.; Del Din, S.; Donaghy, P.; Galna, B.; Thomas, A.J.; Rochester, L. The Impact of Environment on Gait Assessment: Considerations from Real-World Gait Analysis in Dementia Subtypes. Sensors 2021, 21, 813. [Google Scholar] [CrossRef]
  15. Micó-Amigo, M.E.; Bonci, T.; Paraschiv-Ionescu, A.; Ullrich, M.; Kirk, C.; Soltani, A.; Küderle, A.; Gazit, E.; Salis, F.; Alcock, L.; et al. Assessing Real-World Gait with Digital Technology? Validation, Insights and Recommendations from the Mobilise-D Consortium. Journal of NeuroEngineering and Rehabilitation 2023, 20, 78. [Google Scholar] [CrossRef] [PubMed]
  16. Storm, F.A.; Buckley, C.J.; Mazzà, C. Gait Event Detection in Laboratory and Real Life Settings: Accuracy of Ankle and Waist Sensor Based Methods. Gait & Posture 2016, 50, 42–46. [Google Scholar] [CrossRef]
  17. Del Din, S.; Galna, B.; Godfrey, A.; Bekkers, E.M.J.; Pelosin, E.; Nieuwhof, F.; Mirelman, A.; Hausdorff, J.M.; Rochester, L. Analysis of Free-Living Gait in Older Adults With and Without Parkinson’s Disease and With and Without a History of Falls: Identifying Generic and Disease-Specific Characteristics. The Journals of Gerontology: Series A 2019, 74, 500–506. [Google Scholar] [CrossRef]
  18. Taborri, J.; Palermo, E.; Rossi, S.; Cappa, P. Gait Partitioning Methods: A Systematic Review. Sensors 2016, 16, 66. [Google Scholar] [CrossRef]
  19. Prasanth, H.; Caban, M.; Keller, U.; Courtine, G.; Ijspeert, A.; Vallery, H.; von Zitzewitz, J. Wearable Sensor-Based Real-Time Gait Detection: A Systematic Review. Sensors 2021, 21, 2727. [Google Scholar] [CrossRef]
  20. Liu, Q.; Williamson, J.; Li, K.; Mohrman, W.; Lv, Q.; Dick, R.P.; Shang, L. Gazelle: Energy-Efficient Wearable Analysis for Running. IEEE Transactions on Mobile Computing 2017, 16, 2531–2544. [Google Scholar] [CrossRef]
  21. Garcia, F.A.; Pérez-Ibarra, J.C.; Terra, M.H.; Siqueira, A.A.G. Adaptive Algorithm for Gait Segmentation Using a Single IMU in the Thigh Pocket. IEEE Sensors Journal 2022, 22, 13251–13261. [Google Scholar] [CrossRef]
  22. Abhayasinghe, N.; Murray, I. Human Gait Phase Recognition Based on Thigh Movement Computed Using IMUs. In Proceedings of the 2014 IEEE Ninth International Conference on Intelligent Sensors, Sensor Networks and Information Processing (ISSNIP); April 2014; pp. 1–4. [Google Scholar]
  23. Anwary, A.R.; Yu, H.; Vassallo, M. Optimal Foot Location for Placing Wearable IMU Sensors and Automatic Feature Extraction for Gait Analysis. IEEE Sensors Journal 2018, 18, 2555–2567. [Google Scholar] [CrossRef]
  24. Pérez-Ibarra, J.C.; Siqueira, A.A.G.; Krebs, H.I. Real-Time Identification of Gait Events in Impaired Subjects Using a Single-IMU Foot-Mounted Device. IEEE Sensors Journal 2020, 20, 2616–2624. [Google Scholar] [CrossRef]
  25. González, R.C.; López, A.M.; Rodriguez-Uría, J.; Álvarez, D.; Alvarez, J.C. Real-Time Gait Event Detection for Normal Subjects from Lower Trunk Accelerations. Gait & Posture 2010, 31, 322–325. [Google Scholar] [CrossRef]
  26. Zijlstra, W.; Hof, A.L. Assessment of Spatio-Temporal Gait Parameters from Trunk Accelerations during Human Walking. Gait & Posture 2003, 18, 1–10. [Google Scholar] [CrossRef]
  27. Keogh, A.; Alcock, L.; Brown, P.; Buckley, E.; Brozgol, M.; Gazit, E.; Hansen, C.; Scott, K.; Schwickert, L.; Becker, C.; et al. Acceptability of Wearable Devices for Measuring Mobility Remotely: Observations from the Mobilise-D Technical Validation Study. Digit Health 2023, 9, 20552076221150744. [Google Scholar] [CrossRef]
  28. Carcreff, L.; Gerber, C.N.; Paraschiv-Ionescu, A.; De Coulon, G.; Newman, C.J.; Armand, S.; Aminian, K. What Is the Best Configuration of Wearable Sensors to Measure Spatiotemporal Gait Parameters in Children with Cerebral Palsy? Sensors 2018, 18, 394. [Google Scholar] [CrossRef]
  29. Brégou Bourgeois, A.; Mariani, B.; Aminian, K.; Zambelli, P.Y.; Newman, C.J. Spatio-Temporal Gait Analysis in Children with Cerebral Palsy Using, Foot-Worn Inertial Sensors. Gait & Posture 2014, 39, 436–442. [Google Scholar] [CrossRef]
  30. Schlachetzki, J.C.M.; Barth, J.; Marxreiter, F.; Gossler, J.; Kohl, Z.; Reinfelder, S.; Gassner, H.; Aminian, K.; Eskofier, B.M.; Winkler, J.; et al. Wearable Sensors Objectively Measure Gait Parameters in Parkinson’s Disease. PLOS ONE 2017, 12, e0183989. [Google Scholar] [CrossRef]
  31. Del Din, S.; Godfrey, A.; Rochester, L. Validation of an Accelerometer to Quantify a Comprehensive Battery of Gait Characteristics in Healthy Older Adults and Parkinson’s Disease: Toward Clinical and at Home Use. IEEE Journal of Biomedical and Health Informatics 2016, 20, 838–847. [Google Scholar] [CrossRef]
  32. Tobis, S.; Jaracz, K.; Kropińska, S.; Talarska, D.; Hoe, J.; Wieczorowska-Tobis, K.; Suwalska, A. Needs of Older Persons Living in Long-Term Care Institutions: On the Usefulness of Cluster Approach. BMC Geriatrics 2021, 21, 316. [Google Scholar] [CrossRef] [PubMed]
  33. Manton, K.G. Planning Long-Term Care for Heterogeneous Older Populations. In Varieties of Aging; Maddox, G.L., Lawton, M.P., Eds.; Springer: Berlin, Heidelberg, 1988; pp. 217–255. ISBN 978-3-662-40050-0. [Google Scholar]
  34. Cola, G.; Avvenuti, M.; Musso, F.; Vecchio, A. Personalized Gait Detection Using a Wrist-Worn Accelerometer. In Proceedings of the 2017 IEEE 14th International Conference on Wearable and Implantable Body Sensor Networks (BSN); May 2017; pp. 173–177. [Google Scholar]
  35. Vienne-Jumeau, A.; Oudre, L.; Moreau, A.; Quijoux, F.; Edmond, S.; Dandrieux, M.; Legendre, E.; Vidal, P.P.; Ricard, D. Personalized Template-Based Step Detection From Inertial Measurement Units Signals in Multiple Sclerosis. Frontiers in Neurology 2020, 11. [Google Scholar] [CrossRef] [PubMed]
  36. Dobkin, B.H.; Martinez, C. Wearable Sensors to Monitor, Enable Feedback, and Measure Outcomes of Activity and Practice. Curr Neurol Neurosci Rep 2018, 18, 87. [Google Scholar] [CrossRef]
  37. Hickey, A.; Din, S.D.; Rochester, L.; Godfrey, A. Detecting Free-Living Steps and Walking Bouts: Validating an Algorithm for Macro Gait Analysis. Physiol. Meas. 2016, 38, N1. [Google Scholar] [CrossRef]
  38. Potluri, S.; Chandran, A.B.; Diedrich, C.; Schega, L. Machine Learning Based Human Gait Segmentation with Wearable Sensor Platform. In Proceedings of the 2019 41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); July 2019; pp. 588–594. [Google Scholar]
  39. O’Callaghan, B.P.F.; Doheny, E.P.; Goulding, C.; Fortune, E.; Lowery, M.M. Adaptive Gait Segmentation Algorithm for Walking Bout Detection Using Tri-Axial Accelerometers. In Proceedings of the 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC); July 2020; pp. 4592–4595. [Google Scholar]
  40. Lee, S.W.; Verghese, J.; Holtzer, R.; Mahoney, J.R.; Oh-Park, M. Trunk Sway during Walking among Older Adults: Norms and Correlation with Gait Velocity. Gait & Posture 2014, 40, 676–681. [Google Scholar] [CrossRef]
Figure 1. The validation protocol used for this study, wherein participants started seated, stood, walked through a bedroom and hallway, sat down on a chair, then retraced their journey.
Figure 1. The validation protocol used for this study, wherein participants started seated, stood, walked through a bedroom and hallway, sat down on a chair, then retraced their journey.
Preprints 83226 g001
Figure 2. Figure 2. Flow chart showing the steps of the algorithm. The differences between the developed algorithms are shown by the dotted-green shapes, which represents the generic algorithm, and the dot-dash-yellow shape, which illustrates the personalized algorithm.
Figure 2. Figure 2. Flow chart showing the steps of the algorithm. The differences between the developed algorithms are shown by the dotted-green shapes, which represents the generic algorithm, and the dot-dash-yellow shape, which illustrates the personalized algorithm.
Preprints 83226 g002
Figure 3. Confusion matrix which defines “true positive”, “true negative”, “false positive”, and “false negative” in the context of research assistant labelled and algorithm estimated data points. The equations used to calculate the sensitivity, specificity, precision, negative predictive value, accuracy, and F1 score are displayed and positioned with reference to their components.
Figure 3. Confusion matrix which defines “true positive”, “true negative”, “false positive”, and “false negative” in the context of research assistant labelled and algorithm estimated data points. The equations used to calculate the sensitivity, specificity, precision, negative predictive value, accuracy, and F1 score are displayed and positioned with reference to their components.
Preprints 83226 g003
Figure 4. Representative data and walking bout classification for a single participant. The shaded boxes represent the walking bout as identified by the research assistant (top), and generic, personalized, and reference algorithms (second to fourth plots).
Figure 4. Representative data and walking bout classification for a single participant. The shaded boxes represent the walking bout as identified by the research assistant (top), and generic, personalized, and reference algorithms (second to fourth plots).
Preprints 83226 g004
Figure 5. The metrics used to evaluate the efficacy of the walking bout detection algorithms.
Figure 5. The metrics used to evaluate the efficacy of the walking bout detection algorithms.
Preprints 83226 g005
Figure 6. The processing time for each of the algorithms. Each algorithm’s processing time has been fit with the line of best fit (linear regression).
Figure 6. The processing time for each of the algorithms. Each algorithm’s processing time has been fit with the line of best fit (linear regression).
Preprints 83226 g006
Figure 7. Representative data of a single participant exhibiting poor match between walking bouts identified by research assistant labels and linear acceleration data.
Figure 7. Representative data of a single participant exhibiting poor match between walking bouts identified by research assistant labels and linear acceleration data.
Preprints 83226 g007
Table 1. Demographics of the participants included in the data analysis.
Table 1. Demographics of the participants included in the data analysis.
Participant Age (years) Height (m) Weight (kg) Sex Walking aid(s)
1 93 1.58 52 Female None
2 104 1.68 53 Male None
3 81 1.57 62 Female None
4 94 1.65 57 Female None
5 68 1.65 118 Female Stick, right hand
6 69 1.75 83 Male Walking frame
7 77 1.78 109 Male Walking frame
8 85 1.55 47 Female Walking frame
9 87 1.61 37 Female Walking frame
10 88 1.74 82 Male Walking frame
11 81 1.48 64 Female Walking frame
12 80 1.78 102 Male Walking frame
13 83 1.60 56 Female Walking frame
14 89 1.61 61 Female Walking frame
15 94 1.71 74 Male Walking frame
16 97 1.75 68 Male Walking frame
17 79 1.71 90 Male Walking frame
18 88 1.73 73 Male Walking frame
19 81 1.50 52 Female Walking frame
20 85 1.65 48 Female Walking frame
21 65 1.54 75 Male Walking frame, Right Ankle Brace and Orthotic shoe
Table 2. The evaluation metrics of the generic, personalized, and reference algorithms, averaged over participants.
Table 2. The evaluation metrics of the generic, personalized, and reference algorithms, averaged over participants.
Generic Personalized Reference
Mean s.d Mean s.d Mean s.d
Sensitivity (True Positive Rate) 0.90 0.08 0.81 0.13 0.89 0.11
Specificity (True Negative Rate) 0.74 0.12 0.84 0.13 0.80 0.11
Precision (Positive Predictive Value) 0.82 0.11 0.87 0.11 0.85 0.11
Negative Predictive Value 0.85 0.12 0.77 0.13 0.85 0.14
Accuracy 0.83 0.09 0.82 0.09 0.85 0.09
F1 0.86 0.09 0.83 0.09 0.87 0.10
Table 3. Mean processing time per 6,000 frames (time is in milliseconds).
Table 3. Mean processing time per 6,000 frames (time is in milliseconds).
Generic Personalized Reference
Mean 3.31 3.13 7.59
s.d. 0.39 0.36 0.64
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

© 2024 MDPI (Basel, Switzerland) unless otherwise stated