1. Introduction
Over the past few decades, transforaminal lumbar interbody fusion (TLIF) has been commonly used to treat lumbar degenerative diseases, demonstrating the benefits of achieving satisfactory arthrodesis through a unilateral approach with minimal impingement on neural components [
1,
2]. In addition to relieving spinal nerve compression, the primary objective of TLIF is to restore sagittal balance and the intervertebral body height [
3,
4,
5].
In terms of sagittal alignment, several studies have reported a close relationship between postoperative sagittal malalignment and postoperative residual symptoms in patients with lumbar fusion [
5,
6]. Among the parameters of spinal alignment, subtraction of lumbar lordosis (LL) from the pelvic incidence (PI) is a crucial indicator of postoperative outcomes after short-segment lumbar interbody fusion for lumbar pathologies. Patients with PI-LL (PI minus LL) mismatch have increased risks of adjacent segment disease (ASD), late surgical complications, and revision surgery [
7,
8,
9]. Therefore, postoperative alignment prognosis, especially for critical parameters such as PI-LL, is required for optimal preoperative planning for lumbar fusion. However, predicting postoperative alignment in patients is challenging. Ailon et al. [
10] reported that only 42% of cases were accurately predicted by 17 experienced surgeons specializing in treating spinal deformity. Although various methods exist for predicting postoperative parameters in patients with adult spinal deformity [
11,
12], a method for predicting the value of PI-LL in TLIF procedures still needs to be developed.
Selecting an interbody cage with the correct height is a crucial aspect of lumbar interbody fusion. Utilizing an undersized cage may result in the inability to restore the intervertebral height and segmental lordosis, as well as in complications such as pseudarthrosis and cage migration [
13,
14,
15]. By contrast, utilizing an oversized cage may increase the likelihood of nerve root compression, ASD, or cage subsidence [
15]. In clinical practice, the cage height has long been selected subjectively by surgeons depending on their operational experience. Few studies have predicted the height of fusion cages on the basis of the intervertebral height of the pathological segment [
16] or the anterior and posterior disc height on a preoperative computed tomography (CT) image [
17]. However, in severe degenerative diseases, such as spondylolisthesis and spinal deformity, when the disc height is greatly reduced, these methods are often inaccurate. Thus, estimating the height of interbody cages remains a challenge.
The choice of the cage height affects sagittal balance (and vice versa), and preoperative spinal parameters play a key role in determining the appropriate size of the implanted device for achieving favorable parameters after surgery [
16,
18]. Therefore, regression models should be developed to predict the interbody cage height and postoperative parameters from preoperative parameters. However, manual measurements are time-consuming for obtaining all parameters and are prone to rater-dependent errors. Currently, automated tools involving artificial intelligence (AI) are used to increase the accuracy and speed of measuring spinal alignment parameters from radiographic images [
12,
19]. The purpose of this study was to develop a dedicated pipeline based on AI and machine learning (ML) that can reliably predict the interbody cage height and postoperative PI-LL in TLIF surgery from preoperative X-ray images.
4. Discussion
Spinopelvic alignment restoration is essential for both adult spinal deformity surgery and short-segment lumbar interbody fusion [
8,
26,
27]. However, determining the influence of each factor on sagittal alignment is difficult because the normal standing posture is jointly determined by multiple lumbosacral factors [
28,
29]. As shown in
Figure 6 in the present study, the postoperative value of PI-LL is substantially influenced by the preoperative values of LL, RLL, and PI. However, because the PI value is regarded as
a constant anatomic feature with slight variation in pathologic disorders or lumbar spine interventions [
30]
, determining the postoperative LL is typically necessary for predicting the optimal PI-LL. According to previous research, LL restoration after surgery is closely linked to preoperative LL and PI [
31,
32,
33,
34]. Therefore, LL and PI can be used to predict the LL and PI-LL values after surgery, as in our model.
Appropriate parameters must be obtained for enhancing surgical quality, and surgeons must develop effective strategies to achieve harmonious sagittal alignment. Our model demonstrated a strong capacity to generate a satisfactory PI-LL value while being able to forecast the potential range of this value. By selecting patients without ASD for the dataset, the algorithm trained on these data was able to generate a favorable PI-LL value, which can be used to reduce the incidence of ASD in patients [
7]. Our PI-LL prediction model was also able to provide predictions for surgical planning in selecting the appropriate surgical technique and instruments. Actually, the optimal PI-LL has been the subject of debate. Satoshi et al. [
35] reported that this value is inconsistent. Meanwhile, multiple studies have suggested that surgeons must strive to reduce PI-LL to 10° or less whenever possible [
8,
36,
37]. According to our model, if unsatisfactory PI-LL prediction values are obtained before surgery, surgeons could consider implementing additional intraoperative techniques. To achieve an adequate LL value, strong fixation with a curved rod system can be implemented. In some cases of severe hypolordosis, osteotomy techniques such as pedicle subtraction osteotomy are also a viable option [
38]. Furthermore, the predictive results of postoperative PI-LL from our algorithm may aid in rod bending or in the determination of the number of spinal levels requiring fixation when a surgeon receives intraoperative fluoroscopic images. However, previous studies have revealed substantial discrepancies between standing and prone angle measurements [
39,
40]. Therefore, these models must be further developed to ensure their seamless integration from preoperative planning to actual surgery.
Size, shape, and position play a crucial role in the insertion of an intervertebral cage. However, findings regarding the importance of the implant shape and placement have been inconsistent. Cage lordosis and final LL after surgery are strongly correlated, with more anterior placement resulting in greater intervertebral lordosis [
18]. Conversely, some
in vitro biomechanical and clinical studies have reported that the cage position and geometry do not affect sagittal alignment after lumbar interbody fusion [
41,
42,
43]. The cage height typically serves as a key factor applied by surgeons for improving lordosis [
44,
45], and our research has primarily focused on predicting this index. Most of our cage height values were between 12 and 13 mm, which are consistent with the recommended cage heights of 11, 12, or 13 mm for the L3-4 and L4-5 levels in a previous study conducted in the Chinese population [
16]. In addition, our model performed well for cases within this range, indicating its potential clinical applicability for the Asian population. Overall, predicting the appropriate interbody cage size can assist surgeons in decision-making and improve postoperative outcomes, particularly for inexperienced surgeons. Prediction using our model can also provide the cage height with an error of approximately 1 mm only (
Figure 4). Consequently, fewer cages need to be sterilized, thus reducing the costs of surgery. In addition, the costs of treatment decrease due to the reduced operation duration and complication rates. Therefore, patients evidently benefit from the development of these models.
Our results indicated that the disc height of the pathological segment and the two adjacent levels plays a crucial role in predicting the height of the interbody cage (
Figure 6). To predict this value, Wang et al. [
16] developed a regression model that emphasizes the importance of the intervertebral height at the midpoint of the pathological segment (MIVH): interbody cage height = 11.123 − 0.563*gender + 0.149*MIVH. In our study, gender was one of the final 23 features used to build the optimal model, but its influence was not as evident as that of the other parameters. With the exception of the parameters associated with the intervertebral disc height, PI and LL contributed to the prediction of the interbody cage height. These two parameters also contributed to the aforementioned prediction of postoperative PI-LL. Lafage et al. [
11] discovered that pelvic retroversion and global sagittal balance in adult patients with spinal deformities were primarily influenced by the PI and LL values. Here, we emphasized that PI and LL are among the most crucial parameters for both long- and short-segment fusion surgeries.
Multiple researchers have attempted to develop algorithms for predicting postoperative sagittal parameters and the interbody cage height, with the primary goal of improving the accuracy and usability of these algorithms in daily practice. Previous formulas typically included a limited number of significant variables to simplify computations. Lafage et al. [
11,
46] developed one of the most accurate formulas for predicting the sagittal vertical axis (SVA). They used only four variables in their formula: PI, LL, thoracic kyphosis, and age. Legaye and Duval-Beaupère [
30,
47] proposed multilinear regression models for calculating LL by using only basic parameters, such as thoracic kyphosis, SS, PI, PT, and T9 spinopelvic inclination. In contrast to previous research, our goal was to incorporate all significant parameters that can be measured in the lumbar region for developing an algorithm. Because our prediction models (LR for the interbody cage height and SVR for postoperative PI-LL) and previous models share the same characteristic of utilizing multiple linear algorithms, we took advantage of the current technological advancements and incorporated as many variables as possible. However, certain factors, such as the width and length of the vertebral body, were found to be crucial features in our model, which has never been mentioned in the medical literature; this is likely due to a coincidence during model training with our dataset, necessitating further verification in future research. Previously, the use of multiple parameters may have been inconvenient for daily practice. With the help of computers, the accuracy of predictions can now be improved, and these predictions can be applied in clinical settings. According to Langella et al. [
48], computer-assisted methods are associated with a failure rate below 20% for predicting PI and SVA. To the best of our knowledge
, this is the first study to provide a pipeline and various models for predicting PI-LL and the cage height from preoperative X-ray images by using AI.
This study has several limitations. First, this was a retrospective, single-center study with a small sample size. As a result, the optimal interbody cage height or postoperative PI-LL may be influenced by subjective factors such as the surgeon’s technique and patient demographics. Nevertheless, by using multiple algorithms, this study introduced a novel concept and laid the foundation to ensure that these predictions are more accurate in future multicenter studies. Second, this study was limited to patients with monosegmental TLIF at the L4L5 level, and only one sagittal parameter, PI-LL, was predicted. However, using our algorithms, a large number of postoperative parameters can be predicted not only for single-level fusion surgery but also for surgeries involving multiple levels. Third, sagittal balance is associated with factors such as SVA, T1 spinopelvic inclination, and C7 plumb line, which are evaluated using full-length spine radiographs [
37,
49,
50]. Because we focused only on short-segment fusion, we examined only the lumbar region. Therefore, global sagittal balance factors must be examined for TLIF surgery in the future. Finally, our model comprised many steps, thereby increasing the likelihood of errors. To increase the accuracy of predictions, a synthetic model must be developed using radiographic parameters from X-ray, MRI, and CT scans.
Author Contributions
Conceptualization, A.T.B., T.-J.H. and M.-H.W.; Methodology, A.T.B. and M.-H.W.; Software, H.L., P.-I.T., and K.-J.C.; Validation, H.-C.S., E.-W.H. and C.-C.H.; Formal Analysis, H.L. and K.L.-C.H.; Data Curation, K.L.-C.H., C.-Y.L. and P.-Y.W. ; Writing – Original Draft Preparation, A.T.B., H.L., and G.M.T.; Writing – Review & Editing, T.T.H., H.-C.S., M.M., C.-Y.L., T.-J.H., and M.-H.W.; Visualization, A.T.B. and H.L.; Supervision, T.T.H., H.-C.S., T.-J.H., and M.-H.W.; Project Administration, M.-H.W. All authors have read and agreed to the published version of the manuscript.