1. Introduction
With the rapid and excessive development of motorization, most cities worldwide are confronted with traffic congestion, air deterioration, noise pollution, fossil energy consumption, carbon emissions [
1]. Public transport, especially the train, has excellent advantages in alleviating traffic congestion, reducing fossil energy consumption, improving transportation efficiency, and reducing carbon emissions [
2]. However, unlike private cars, the train provides a “station” to “station” service rather than a “door” to “door” service. Therefore, the last-mile problem has become one of the main obstacles for travelers choosing public transport [
3,
4]. The common last-mile travel modes include walking, shared bicycles, and buses. However, each mode has its advantages and disadvantages. For example, walking has excellent flexibility, economy, and environment-friendliness, and it benefits physical and mental health [
5,
6]. However, walking is only suitable for travel within 10 minutes [
7], and its use is restricted in bad weather (e.g., rain and snow). Shared bicycles solve the problem of access and parking but are likely to cause traffic accidents and traffic congestion[
1]. Similar to walking, shared bicycles are restricted in bad weather. The bus has the advantages of short travel time and low travel cost, but the walking and waiting time are relatively long, and the attractiveness is not enough.
Autonomous vehicle (AV) technology has made great progress in the past few years. The application of AVs will reduce traffic accidents [
8,
9,
10], alleviate traffic congestion [
11,
12], improve fuel economy [
13,
14,
15], and reduce carbon emissions [
16,
17]. It will also improve the mobility of the elderly and the disabled [
18], reduce the travel pressure of drivers [
19], and improve the efficiency of multi-task work [
20]. Therefore, shared AVs have the potential to solve the problem of the last mile of train trips.
Although there is much literature on the impact of shared AVs on travel behavior, most studies mainly focus on the impact of vehicle travel mileage [
21,
22,
23,
24], travel mode choice [
25,
26,
27,
28], and vehicle travel time [
29,
30]. Few studies have explored the use of shared AVs to solve the problem of the last mile of train trips. In addition, most of the literature studies the influence of an individual’s attitude and perception towards AVs and other psychological latent variables on the behavioral intention of choosing AVs. However, a mature theoretical framework that combines behavior with attitude [
31], such as the technology acceptance model and theory of planned behavior, has not been adopted.
In order to fill these gaps, this paper studies latent psychological variables of travelers toward AVs based on the mature theory of planned behavior, including attitudes (ATTs), subjective norms (SNs), perceived behavior control (PBC), and behavioral intention to use (BIU). Then, these variables are incorporated into the discrete choice model to establish a hybrid model with Wuhan as an example to conduct an empirical analysis of the influencing factors and degree of influence of the last-mile travel mode choice behavior of train trips. The results of this paper are expected to improve the service level of the train and provide insights for transportation planning within 1.5 kilometers of subway stations. The main contributions of this paper are listed as follows:
(1) There was a huge heterogeneity of travellers in the travel mode choice of the last mile of train trips; (2) Respondents’ latent psychological variables towards AVs had a significant impact on their travel behaviour, but the impacts vary among different segments; (3) Demographic characteristics, such as education, career, and monthly household income., had a significant impact on the membership of each latent class (LC); (4) The willingness to pay for walking and waiting time. in-vehicle time varied significantly among travellers from different segments; (5) Elastic analysis results illustrated that a 1% increase in the travel cost for shared AV in segment 1 leads to a 7.598% decrease in the choice probability of shared AV.
The paper is structured as follows.
Section 2 presents the literature review. In
Section 3, the structural equation models (SEMs) and LC choice models are shortly discussed.
Section 4 presents survey design, data collection, and descriptive statistics.
Section 5 shows and discusses the results of the final estimated model. At last, conclusions and recommendations for further research are presented in
Section 6. An AV in this paper refers to a fully self-driving vehicle.
2. Literature Review
Travel time and travel costs are considered the most critical factors affecting travel behavior. Ortúzar [
32] applied multinomial logit (MNL) and nested logit (NL) models to investigate travel mode choice in urban corridors and found that in-vehicle time, out-of-vehicle time, and travel costs were significant factors affecting travelers’ mode choice in the Garforth Corridor in West Yorkshire, England. Stern [
33] proposed a correlated MNL model and a Poisson regression model to determine the travel mode choice of elderly and disabled people in rural Virginia and demonstrated that travel cost is related to travel mode. Ewing et al. [
34] confirmed that travel time significantly influenced the school travel mode choice of students in Gainesville, Florida. Frank et al. [
35] established a discrete choice model to explore factors affecting the mode choice and trip chaining patterns of residents in the Central Puget Sound (Seattle) region and ascertained that travel time and cost significantly affect travel behavior. Wang et al. [
36] believed that travel cost determines the travel mode share in Beijing using an NL model. Travelers’ socioeconomic characteristics are also thought to influence travel mode choice significantly. Schwanen et al. [
37] confirmed that contributory factors of travel mode choice of senior citizens for leisure trips include age, gender, car ownership, driver’s license, and educational attainment. Zhang [
38] analyzed the travel mode choice of travelers in Metropolitan Boston and Hong Kong and found that their choices are affected by socioeconomic characteristics, such as age, job, homeownership, children, and car availability. Verplanken et al. [
39] believed that the travel mode choice of university employees is related to age and gender. Tilahun et al. [
40] developed a discrete choice model to explore travel mode choice of commuters in the North-Eastern Illinois area and demonstrated that the choice is effected by gender, age, vehicle/household size, income, and vehicle availability.
Besides travel time, travel costs, and socioeconomic characteristics, the latent psychological variables, such as values, norms, ATTs, perceptions, and desire, are integral to an individual’s travel mode choice [
39,
41,
42]. Many studies explored individuals’ opinions and ATTs regarding BIU AVs. Sanbonmatsu et al. [
43] found that an individual with a higher awareness of AVs has a stronger intention to use the AV. Panagiotopoulos et al. [
44] included that latent variables, such as perceived usefulness, perceived ease to use, perceived trust, and social influence, influence the respondents’ behavioral intentions to use AVs. Choi et al. [
45] and Kaur et al. [
46] concluded that perceived trust positively affects the adoption of AVs. Haboucha et al. [
47] stated that pro-AV sentiments, environmental concern, and technology interest are related to users’ preferences regarding AVs. Lavieri et al. [
48] suggested that privacy sensitivity is related to individuals’ willingness to share trips with strangers. Nevertheless, most studies has not adopted a mature theoretical framework that combines behavior with attitude [
31], such as the technology acceptance model and theory of planned behavior.
The discrete choice model has been widely used to study individual travel behavior. The MNL model is the most common in practical applications due to fewer sample requirements, mature technology, and easy implementation [
49]. Nevertheless, the MNL models have pronounced shortcomings. The model assumes homogenous preferences across different individuals and independence of irrelevant alternatives. If alternatives are independent, the IIA characteristics are not consistent with the actual situation, which may easily lead to the problem of red bus or blue bus [
50,
51]. The NL model came into being in response to the flaw of IIA. The NL model establishes a tree structure based on the correlation between the alternatives: the alternatives are dependent on the same nest but independent among different nests, which overcomes the IIA problem to a certain extent [
32]. The difficulty of NL modeling is to determine the tree structure reasonably [
52].
Unlike the fixed coefficients in the MNL model, the mixed logit (ML) model assumes that the coefficients of the explanatory variables are random and obey a specific probability distribution. Therefore, the ML model can solve the preference heterogeneity problem[
53,
54]. The ML model can also be called the random parameter logit (RPL) model. The ML model needs to determine the distribution type that the model coefficients obey in advance, and then the corresponding parameter values can be estimated [
55]. Typical parameter values are the mean and standard deviation. The former reflects the average preference, while the latter is the magnitude of the preference difference. Like the ML model, the LC model handles the problem of random preference heterogeneity by dividing the respondents into several classes and applying different coefficients, respectively [
56]. As the two primary tools for dealing with preference heterogeneity, the LC and the ML models have relatively similar results. However, most studies show that the LC model is slightly better than the ML model in terms of goodness of fit, theoretical basis, and information richness [
57,
58].
5. Results
5.1. Results of latent variable model
Stata 15.0 was used to test the latent variable model in the study. Confirmatory factor analysis (CFA) was developed to determine the influence of variables on the adoption of AVs. The data needed to be evaluated the reliability and validity before performing CFA.
Table 4 provides details of the reliability and convergent validity of constructs. The standardized factor loadings of 12 observed variables ranged from 0.851 to 0.961, exceeding the standard of 0.5 [
67]. All Cronbach’s alpha values of 4 latent variables were above the acceptable level of 0.70[
68]. The minimum composite reliability (CR) value was 0.921, and all values were higher than the minimum threshold of 0.7 [
69]. The average variance extracted (AVE) values of all constructs were between 0.797 and 0.893, indicating that the measurement model has a good structural reliability and convergence validity [
69].
Table 5 shows the results of the discriminant validity test. All square values of AVE are higher than the inter-construct correlations, demonstrating that the latent variables have acceptable discriminant validity. The measurement model has been validated and used for structural model analysis.
The estimation results of latent variable measurement and SEMs indicated that the model fits the data well based on fit indices such as chi-square/degree of freedom (), the root mean squared error of approximation (RMSEA), the comparative fit index (CFI), the Tucker-Lewis index (TLI), and standardized root mean square residual (SRMR).
=3.775 (critical value is between 1 and 5 when the sample size exceeds 500 according to [
60]), RMSEA=0.064 (less than the critical value of 0.08 based on [
70]); CFI=0.983 (more than the critical value of 0.90 on the basis of [
71]), TLI=0.977 (more than the critical value of 0.90 in accordance with [
68]), SRMR=0.024 (less than the critical value of 0.08 on the strength of [
72]).
5.2. Results of latent class choice model
In principle, more classes mean better goodness of fit at the cost of decreasing parsimony. The BIC and CAIC were proposed to penalize the number of classes.
Table 6 summarizes these measures concerning the models with classes between one to six. Among them, the four-segment LC model has the lowest BIC and CAIC, and the rho-bar squared between 4 and 5 is 0.0009. Four classes might be the optimal number of classes considering the objective of the study and its simplicity. As shown in
Table 7, all models with LCs perform better than the no segment model (MNL model), confirming the heterogeneity of the preferences of the sample.
The selected 4-segment model has a rho-bar squared of 0.400. The class probability model includes socioeconomic characteristics as explanatory variables, and the parameters are shown in
Table 8.
Table 7 presents the model statistics of multinomial and four-segment LC models. Segment 4 only accounts for 7.3% of the sample, with the coefficient of walking and waiting time positively significant while the coefficient of in-vehicle time and trip cost statistically insignificant. The results mean that the respondents of this segment failed to understand the choice task fully.
Segment 1 comprises 28.7% of the sample. The parameter estimates and corresponding z-values in segment 1 indicate that walking and waiting time and trip cost are statistically significant, suggesting that increasing walking and waiting time and travel cost will reduce travelers’ willingness to use a certain travel mode. Most respondents favored walk (93.9%), while a smaller preferred shared bikes (4.8%). The results indicate that class 1 is interested in walking. According to the class member model, respondents who belonged to class 1 are more likely to possess bachelor’s degrees and upper middle income (10001-20000 CNY monthly) and are less likely to own secondary school and below. Furthermore, the latent psychological variables such as ATT and BIU are related to the total utility for using AVs. The ATT towards AVs has a strongly negative contribution (marginal value equals -4.251) to the total utility of using AVs as last-mile transport. The results show that respondents who are positive about adopting AVs are less willing to use AVs as egress mode. The latent psychological variable regarding AV BIU contributes positively to the total utility, indicating that a higher AV BIU decreases the disutility for AVs for the last mile trip.
Segment 2 consists of 37.8% of the respondents. Travelers, who opt for walk, shared bikes, shared AVs, and buses, accounted for 21.8%, 33.5%, 16.2%, and 28.4%. They are more likely to have a bachelor’s degree, work in public servants/public institutions and Enterprise employees, and have middle income compared with other classes. The parameter estimates of travel characteristics are all statistically significant. However, none of the latent psychological variables regarding AVs influence travelers’ preference for AVs as egress mode in segment 2. The results may indicate that respondents in segment 2 are not familiar with AVs.
Segment 3 includes 26.2% of the sample. With 91.3% choosing shared bikes, it is dominated by those who are more likely to take a bike for the last mile. In terms of socio-characteristics, individuals in this class are more likely to possess a bachelor’s degree in the upper-middle-income category, demonstrating that shared bikes are attractive for these travelers in the last mile trip. The waiting and walking time, in-vehicle time, and travel cost proved statistically significant. BIU has a significant positive influence, while PBC has a significant negative effect on the utility function of choosing AVs for last-mile transport.
5.3. Elasticities
Table 9 provides the direct and cross elasticities of the travel cost for all travel modes to study the preference difference among the three LCs, except the residual class 4. The elasticities illustrate the percentage change in the choice probability of four travel modes due to a 1% change in the level of travel cost. For example, a 1% increase in the travel cost for shared AV leads to a 0.649% decrease in the choice probability of shared AV (i.e., direct elasticity), while it causes a 0.105% increase in the probability of choosing walk, shared bike, and bus (i.e., cross elasticity) when considering the entire sample. The direct elasticities of travel cost for all travel modes of the MNL model and class2 were bigger than negative 1, showing that a 1% increase in the travel cost for all travel modes will decrease choice probabilities by less than 1%. However, the direct elasticities of travel cost for all travel modes of class1 are smaller than negative1 and them in MNL model, class2 and class3, which indicates that respondents from class1 are more sensitive to travel cost than individuals from class2 and class3. In class1, a 1% increase in the travel cost for shared AV leads to a 7.598% decrease in the choice probability of shared AV.
5.4. Willingness to pay
Table 10 shows travelers’ willingness to pay for each of the segments. Individuals from segment 1 who are sensitive to travel costs were willing to pay 14.5 CNY to reduce one-hour walking and waiting time. In-vehicle time was found to have an insignificant influence on respondents’ last-mile egress mode. Conversely, travelers in segment 3 were willing to pay for as much as 441.3 CNY and 63.0 CNY to decrease one-hour walking and waiting time and in-vehicle time. The results are consistent with previous research. Seelhorst and Liu (2015) found that price-sensitive travelers were willing to pay less to reduce travel time [
73]. Wen and Lai (2010) discovered that travelers with high incomes were willing to pay more to improve service quality [
74].
6. Discussion
6.1. Summary of results
This study positioned shared AVs in the public transport market and applied an LC model to understand the unobserved preference heterogeneity across respondents. Four distinct market segments concerning the preference were identified for the last mile travel mode choice of multimodal train trips. By analyzing the preference heterogeneity and group characteristics of these LCs, we can determine the target group for using shared AVs in the last mile of train trips. Travelers who choose shared AVs with the highest proportion belong to segment 2. These people are more likely to possess a bachelor’s degree, work in public servants/public institutions, and enterprise employees with a middle income.
The impact of latent psychological variables on different groups is significantly different. The latent psychological variables of AVs have no significant impact on the trips of travelers from segment 2, indicating that travelers lack sufficient knowledge of AVs. The direct elasticity analysis shows that the travelers from segment 1 are most sensitive to travel costs, and the direct elasticity value reaches -7.598, which indicates that a 1% increase in the travel cost for shared AV leads to a 7.598% decrease in the choice probability of shared AV. The cross-elasticity analysis shows that the cost of shared bicycles has the greatest impact on shared AVs. In segment 3, a 1% increase in the travel cost for shared bikes will cause a 0.472% increase in the choice probability of shared AV.
Travelers from different segments have different willingness to pay for walking and waiting time and in-vehicle time. In-vehicle time has an insignificant effect on travelers from segment 1. Unlike travelers from segment 1, travelers from segment 2 are willing to pay 20.4 CNY to reduce one-hour In-vehicle time, while travelers from segment 3 are willing to pay 60.3 CNY to reduce one-hour in-vehicle time. Walking and waiting time significantly impact travelers from all segments, but the magnitude of the impact varies greatly. Travelers from segment 2 are willing to pay 11.0 CNY to reduce one-hour walking and waiting time, while travelers from segment 3 are willing to pay up to 441.3 CNY to reduce one-hour walking and waiting time. The results show that the value of travel time for high-income groups is relatively higher.
6.2. Contributions and comparison to literature
The contribution of this paper mainly includes three aspects. From the perspective of research objects, although Chinese AVs have made rapid progress, there is no research on Chinese shared AVs to solve the problem of the last mile of train trips. Although the Avs have been studied in the Netherlands, Atlanta area, Ann Arbor-Detroit Area, and other areas [
3,
4,
75], the results of these areas are not applicable to other areas.
From the perspective of research methods, this paper is different from the existing research, which uses cluster analysis to identify user groups [
76]. This paper is the first time that used the LC logit model to study travelers’ preferences to choose shared AVs to solve the last mile problem of train trips.
From the perspective of influencing factors, in addition to socioeconomic attributes and travel characteristic variables that have an impact on travel behavior. Travelers’ psychological latent variables have a significant impact on travel mode choice. For example, the perceived trust and perceived reliability of AVs affect travelers’ preference for Avs [
3]. Nevertheless, few studies focused on mature theoretical frameworks that combine behavior and attitude [
31]. This paper adopts the theory of planned behavior to study latent psychological variables, including ATTs, SNs, PBC, and BIU of travelers towards AVs.
6.3. Limitations and further work
This paper still needs to be further studied. Firstly, the results of this paper are mainly for the research area. Research results in different countries and regions will also vary. Secondly, the COVID-19 has an important impact on the travel behavior of travelers. Future research can consider the impact of COVID-19 on travelers’ preference for shared AVs. Finally, this paper mainly studies the option of train and transfer while ignoring the option of private AVs throughout the journey. More options for travel indicate more complex the system and greater uncertainty. Studying more modes of travel helps to understand this uncertainty.