TY - JOUR
T1 - Predicting passenger satisfaction in public transportation using machine learning models
AU - Ruiz, Elkin
AU - Yushimito, Wilfredo F.
AU - Aburto, Luis
AU - de la Cruz, Rolando
N1 - Publisher Copyright:
© 2024 Elsevier Ltd
PY - 2024/3
Y1 - 2024/3
N2 - Enhancing the understanding of passenger satisfaction in public transportation is crucial for operators to refine transit services and to establish and elevate quality standards. While many researchers have tackled this issue using diverse tools and methods, the prevalent approach involves surveys with discrete choice models or structural equations. However, a common limitation of these models lies in their inherent assumptions and predefined relationships between dependent and independent variables. To address these limitations, we introduce a novel perspective by harnessing machine learning (ML) models to gauge and predict passenger satisfaction. ML models are advantageous when dealing with complex, non-linear relationships and massive datasets, and do not rely on predefined assumptions. Thus, in this paper, we evaluate four ML models for the prediction of ratings of the quality of transit service. These models were calibrated using data from the Transantiago bus system in Chile. Among the ML models, the Random Forest model emerges as the most effective, showcasing its ability to analyze and predict passengers’ satisfaction levels. We delve deeper into its capabilities by examining the impact of three pivotal variables on passengers’ score ratings: waiting time, bus occupation, and bus speed. The Random Forest model is able to capture threshold values for these variables that significantly influence or have no effect on passenger preferences.
AB - Enhancing the understanding of passenger satisfaction in public transportation is crucial for operators to refine transit services and to establish and elevate quality standards. While many researchers have tackled this issue using diverse tools and methods, the prevalent approach involves surveys with discrete choice models or structural equations. However, a common limitation of these models lies in their inherent assumptions and predefined relationships between dependent and independent variables. To address these limitations, we introduce a novel perspective by harnessing machine learning (ML) models to gauge and predict passenger satisfaction. ML models are advantageous when dealing with complex, non-linear relationships and massive datasets, and do not rely on predefined assumptions. Thus, in this paper, we evaluate four ML models for the prediction of ratings of the quality of transit service. These models were calibrated using data from the Transantiago bus system in Chile. Among the ML models, the Random Forest model emerges as the most effective, showcasing its ability to analyze and predict passengers’ satisfaction levels. We delve deeper into its capabilities by examining the impact of three pivotal variables on passengers’ score ratings: waiting time, bus occupation, and bus speed. The Random Forest model is able to capture threshold values for these variables that significantly influence or have no effect on passenger preferences.
KW - Bus public transportation
KW - Machine learning
KW - Passenger satisfaction
KW - Prediction
UR - http://www.scopus.com/inward/record.url?scp=85185202114&partnerID=8YFLogxK
U2 - 10.1016/j.tra.2024.103995
DO - 10.1016/j.tra.2024.103995
M3 - Article
AN - SCOPUS:85185202114
SN - 0965-8564
VL - 181
JO - Transportation Research Part A: Policy and Practice
JF - Transportation Research Part A: Policy and Practice
M1 - 103995
ER -