TY - JOUR
T1 - Error-rate estimation in discriminant analysis of non-linear longitudinal data
T2 - A comparison of resampling methods
AU - de la Cruz, Rolando
AU - Fuentes, Claudio
AU - Meza, Cristian
AU - Núñez-Antón, Vicente
N1 - Publisher Copyright:
© 2016, © The Author(s) 2016.
PY - 2018/4/1
Y1 - 2018/4/1
N2 - Consider longitudinal observations across different subjects such that the underlying distribution is determined by a non-linear mixed-effects model. In this context, we look at the misclassification error rate for allocating future subjects using cross-validation, bootstrap algorithms (parametric bootstrap, leave-one-out,.632 and .632+), and bootstrap cross-validation (which combines the first two approaches), and conduct a numerical study to compare the performance of the different methods. The simulation and comparisons in this study are motivated by real observations from a pregnancy study in which one of the main objectives is to predict normal versus abnormal pregnancy outcomes based on information gathered at early stages. Since in this type of studies it is not uncommon to have insufficient data to simultaneously solve the classification problem and estimate the misclassification error rate, we put special attention to situations when only a small sample size is available. We discuss how the misclassification error rate estimates may be affected by the sample size in terms of variability and bias, and examine conditions under which the misclassification error rate estimates perform reasonably well.
AB - Consider longitudinal observations across different subjects such that the underlying distribution is determined by a non-linear mixed-effects model. In this context, we look at the misclassification error rate for allocating future subjects using cross-validation, bootstrap algorithms (parametric bootstrap, leave-one-out,.632 and .632+), and bootstrap cross-validation (which combines the first two approaches), and conduct a numerical study to compare the performance of the different methods. The simulation and comparisons in this study are motivated by real observations from a pregnancy study in which one of the main objectives is to predict normal versus abnormal pregnancy outcomes based on information gathered at early stages. Since in this type of studies it is not uncommon to have insufficient data to simultaneously solve the classification problem and estimate the misclassification error rate, we put special attention to situations when only a small sample size is available. We discuss how the misclassification error rate estimates may be affected by the sample size in terms of variability and bias, and examine conditions under which the misclassification error rate estimates perform reasonably well.
KW - Parametric bootstrap
KW - bootstrap.632 and.632+
KW - classification error rate
KW - cross-validation bootstrap
KW - leave-one-out bootstrap
KW - longitudinal data
KW - mixed-effects models
KW - non-linear models
UR - http://www.scopus.com/inward/record.url?scp=85020502396&partnerID=8YFLogxK
U2 - 10.1177/0962280216656246
DO - 10.1177/0962280216656246
M3 - Article
C2 - 27405324
AN - SCOPUS:85020502396
SN - 0962-2802
VL - 27
SP - 1153
EP - 1167
JO - Statistical Methods in Medical Research
JF - Statistical Methods in Medical Research
IS - 4
ER -