TY - JOUR
T1 - A machine learned classifier for RR Lyrae in the VVV survey
AU - Elorrieta, Felipe
AU - Eyheramendy, Susana
AU - Jordán, Andrés
AU - Dékány, István
AU - Catelan, Márcio
AU - Angeloni, Rodolfo
AU - Alonso-García, Javier
AU - Contreras-Ramos, Rodrigo
AU - Gran, Felipe
AU - Hajdu, Gergely
AU - Espinoza, Néstor
AU - Saito, Roberto K.
AU - Minniti, Dante
N1 - Publisher Copyright:
© 2016 ESO.
PY - 2016/11/1
Y1 - 2016/11/1
N2 - Variable stars of RR Lyrae type are a prime tool with which to obtain distances to old stellar populations in the Milky Way. One of the main aims of the Vista Variables in the Via Lactea (VVV) near-infrared survey is to use them to map the structure of the Galactic Bulge. Owing to the large number of expected sources, this requires an automated mechanism for selecting RR Lyrae, and particularly those of the more easily recognized type ab (i.e., fundamental-mode pulsators), from the 106-107 variables expected in the VVV survey area. In this work we describe a supervised machine-learned classifier constructed for assigning a score to a Ks-band VVV light curve that indicates its likelihood of being ab-type RR Lyrae. We describe the key steps in the construction of the classifier, which were the choice of features, training set, selection of aperture, and family of classifiers. We find that the AdaBoost family of classifiers give consistently the best performance for our problem, and obtain a classifier based on the AdaBoost algorithm that achieves a harmonic mean between false positives and false negatives of ≈7% for typical VVV light-curve sets. This performance is estimated using cross-validation and through the comparison to two independent datasets that were classified by human experts.
AB - Variable stars of RR Lyrae type are a prime tool with which to obtain distances to old stellar populations in the Milky Way. One of the main aims of the Vista Variables in the Via Lactea (VVV) near-infrared survey is to use them to map the structure of the Galactic Bulge. Owing to the large number of expected sources, this requires an automated mechanism for selecting RR Lyrae, and particularly those of the more easily recognized type ab (i.e., fundamental-mode pulsators), from the 106-107 variables expected in the VVV survey area. In this work we describe a supervised machine-learned classifier constructed for assigning a score to a Ks-band VVV light curve that indicates its likelihood of being ab-type RR Lyrae. We describe the key steps in the construction of the classifier, which were the choice of features, training set, selection of aperture, and family of classifiers. We find that the AdaBoost family of classifiers give consistently the best performance for our problem, and obtain a classifier based on the AdaBoost algorithm that achieves a harmonic mean between false positives and false negatives of ≈7% for typical VVV light-curve sets. This performance is estimated using cross-validation and through the comparison to two independent datasets that were classified by human experts.
KW - Methods: data analysis
KW - Methods: statistical
KW - Stars: variables: RR Lyrae
KW - Techniques: photometric
UR - http://www.scopus.com/inward/record.url?scp=84994669203&partnerID=8YFLogxK
U2 - 10.1051/0004-6361/201628700
DO - 10.1051/0004-6361/201628700
M3 - Article
AN - SCOPUS:84994669203
SN - 0004-6361
VL - 595
JO - Astronomy and Astrophysics
JF - Astronomy and Astrophysics
M1 - A82
ER -