TY - JOUR
T1 - Automatic vetting of planet candidates from ground based surveys: Machine learning with NGTS
AU - Armstrong, David J.
AU - Günther, Maximilian N.
AU - McCormac, James
AU - Smith, Alexis M. S.
AU - Bayliss, Daniel
AU - Bouchy, François
AU - Burleigh, Matthew R.
AU - Casewell, Sarah
AU - Eigmüller, Philipp
AU - Gillen, Edward
AU - Goad, Michael R.
AU - Hodgkin, Simon T.
AU - Jenkins, James S.
AU - Louden, Tom
AU - Metrailler, Lionel
AU - Pollacco, Don
AU - Poppenhaeger, Katja
AU - Queloz, Didier
AU - Raynard, Liam
AU - Rauer, Heike
AU - Udry, Stéphane
AU - Walker, Simon R.
AU - Watson, Christopher A.
AU - West, Richard G.
AU - Wheatley, Peter J.
PY - 2018/5/18
Y1 - 2018/5/18
N2 - State of the art exoplanet transit surveys are producing ever increasing
quantities of data. To make the best use of this resource, in detecting
interesting planetary systems or in determining accurate planetary
population statistics, requires new automated methods. Here we describe
a machine learning algorithm that forms an integral part of the pipeline
for the NGTS transit survey, demonstrating the efficacy of machine
learning in selecting planetary candidates from multi-night ground based
survey data. Our method uses a combination of random forests and
self-organising-maps to rank planetary candidates, achieving an AUC
score of 97.6% in ranking 12368 injected planets against 27496 false
positives in the NGTS data. We build on past examples by using injected
transit signals to form a training set, a necessary development for
applying similar methods to upcoming surveys. We also make the autovet
code used to implement the algorithm publicly accessible. autovet is
designed to perform machine learned vetting of planetary candidates, and
can utilise a variety of methods. The apparent robustness of machine
learning techniques, whether on space-based or the qualitatively
different ground-based data, highlights their importance to future
surveys such as TESS and PLATO and the need to better understand their
advantages and pitfalls in an exoplanetary context.
AB - State of the art exoplanet transit surveys are producing ever increasing
quantities of data. To make the best use of this resource, in detecting
interesting planetary systems or in determining accurate planetary
population statistics, requires new automated methods. Here we describe
a machine learning algorithm that forms an integral part of the pipeline
for the NGTS transit survey, demonstrating the efficacy of machine
learning in selecting planetary candidates from multi-night ground based
survey data. Our method uses a combination of random forests and
self-organising-maps to rank planetary candidates, achieving an AUC
score of 97.6% in ranking 12368 injected planets against 27496 false
positives in the NGTS data. We build on past examples by using injected
transit signals to form a training set, a necessary development for
applying similar methods to upcoming surveys. We also make the autovet
code used to implement the algorithm publicly accessible. autovet is
designed to perform machine learned vetting of planetary candidates, and
can utilise a variety of methods. The apparent robustness of machine
learning techniques, whether on space-based or the qualitatively
different ground-based data, highlights their importance to future
surveys such as TESS and PLATO and the need to better understand their
advantages and pitfalls in an exoplanetary context.
KW - planets and satellites: detection
KW - planets and satellites: general
KW - methods: data analysis
KW - methods: statistical
U2 - 10.1093/mnras/sty1313
DO - 10.1093/mnras/sty1313
M3 - Article
SN - 0035-8711
JO - Monthly Notices of the Royal Astronomical Society
JF - Monthly Notices of the Royal Astronomical Society
ER -