Exoplanet Detection Using Kepler Mission Data with Machine Learning
Keywords:
Exoplanet Detection, Supervised Learning, Kepler Mission, Machine Learning, Astronomical Data AnalysisAbstract
The search for habitable planets beyond our solar system has long captivated the scientific community and remains one of the foremost pursuits in modern astronomy. With the advent of space-based missions, such as NASA’s Kepler telescope, our observational capabilities have expanded significantly, resulting in vast volumes of high-quality astronomical data. This data deluge necessitates the development of scalable, automated methods to support astronomers in efficiently analyzing and interpreting these observations. In recent years, machine learning has emerged as a powerful paradigm for automating complex, human-intensive tasks. This study investigates the application of supervised machine learning techniques to the detection of exoplanets using data from NASA’s Kepler mission. The data set comprises Kepler Objects of Interest (KOIs), including both physical and orbital parameters, along with their confirmed classification. We evaluate a range of supervised classifiers, spanning probabilistic, decision tree-based, and neural network models. Our best-performing model, Histogram Gradient Boosting, achieves a precision of 94.6% and a recall of 94.1% on a held-out test set. These results underscore the promise of machine learning in advancing exoplanet detection and offer a pathway toward automating the discovery of planetary systems beyond our own.
References
[1] P. Brennan, “Why Do Scientists Search for Exoplanets? Here Are 7 Reasons,” NASA Exoplanet Exploration Program, 2019. [Online]. Available: https://exoplanets.nasa.gov/news/1610/why-do-scientists-search-forexoplanets-here-are-7-reasons/
[2] A. Malik, B. P. Moster, and C. Obermeier, “Exoplanet detection using machine learning,” Mon. Not. R. Astron. Soc., vol. 513, no. 4, pp. 5505–5516, Jul. 2022, doi: 10.1093/mnras/stab3692.
[3] G. C. Sturrock, B. Manry, and S. Rafiqi, “Machine Learning Pipeline for Exoplanet Classification,” SMU Data Sci. Rev., vol. 2, no. 1, Art. no. 9, 2019. [Online]. Available: https://scholar.smu.edu/datasciencereview/vol2/iss1/9
[4] Y. Jin, L. Yang, and C.-E. Chiang, “Identifying exoplanets with machine learning methods: a preliminary study,” arXiv preprint arXiv:2204.00721, 2022.
[5] “Kepler/K2 Mission Overview,” NASA Astrobiology, [online]. Available: https://astrobiology.nasa.gov/missions/kepler/
[6] “NASA Exoplanet Archive Documentation,” NASA Exoplanet Science Institute, [online]. Available: https://exoplanetarchive.ipac.caltech.edu/docs/data.html
[7] C. R. Harris, K. J. Millman, S. J. van der Walt, R. Gommers, P. Virtanen, D. Cournapeau, E. Wieser, J. Taylor, S. Berg, N. J. Smith, R. Kern, M. Picus, S. Hoyer, M. H. van Kerkwijk, M. Brett, A. Haldane, J. F. del Río, M. Wiebe, P. Peterson, P. Gérard-Marchant, K. Sheppard, T. Reddy, W. Weckesser, H. Abbasi, C. Gohlke, and T. E. Oliphant, “Array programming with NumPy,” Nature, vol. 585, no. 7825, pp. 357–362, Sep. 2020.
[8] F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion, O. Grisel, M. Blondel, P. Prettenhofer, R. Weiss, V. Dubourg, J. Vanderplas, A. Passos, D. Cournapeau, M. Brucher, M. Perrot, and E. Duchesnay, “Scikit-learn: Machine Learning in Python,” J. Mach. Learn. Res., vol. 12, pp. 2825–2830, 2011.
[9] A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin, “Attention is All You Need,” in Adv. Neural Inf. Process. Syst., vol. 30, pp. 5998–6008, 2017.
Downloads
Published
Issue
Section
License
Copyright (c) 2025 American Scientific Research Journal for Engineering, Technology, and Sciences

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Authors who submit papers with this journal agree to the following terms.