NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
MLtool: Universal Supervised Machine Learning Tool to Model Tabulated DataMachine Learning (ML) is a subfield of Artificial Intelligence that gives computers the ability to learn from past data without being explicitly programmed. The predictive capabilities of ML models have already been used to facilitate several scientific breakthroughs. However, the practical application of ML is often limited due to the gaps in technical knowledge of its users. The common issue faced by many scientific researchers is the inability to choose the appropriate ML pipelines that are needed to treat real-world data, which is often sparse and noisy. To solve this problem, we have developed an automated Machine Learning tool (MLtool) that includes a set of ML algorithms and approaches to aid scientific researchers. The current version of MLtool is implemented as an object-oriented Python code that is easily extensible. It includes 44 different regression algorithms used to model data. MLtool helps users select the best model for their data, based on the scoring metrics used. Besides regression algorithms, MLtool also includes a suite of pre- and post-processing techniques such as missing value imputation, categorical variable encoding, input feature normalization, uncertainty quantification, exploratory data analysis (EDA), etc. MLtool was tested on several publicly available multi-dimensional data sets and was found capable of making accurate predictions.
Document ID
20220003102
Acquisition Source
Ames Research Center
Document Type
Technical Publication (TP)
Authors
Nishan Senanayake ORCID
(Case Western Reserve University Cleveland, Ohio, United States)
Joshua Stuckner
(Glenn Research Center Cleveland, Ohio, United States)
Shreyas J Honrao ORCID
(KBR (United States) Houston, Texas, United States)
Stephen Raymond Xie
(KBR (United States) Houston, Texas, United States)
Bethany Wu
(Universities Space Research Association Columbia, Maryland, United States)
Nikolai A Zarkevich ORCID
(Ames Research Center Mountain View, California, United States)
Date Acquired
February 24, 2022
Publication Date
March 1, 2022
Subject Category
Cybernetics, Artificial Intelligence and Robotics
Report/Patent Number
NASA/TP-20220003102
Funding Number(s)
CONTRACT_GRANT: SAA3-1681
CONTRACT_GRANT: 80ARC020D0010
Distribution Limits
Public
Copyright
Portions of document may include copyright protected material.
Keywords
Machine learning
Data science
No Preview Available