NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Topic Modeling Tool for PeTaL (Periodic Table of Life)A topic modeling tool is constructed for the purpose of providing insights from biology to the engineer within the framework of PeTaL (Periodic Table of Life). The machine learning text mining tools–latent Dirichlet allocation (LDA) and nonnegative matrix factorization (NMF) with Kullback-Leibler (KL) divergence—are used to provide topic clusters to the user. Topic clusters are the underlying themes of a paper. For the text modeling problem, NMF-KL is the equivalent of probabilistic latent semantic analysis. Both LDA and NMF-KL are top-performing modeling tools. These tools are used to identify biological specimens relevant to the user. Various organisms solve a particular survival problem in nature differently. The topic clusters allow people without domain expertise to find these cross-topic themes in the body of documents and then branch out and examine papers whose target organisms solve the engineer’s problem. Abstracts from the Journal of Experimental Biology were used as input for the clustering tool in addition to a curated set of articles for validation. The tool is able to accept alternate input sources.
Document ID
20200004326
Acquisition Source
Glenn Research Center
Document Type
Technical Memorandum (TM)
Authors
Brian A Whiteaker
(University of California, San Diego San Diego, California, United States)
Vikram Shyam
(Glenn Research Center Cleveland, Ohio, United States)
Date Acquired
May 11, 2020
Publication Date
March 1, 2021
Subject Category
Cybernetics, Artificial Intelligence And Robotics
Report/Patent Number
GRC-E-DAA-TN62426
E-19659
NASA/TM-2021-220069
Report Number: GRC-E-DAA-TN62426
Funding Number(s)
WBS: 081876.01.03.01
Distribution Limits
Public
Copyright
Public Use Permitted.
Technical Review
Single Expert
Keywords
Machine learning
LDA
NMF
Topic modeling
Biomimicry
Clustering
No Preview Available