NTRS - NASA Technical Reports Server

Natural Language Processing Techniques for Intelligent Knowledge Management of Safety Reports

Safety, failure, and incident reports are common artifacts across various domains, including aviation and wildfire response. Submission of these reports is often mandatory, which leads to the accumulation of large repositories of text-based documents. At the same time, these reports and their repositories are often analyzed only manually and queried by users via out-of-date search engines. To address this, we have been developing the Manager for Intelligent Knowledge Access (MIKA) toolkit, which uses natural language processing to improve information access and reuse. In this presentation, we discuss natural language processing techniques for knowledge discovery and apply these methods to a repository of aerial wildfire mishap reports. Two methods are used for knowledge discovery: topic modeling and named-entity recognition. We use topic modeling to identify hazards and perform a trend analysis to produce a data-driven risk matrix. A custom named-entity recognition model, built by fine-tuning a pre-trained language model, is used to identify failure modes, failure causes, failure effects, control processes, and recommendations to aid in failure modes and effects analysis (FMEA). Throughout the presentation, we discuss and apply natural language processing techniques to better leverage the vast amount of information contained in report repositories.
Document ID
20220016425
Acquisition Source
Ames Research Center
Document Type
Presentation
Authors
Sequoia Andrade
(Wyle (United States) El Segundo, California, United States)
Hannah Walsh
(Ames Research Center Mountain View, California, United States)
Date Acquired
October 31, 2022
Publication Date
November 15, 2022
Subject Category
Documentation And Information Science
Meeting Information
Meeting: NASA Data Science Summit
Location: Hampton, VA
Country: US
Start Date: November 15, 2022
End Date: November 17, 2022
Sponsors: National Aeronautics and Space Administration
Funding Number(s)
CONTRACT_GRANT: 80ARC020D0010
Distribution Limits
Public
Copyright
Public Use Permitted.
Technical Review
NASA Peer Committee
Keywords
Machine learning
Information extraction
Natural language processing
Safety