NASA Logo

NTRS

NTRS - NASA Technical Reports Server

Back to Results
Identification of Security related Bug Reports via Text Mining using Supervised and Unsupervised ClassificationThis paper is focused on automated classification of software bug reports to security and non-security related, using both supervised and unsupervised approaches. For both approaches, three types of feature vectors are used. For supervised learning, we experiment with multiple learning algorithms and training sets with different sizes. Furthermore, we propose a novel unsupervised approach based on anomaly detection. The evaluated is based on three NASA datasets. The results show that supervised classification is affected more by the learning algorithms than by feature vectors and using only 25% of the data for training provides as good results as if 90% of data are used for training. Both supervised and unsupervised learning can be used for identification of security bug reports; the former slightly outperforms the latter at the expense of labeling the testing set. In general, the performance differs across datasets, mainly due to the different amounts of security related information.
Document ID
20180004739
Acquisition Source
Goddard Space Flight Center
Document Type
Conference Paper
Authors
Goseva-Popstojanova, Katerina
(West Virginia Univ. Morgantown, WV, United States)
Tyo, Jacob
(West Virginia Univ. Morgantown, WV, United States)
Date Acquired
August 28, 2018
Publication Date
July 17, 2018
Subject Category
Computer Programming And Software
Report/Patent Number
GSFC-E-DAA-TN53739
Meeting Information
Meeting: IEEE International Conference on Software Quality, Reliability and Security
Location: Lisbon
Country: Portugal
Start Date: July 16, 2018
End Date: July 20, 2018
Sponsors: Institute of Electrical and Electronics Engineers
Funding Number(s)
CONTRACT_GRANT: NSF-CNS-1618629
CONTRACT_GRANT: NNG12SA03C
Distribution Limits
Public
Copyright
Use by or on behalf of the US Gov. Permitted.
No Preview Available