(Publisher of Peer Reviewed Open Access Journals)

International Journal of Advanced Computer Research (IJACR)

ISSN (Print):2249-7277    ISSN (Online):2277-7970
Volume-11 Issue-56 September-2021
Full-Text PDF
Paper Title : Classifying bug reports to bugs and other requests: an approach using topic modelling and fuzzy set theory
Author Name : Mohammed D. Ali and Ahmed A. Abusnaina
Abstract :

Stakeholders of a software system usually deliver bug reports to a software bug tracking system to report problems they encounter during the use of that system. After that, those incoming requests are assigned to the proper technicians to be analysed and fixed. However, issue reporters frequently misclassify bug requests as non-bug and vice versa. This problem is called bug report misclassification. In fact, it is very costly in terms of time and effort as developers have to manually reclassify those requests to then be able to take the appropriate action. Accordingly, the automatic classification of incoming new reports would be of a valuable demanded feature in bug tracking systems. Careful analysis for literature related to this problem was carried out. It has been found that a hybrid approach combining between topic modelling as a feature extraction technique and the fuzzy logic as a classification technique is promising but rarely utilized approach. In this work, a combined approach of topic modelling and fuzzy logic was introduced to classify bug reports to bugs and non-bugs. The proposed approach was validated using three open-source projects. Finally, the conducted experiments have shown that proposed classifier achieves significant and competitive predictive accuracy.

Keywords : Software maintenance, Bug reports, Fuzzy sets, Topic modelling, Classification, Bug tracking systems.
Cite this article : Ali MD, Abusnaina AA. Classifying bug reports to bugs and other requests: an approach using topic modelling and fuzzy set theory. International Journal of Advanced Computer Research. 2021; 11(56):103-115. DOI:10.19101/IJACR.2021.1152031.
References :
[1]Pandey N, Sanyal DK, Hudait A, Sen A. Automated classification of software issue reports using machine learning techniques: an empirical study. Innovations in Systems and Software Engineering. 2017; 13(4):279-97.
[Crossref] [Google Scholar]
[2]Xia X, Lo D, Ding Y, Al-Kofahi JM, Nguyen TN, Wang X. Improving automated bug triaging with specialized topic model. IEEE Transactions on Software Engineering. 2016; 43(3):272-97.
[Crossref] [Google Scholar]
[3]Pingclasai N, Hata H, Matsumoto KI. Classifying bug reports to bugs and other requests using topic modeling. In Asia-pacific software engineering conference 2013(pp. 13-8). IEEE.
[Crossref] [Google Scholar]
[4]Otoom AF, Al-jdaeh S, Hammad M. Automated classification of software bug reports. In proceedings of the 9th international conference on information communication and management 2019 (pp. 17-21).
[Crossref] [Google Scholar]
[5]Chawla I, Singh SK. An automated approach for bug categorization using fuzzy logic. In proceedings of the 8th india software engineering conference 2015 (pp. 90-9).
[Crossref] [Google Scholar]
[6]Zhou Y, Tong Y, Gu R, Gall H. Combining text mining and data mining for bug report classification. Journal of Software: Evolution and Process. 2016; 28(3):150-76.
[Crossref] [Google Scholar]
[7]Chawla I, Singh SK. Automated labeling of issue reports using semi supervised approach. Journal of Computational Methods in Sciences and Engineering. 2018; 18(1):177-91.
[Crossref] [Google Scholar]
[8]Jangra M, Singh SK. Evaluating topic modeling as pre-processing for component prediction in bug reports. In advanced computing and communication technologies 2016 (pp. 465-73). Springer, Singapore.
[Crossref] [Google Scholar]
[9]Terdchanakul P, Hata H, Phannachitta P, Matsumoto K. Bug or not? bug report classification using n-gram idf. In international conference on software maintenance and evolution 2017 (pp. 534-8). IEEE.
[Crossref] [Google Scholar]
[10]Qin H, Sun X. Classifying bug reports into bugs and non-bugs using lstm. In proceedings of the tenth Asia-Pacific symposium on internetware 2018 (pp. 1-4).
[Crossref] [Google Scholar]
[11]Kukkar A, Mohana R. A supervised bug report classification with incorporate and textual field knowledge. Procedia Computer Science. 2018; 132:352-61.
[Crossref] [Google Scholar]
[12]Herzig K, Just S, Zeller A. Its not a bug, its a feature: how misclassification impacts bug prediction. In 2013 international conference on software engineering 2013(pp. 392-401). IEEE.
[Crossref] [Google Scholar]
[13]Hammad M, Alzyoudi R, Otoom AF. Automatic clustering of bug reports. International Journal of Advanced Computer Research. 2018; 8(39):313-23.
[Crossref] [Google Scholar]
[14]Behl D, Handa S, Arora A. A bug mining tool to identify and analyze security bugs using naive bayes and tf-idf. In international conference on reliability optimization and information technology 2014 (pp. 294-9). IEEE.
[Crossref] [Google Scholar]
[15]Antoniol G, Ayari K, Di Penta M, Khomh F, Guéhéneuc YG. Is it a bug or an enhancement? a text-based approach to classify change requests. In proceedings of the conference of the center for advanced studies on collaborative research: meeting of minds 2008 (pp. 304-18).
[Crossref] [Google Scholar]
[16]Chen TH, Thomas SW, Hassan AE. A survey on the use of topic models when mining software repositories. Empirical Software Engineering. 2016; 21(5):1843-919.
[Crossref] [Google Scholar]
[17]Zhang T, Chen J, Yang G, Lee B, Luo X. Towards more accurate severity prediction and fixer recommendation of software bugs. Journal of Systems and Software. 2016; 17:166-84.
[Crossref] [Google Scholar]
[18]Zolkeply MS, Shao J. Classifying software issue reports through association mining. In proceedings of the 34th ACM/SIGAPP symposium on applied computing 2019 (pp. 1860-3).
[Crossref] [Google Scholar]
[19]Panda RR, Nagwani NK. Software bug categorization technique based on fuzzy similarity. In 9th international conference on advanced computing 2019 (pp. 1-6). IEEE.
[Crossref] [Google Scholar]
[20]http://nlp.stanford.edu/software/tmt/tmt- 0.4/. Accessed: 10-September-2021.
[21]Zadeh LA. Fuzzy sets. In fuzzy sets, fuzzy logic, and fuzzy systems: selected papers by Lotfi A Zadeh 1996 (pp. 394-432).
[Crossref] [Google Scholar]
[22]http://jackrabbit.apache.org/. Accessed: 10-September-2021.
[23]http://lucene.apache.org/. Accessed: 10-September-2021.
[24]http://hc.apache.org/. Accessed: 10-September-2021.
[25]https://www.st.cs.uni-saarland.de//softevo/. Accessed: 10-September-2021.
[26]Brownlee J. Machine learning mastery with python. Machine Learning Mastery Pty Ltd. 2016; 527:100-20.
[Google Scholar]
[27]Han J, Pei J, Kamber M. Data mining: concepts and techniques. Elsevier; 2011.
[Google Scholar]
[28]Brownlee J. Imbalanced classification with Python: better metrics, balance skewed classes, cost-sensitive learning. Machine Learning Mastery; 2020.
[Google Scholar]