A Novel Two-Step Classification Approach for Runtime Performance Improvement of Duplicate Bug Report Detection

سال انتشار: 1402
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 59

فایل این مقاله در 14 صفحه با فرمت PDF قابل دریافت می باشد

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_CKE-6-1_001

تاریخ نمایه سازی: 3 آبان 1402

چکیده مقاله:

Duplicate Bug Report Detection (DBRD) is one of the famous problems in software triage systems like Bugzilla. There are two main approaches to this problem, including information retrieval and machine learning. The second one is more effective for validation performance. Duplicate detection needs feature extraction, which is a time-consuming process. Both approaches suffer runtime issues, because they should check the new bug report to all bug reports in the repository, and it takes a long time for feature extraction and duplicate detection. This study proposes a new two-step classification approach which tries to reduce the search space of the bug repository search space in the first step and then check the duplicate detection using textual features. The Mozilla and Eclipse datasets are used for experimental evaluation. The results show that overall, ۸۷.۷۰% and ۸۹.۰۱% validation performance achieved averagely for accuracy and F۱-measure, respectively. Moreover, ۹۵.۸۵% and ۸۷.۶۵% of bug reports can be classified in step one very fast for Eclipse and Mozilla datasets, respectively, and the other one needs textual feature extraction until it can be checked by the traditional DBRD approach. An average of ۹۰% runtime improvement is achieved using the proposed method.

نویسندگان

Behzad Soleimani Neysiani

Department of Software Engineering, University of Kashan, Kashan, Iran.

Seyed Morteza Babamir

Department of Software Engineering, University of Kashan, Kashan, Iran.

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :
  • Zhang, J., Wang, X., Hao, D., Xie, B., Zhang, L., ...
  • Soleimani, Neysiani, B., and Babamir, S. M., "Methods of Feature ...
  • Runeson, P., Alexandersson, M., and Nyholm, O., "Detection of duplicate ...
  • Sun, C., Lo, D., Khoo, S. -C., and Jiang, J., ...
  • Soleimani Neysiani, B., and Babamir, S. M., "Improving Performance of ...
  • Banerjee, S., Cukic, B., and Adjeroh, D., "Automated duplicate bug ...
  • Soleimani Neysiani, B., and Babamir, S. M., "New Methodology of ...
  • Aggarwal, K., Rutgers, T., Timbers, F., Hindle, A., Greiner, R., ...
  • Aggarwal, K., Timbers, F., Rutgers, T., Hindle, A., Stroulia, E., ...
  • Soleimani Neysiani, B., and Babamir, S. M., "Automatic Typos Detection ...
  • Soleimani Neysiani, B., and Babamir, S. M., "Automatic Interconnected Lexical ...
  • Soleimani Neysiani, B., and Babamir, S. M., "Fast Language-Independent Correction ...
  • Soleimani Neysiani, B., and Babamir, S. M., "New labeled dataset ...
  • Soleimani Neysiani, B., and Babamir, S. M., "Effect of Typos ...
  • Soleimani Neysiani, B., and Babamir, S. M., "Duplicate Detection Models ...
  • Soleimani Neysiani, B., and Babamir, S. M., "Automatic Duplicate Bug ...
  • Hindle, A., "Stopping duplicate bug reports before they start with ...
  • Hindle, A., and Onuczko, C., "Preventing duplicate bug reports by ...
  • A New Feature Selection in Email Spam Detection by Particle Swarm Optimization and Fruit Fly Optimization Algorithms [مقاله ژورنالی]
  • Soleimani Neysiani, B., Doostali, S., Babamir, S. M., and Aminoroaya, ...
  • Banerjee, S., Syed, Z., Helmick, J., Culp, M., Ryan, K., ...
  • Yang, X., Lo, D., Xia, X., Bao, L., and Sun, ...
  • Lin, M.-J., Yang, C.-Z., Lee, C.-Y., and Chen, C.-C., "Enhancements ...
  • Budhiraja, A., Dutta, K., Reddy, R., and Shrivastava, M., "DWEN: ...
  • Lazar, A., Ritchey, S., and Sharif, B., "Improving the accuracy ...
  • Wang, S., Khomh, F., and Zou, Y., "Improving bug localization ...
  • Wang, X., Zhang, L., Xie, T., Anvik, J., and Sun, ...
  • Kim, S., Zimmermann, T., and Nagappan, N., "Crash graphs: An ...
  • Ebrahimi, N., Trabelsi, A., Islam, M. S., Hamou-Lhadj, A., and ...
  • Alipour, A., Hindle, A., and Stroulia, E., "A Contextual Approach ...
  • Nguyen, A. T., Nguyen, T. T., Nguyen, T. N., Lo, ...
  • Bagal, P. V., et al., "Duplicate bug report detection using ...
  • Koochekian Sabor, K., Hamou-Lhadj, A., and Larsson, A., "DURFEX: A ...
  • Deshmukh, J., Podder, S., Sengupta, S., and Dubash, N., "Towards ...
  • Ebrahimi Koopaei, N., "Machine Learning And Deep Learning Based Approaches ...
  • Xie, Q., Wen, Z., Zhu, J., Gao, C., and Zheng, ...
  • Aminoroaya, Z., Soleimani Neysiani, B., and Nadimi Shahraki, M. H., ...
  • Bettenburg, N., Premraj, R., Zimmermann, T., and Kim, S., "Duplicate ...
  • Sun, C., Lo, D., Wang, X., Jiang, J., and Khoo, ...
  • Tian, Y., Sun, C., and Lo, D., "Improved duplicate bug ...
  • Liu, K., Tan, H. B. K., and Chandramohan, M., "Has ...
  • Alipour, A., "A Contextual Approach Towards More Accurate Duplicate Bug ...
  • Feng, L., Song, L., Sha, C., and Gong, X., "Practical ...
  • Tsuruda, A., Manabe, Y., and Aritsugi, M., "Can We Detect ...
  • Sharma, A., and Sharma, S., "Bug Report Triaging Using Textual, ...
  • Hindle, A., Alipour, A., and Stroulia, E., "A contextual approach ...
  • Pasala, A., Guha, S., Agnihotram, G., Prateek B, S., and ...
  • Rakha, M. S., Shang, W., and Hassan, A. E., "Studying ...
  • Su, E., and Joshi, S., "Leveraging product relationships to generate ...
  • Soleimani Neysiani, B., Babamir, S. M., and Aritsugi, M., "Efficient ...
  • Kukkar, A., Mohana, R., Kumar, Y., Nayyar, A., Bilal, M., ...
  • Kim, T., and Yang, G., "Predicting Duplicate in Bug Report ...
  • Zhang, T., et al., "Duplicate Bug Report Detection: How Far ...
  • IngoRM., "Confidence values", RapidMiner. https://community.rapidminer.com/discussion/۱۷۰۵۸/confidence-values, accessed ۱۲/۱۰/۲۰۲۰, ۲۰۲۰ ...
  • Alipour, A., Hindle, A., Rutgers, T., Dawson, R., Timbers, F., ...
  • Šarić, F., Glavaš, G., Karan, M., Šnajder, J., and Bašić, ...
  • RapidMiner Studio (۹.۵.۱) RapidMiner Inc. [Online]. Available: rapidminer.com, , (۲۰۱۹) ...
  • Candel, A., Parmar, V., LeDell, E., and Arora, A., "Deep ...
  • Cook, D., "Practical machine learning with H۲O: powerful, scalable techniques ...
  • SSLBM: A New Fraud Detection Method Based on Semi- Supervised Learning [مقاله ژورنالی]
  • نمایش کامل مراجع