A Novel Scheme for Improving Accuracy of KNN Classification Algorithm Based on the New Weighting Technique and Stepwise Feature Selection

سال انتشار: 1399
نوع سند: مقاله ژورنالی
زبان: انگلیسی
مشاهده: 139

فایل این مقاله در 15 صفحه با فرمت PDF قابل دریافت می باشد

این مقاله در بخشهای موضوعی زیر دسته بندی شده است:

استخراج به نرم افزارهای پژوهشی:

لینک ثابت به این مقاله:

شناسه ملی سند علمی:

JR_JITM-12-4_004

تاریخ نمایه سازی: 25 بهمن 1400

چکیده مقاله:

K nearest neighbor algorithm is one of the most frequently used techniques in data mining for its integrity and performance. Though the KNN algorithm is highly effective in many cases, it has some essential deficiencies, which affects the classification accuracy of the algorithm. First, the effectiveness of the algorithm is affected by redundant and irrelevant features. Furthermore, this algorithm does not consider the differences between samples, which led the algorithm to have inaccurate predictions. In this paper, we proposed a novel scheme for improving the accuracy of the KNN classification algorithm based on the new weighting technique and stepwise feature selection. First, we used a stepwise feature selection method to eliminate irrelevant features and select highly correlated features with the class category. Then a new weighting method was proposed to give authority value to each sample in train dataset based on neighbor categories and Euclidean distances. This weighting approach gives a higher preference to samples that have neighbors with close Euclidean distance while they are in the same category, which can effectively increase the classification accuracy of the algorithm. We evaluated the accuracy rate of the proposed method and analyzed it with the traditional KNN algorithm and some similar works with the use of five real-world UCI datasets. The experiment results determined that the proposed scheme (denoted by WAD-KNN) performed better than the traditional KNN algorithm and considered approaches with the improvement of approximately ۱۰% accuracy.

نویسندگان

Sheikhi

MSc, Department of Computer, Gorgan Branch, Islamic Azad University, Gorgan, Iran.

Kheirabadi

Assistant Prof., Department of Computer, Gorgan Branch, Islamic Azad University, Gorgan, Iran.

Bazzazi

Assistant Prof., Department of Computer, Gorgan Branch, Islamic Azad University, Gorgan, Iran.

مراجع و منابع این مقاله:

لیست زیر مراجع و منابع استفاده شده در این مقاله را نمایش می دهد. این مراجع به صورت کاملا ماشینی و بر اساس هوش مصنوعی استخراج شده اند و لذا ممکن است دارای اشکالاتی باشند که به مرور زمان دقت استخراج این محتوا افزایش می یابد. مراجعی که مقالات مربوط به آنها در سیویلیکا نمایه شده و پیدا شده اند، به خود مقاله لینک شده اند :
  • A Grouping Hotel Recommender System Based on Deep Learning and Sentiment Analysis [مقاله ژورنالی]
  • Alpaydin, E. (۱۹۹۷). Voting over multiple condensed nearest neighbors. In ...
  • Angiulli, F. (۲۰۰۵, August). Fast condensed nearest neighbor rule. In ...
  • Bagui, S. C., Bagui, S., Pal, K., & Pal, N. ...
  • Bailey, T. (۱۹۷۸). A note on distance-weighted k-nearest neighbor rules. ...
  • Biswas, N., Chakraborty, S., Mullick, S. S., & Das, S. ...
  • Cheng, Y., Chen, K., Sun, H., Zhang, Y., & Tao, ...
  • Dua, D., & Graff, C. (۲۰۱۷). UCI machine learning repository ...
  • Gates, G. (۱۹۷۲). The reduced nearest neighbor rule (Corresp.). IEEE ...
  • Gowda, K., & Krishna, G. (۱۹۷۹). The condensed nearest neighbor ...
  • Guo, G., Wang, H., Bell, D., Bi, Y., & Greer, ...
  • Kafaf, D. A., Kim, D. K., & Lu, L. (۲۰۱۷). ...
  • Kotenko, I., Saenko, I., &Branitskiy, A. (۲۰۱۸). Framework for Mobile ...
  • Kumar, M., Rath, N. K., &Rath, S. K. (۲۰۱۶). Analysis ...
  • Lin, W. C., Ke, S. W., & Tsai, C. F. ...
  • Pan, Z., Wang, Y., & Ku, W. (۲۰۱۷). A new ...
  • Parvin, H., Alizadeh, H., &Minati, B. (۲۰۱۰). A modification on ...
  • Investigating the Role of Code Smells in Preventive Maintenance [مقاله ژورنالی]
  • Serpen, G., &Aghaei, E. (۲۰۱۸). Host-based misuse intrusion detection using ...
  • Sun, C., Yao, C., Shen, L., & Yu, X. (۲۰۱۶). ...
  • Wu, X., Yang, J., & Wang, S. (۲۰۱۸). Tea category ...
  • Zeng, Y., Yang, Y., & Zhao, L. (۲۰۰۹). Pseudo nearest ...
  • Zhao, M., & Chen, J. (۲۰۱۶). Improvement and comparison of ...
  • نمایش کامل مراجع