CIVILICA We Respect the Science
(ناشر تخصصی کنفرانسهای کشور / شماره مجوز انتشارات از وزارت فرهنگ و ارشاد اسلامی: ۸۹۷۱)

Step by Step from Machine Learning Algorithm to Making QSAR model: Present a Combined Feature Selection with a Case Study

عنوان مقاله: Step by Step from Machine Learning Algorithm to Making QSAR model: Present a Combined Feature Selection with a Case Study
شناسه ملی مقاله: IBIS07_178
منتشر شده در هفتمین همایش بیوانفورماتیک ایران در سال 1396
مشخصات نویسندگان مقاله:

Mazaher Maghsoudloo - University of Tehran, Kish International Campus, Iran
Masoud Arabfard - University of Tehran, Kish International Campus, Iran
Sajjad Gharaghani - Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran
Kaveh kavousi - Institute of Biochemistry and Biophysics (IBB), University of Tehran, Tehran, Iran

خلاصه مقاله:
Due to the large amount of data in various bioinformatics sciences, such as drug design, gene selection, QSAR, etc., the use of machine learning techniques such as feature selection has become an essential requirement for the construction of the model. Feature selection is one of the most important steps in pattern recognition, machine learning and data mining. The purpose of the feature selection is to select the most optimal subset of the feature from the entire space of the main features problem, so that, while reducing the dimensions, the descriptive accuracy of the machine s learning techniques can be achieved.Optimization algorithms, random search, evolution, etc. In selecting features, new and effective methods are used to find optimal solutions for problems. The randomness of these algorithms prevents them from falling into local optimal points. Therefore, the use of feature selection methods in various bioinformatics sciences is inevitable.In this paper, we have used a combination of classic and random methods to select important descriptors in a set of QSAR data in drug design. In this paper, the combination of Grey number theory methods and firefly algorithm has been used to select the feature, so that we first use the Grey number theory algorithm to rank the features belonging to our database.Then, using some of these features that have the highest priority and rank, as well as the random firefly algorithm, we choose the best feature among these priority characteristics. The combination of these methods suggests acceptable results, which can be used for feature selection algorithms in bioinformatics.

کلمات کلیدی:
QSAR, Feature Selection; Grey Number algorithm; Drug Design; Machine Learning

صفحه اختصاصی مقاله و دریافت فایل کامل: https://civilica.com/doc/712573/