Discriminative Gene Selection Employing Linear Regression Model

Article ID

CSTSDE4DFZY

Discriminative Gene Selection Employing Linear Regression Model

Abid Hasan
Abid Hasan Islamic University of Technology (IUT), Dhaka, Bangladesh
Shaikh Jeeshan Kabeer
Shaikh Jeeshan Kabeer
Kamrul Hasan
Kamrul Hasan
Md. Abdul Mottalib
Md. Abdul Mottalib
DOI

Abstract

Microarray datasets enables the analysis of expression of thousands of genes across hundreds of samples. Usually classifiers do not perform well for large number of features (genes) as is the case of microarray datasets. That is why a small number of informative and discriminative features are always desirable for efficient classification. Many existing feature selection approaches have been proposed which attempts sample classification based on the analysis of gene expression values. In this paper a linear regression based feature selection algorithm for two class microarray datasets has been developed which divides the training dataset into two subtypes based on the class information. Using one of the classes as the base condition, a linear regression based model is developed. Using this regression model the divergence of each gene across the two classes are calculated and thus genes with higher divergence values are selected as important features from the second subtype of the training data. The classification performance of the proposed approach is evaluated with SVM, Random Forest and AdaBoost classifiers. Results show that the proposed approach provides better accuracy values compared to other existing approaches i.e. ReliefF, CFS, decision tree based attribute selector and attribute selection using correlation analysis.

Discriminative Gene Selection Employing Linear Regression Model

Microarray datasets enables the analysis of expression of thousands of genes across hundreds of samples. Usually classifiers do not perform well for large number of features (genes) as is the case of microarray datasets. That is why a small number of informative and discriminative features are always desirable for efficient classification. Many existing feature selection approaches have been proposed which attempts sample classification based on the analysis of gene expression values. In this paper a linear regression based feature selection algorithm for two class microarray datasets has been developed which divides the training dataset into two subtypes based on the class information. Using one of the classes as the base condition, a linear regression based model is developed. Using this regression model the divergence of each gene across the two classes are calculated and thus genes with higher divergence values are selected as important features from the second subtype of the training data. The classification performance of the proposed approach is evaluated with SVM, Random Forest and AdaBoost classifiers. Results show that the proposed approach provides better accuracy values compared to other existing approaches i.e. ReliefF, CFS, decision tree based attribute selector and attribute selection using correlation analysis.

Abid Hasan
Abid Hasan Islamic University of Technology (IUT), Dhaka, Bangladesh
Shaikh Jeeshan Kabeer
Shaikh Jeeshan Kabeer
Kamrul Hasan
Kamrul Hasan
Md. Abdul Mottalib
Md. Abdul Mottalib

No Figures found in article.

Abid Hasan. 2013. “. Global Journal of Computer Science and Technology – C: Software & Data Engineering GJCST-C Volume 13 (GJCST Volume 13 Issue C4): .

Download Citation

Journal Specifications

Crossref Journal DOI 10.17406/gjcst

Print ISSN 0975-4350

e-ISSN 0975-4172

Classification
Not Found
Article Matrices
Total Views: 9642
Total Downloads: 2434
2026 Trends
Research Identity (RIN)
Related Research
Our website is actively being updated, and changes may occur frequently. Please clear your browser cache if needed. For feedback or error reporting, please email [email protected]

Request Access

Please fill out the form below to request access to this research paper. Your request will be reviewed by the editorial or author team.
X

Quote and Order Details

Contact Person

Invoice Address

Notes or Comments

This is the heading

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.

High-quality academic research articles on global topics and journals.

Discriminative Gene Selection Employing Linear Regression Model

Abid Hasan
Abid Hasan Islamic University of Technology (IUT), Dhaka, Bangladesh
Shaikh Jeeshan Kabeer
Shaikh Jeeshan Kabeer
Kamrul Hasan
Kamrul Hasan
Md. Abdul Mottalib
Md. Abdul Mottalib

Research Journals