Text Categorization and Machine Learning Methods: Current State Of The Art

Article ID

CSTSDE05N02

Text Categorization and Machine Learning Methods: Current State Of The Art

Durga Bhavani Dasari
Durga Bhavani Dasari Jawaharlal nehru university - Hyderabad
Dr. Venu Gopala Rao. K
Dr. Venu Gopala Rao. K
DOI

Abstract

In this informative age, we find many documents are available in digital forms which need classification of the text. For solving this major problem present researchers focused on machine learning techniques: a general inductive process automatically builds a classifier by learning, from a set of pre classified documents, the characteristics of the categories. The main benefit of the present approach is consisting in the manual definition of a classifier by domain experts where effectiveness, less use of expert work and straightforward portability to different domains are possible. The paper examines the main approaches to text categorization comparing the machine learning paradigm and present state of the art. Various issues pertaining to three different text similarity problems, namely, semantic, conceptual and contextual are also discussed.

Text Categorization and Machine Learning Methods: Current State Of The Art

In this informative age, we find many documents are available in digital forms which need classification of the text. For solving this major problem present researchers focused on machine learning techniques: a general inductive process automatically builds a classifier by learning, from a set of pre classified documents, the characteristics of the categories. The main benefit of the present approach is consisting in the manual definition of a classifier by domain experts where effectiveness, less use of expert work and straightforward portability to different domains are possible. The paper examines the main approaches to text categorization comparing the machine learning paradigm and present state of the art. Various issues pertaining to three different text similarity problems, namely, semantic, conceptual and contextual are also discussed.

Durga Bhavani Dasari
Durga Bhavani Dasari Jawaharlal nehru university – Hyderabad
Dr. Venu Gopala Rao. K
Dr. Venu Gopala Rao. K

No Figures found in article.

Durga Bhavani Dasari. 2012. “. Global Journal of Computer Science and Technology – C: Software & Data Engineering GJCST-C Volume 12 (GJCST Volume 12 Issue C11): .

Download Citation

Journal Specifications

Crossref Journal DOI 10.17406/gjcst

Print ISSN 0975-4350

e-ISSN 0975-4172

Issue Cover
GJCST Volume 12 Issue C11
Pg. 37- 46
Classification
Not Found
Article Matrices
Total Views: 10141
Total Downloads: 2575
2026 Trends
Research Identity (RIN)
Related Research
Our website is actively being updated, and changes may occur frequently. Please clear your browser cache if needed. For feedback or error reporting, please email [email protected]

Request Access

Please fill out the form below to request access to this research paper. Your request will be reviewed by the editorial or author team.
X

Quote and Order Details

Contact Person

Invoice Address

Notes or Comments

This is the heading

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.

High-quality academic research articles on global topics and journals.

Text Categorization and Machine Learning Methods: Current State Of The Art

Durga Bhavani Dasari
Durga Bhavani Dasari Jawaharlal nehru university - Hyderabad
Dr. Venu Gopala Rao. K
Dr. Venu Gopala Rao. K

Research Journals