Character Segmentation for Telugu Image Document using Multiple Histogram Projections

1
Prof E.Sreenivasa Reddy
Prof E.Sreenivasa Reddy
2
N.Anupama
N.Anupama
3
Ch.Rupa
Ch.Rupa Ph.D.
1 Acharya Nagarjuna University,Guntur, AP, India.
3 Department of Computer Science and Systems Engineering Andhra University

Send Message

To: Author

GJCST Volume 13 Issue F5

Article Fingerprint

ReserarchID

CSTGV9KWE1

Character Segmentation for Telugu Image Document using Multiple Histogram Projections Banner
  • English
  • Afrikaans
  • Albanian
  • Amharic
  • Arabic
  • Armenian
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Bosnian
  • Bulgarian
  • Catalan
  • Cebuano
  • Chichewa
  • Chinese (Simplified)
  • Chinese (Traditional)
  • Corsican
  • Croatian
  • Czech
  • Danish
  • Dutch
  • Esperanto
  • Estonian
  • Filipino
  • Finnish
  • French
  • Frisian
  • Galician
  • Georgian
  • German
  • Greek
  • Gujarati
  • Haitian Creole
  • Hausa
  • Hawaiian
  • Hebrew
  • Hindi
  • Hmong
  • Hungarian
  • Icelandic
  • Igbo
  • Indonesian
  • Irish
  • Italian
  • Japanese
  • Javanese
  • Kannada
  • Kazakh
  • Khmer
  • Korean
  • Kurdish (Kurmanji)
  • Kyrgyz
  • Lao
  • Latin
  • Latvian
  • Lithuanian
  • Luxembourgish
  • Macedonian
  • Malagasy
  • Malay
  • Malayalam
  • Maltese
  • Maori
  • Marathi
  • Mongolian
  • Myanmar (Burmese)
  • Nepali
  • Norwegian
  • Pashto
  • Persian
  • Polish
  • Portuguese
  • Punjabi
  • Romanian
  • Russian
  • Samoan
  • Scots Gaelic
  • Serbian
  • Sesotho
  • Shona
  • Sindhi
  • Sinhala
  • Slovak
  • Slovenian
  • Somali
  • Spanish
  • Sundanese
  • Swahili
  • Swedish
  • Tajik
  • Tamil
  • Telugu
  • Thai
  • Turkish
  • Ukrainian
  • Urdu
  • Uzbek
  • Vietnamese
  • Welsh
  • Xhosa
  • Yiddish
  • Yoruba
  • Zulu

TEXT line segmentation is one of the major component of document image analysis. Text line segmentation is necessary to detect all text regions in the document image. In this paper we propose an algorithm based on multiple histogram projections using morphological operators to extract features of the image. Horizontal projection is performed on the text image, and then line segments are identified by the peaks in the horizontal projection. Threshold is applied to divide the text image into segments. False lines are eliminated using another threshold. Vertical histogram projections are used for the line segments and decomposed into words using threshold and further decomposed to characters. This approach provides best performance based on the experimental results such as Detection rate DR (98%) and Recognition Accuracy RA (98%).

16 Cites in Articles

References

  1. C Lakshmi,C Patvardhan (2004). An optical character recognition system for printed Telugu text.
  2. Mudit Agrawal,David Doermann (2009). Voronoi++: A Dynamic Page Segmentation Approach Based on Voronoi and Docstrum Features.
  3. K Sesh Kumar,Anoop Namboodiri,C Jawahar (2006). Learning Segmentation of Documents with Complex Scripts.
  4. B Sagar,G Shobha,P Kumar (2008). Character segmentation algorithms for Kannada optical character recognition.
  5. U Pal,B Chaudhuri (2004). Indian script character recognition: a survey.
  6. B Chaudhuri,U Pal (1998). A complete printed Bangla OCR system.
  7. Vijay Kumar,Pankaj Sengar (2010). Segmentation of Printed Text in Devanagari Script and Gurmukhi Script.
  8. U Pal,Sagarika Datta (2003). Segmentation of Bangla unconstrained handwritten text.
  9. K Wong,R Casey,F Wahl (1982). Document Analysis System.
  10. L Likforman-Sulem,A Zahour,B Taconet (2007). Text line Segmentation of Historical Documents: a Survey.
  11. U Pal,P Roy (2004). Multi-oriented and curved text lines extraction from Indian documents.
  12. U Pal,B Chaudhuri (2004). Indian script character recognition: a survey.
  13. B Anuradhaand,Arun Agarwal,C Rao (2008). An Overview of OCR Research in Indian Scripts.
  14. U Pal,B Chaudhuri (2004). Indian script character recognition: a survey.
  15. N Otsu (1979). A threshold selection method from gray-level histograms.
  16. I Phillips,A Chhabra (1999). Empirical performance evaluation of graphics recognition systems.

Funding

No external funding was declared for this work.

Conflict of Interest

The authors declare no conflict of interest.

Ethical Approval

No ethics committee approval was required for this article type.

Data Availability

Not applicable for this article.

Prof E.Sreenivasa Reddy. 2013. \u201cCharacter Segmentation for Telugu Image Document using Multiple Histogram Projections\u201d. Global Journal of Computer Science and Technology - F: Graphics & Vision GJCST-F Volume 13 (GJCST Volume 13 Issue F5): .

Download Citation

Issue Cover
GJCST Volume 13 Issue F5
Pg. 11- 15
Journal Specifications

Crossref Journal DOI 10.17406/gjcst

Print ISSN 0975-4350

e-ISSN 0975-4172

Classification
Not Found
Version of record

v1.2

Issue date

June 29, 2013

Language

English

Experiance in AR

The methods for personal identification and authentication are no exception.

Read in 3D

The methods for personal identification and authentication are no exception.

Article Matrices
Total Views: 9500
Total Downloads: 2511
2026 Trends
Research Identity (RIN)
Related Research

Published Article

TEXT line segmentation is one of the major component of document image analysis. Text line segmentation is necessary to detect all text regions in the document image. In this paper we propose an algorithm based on multiple histogram projections using morphological operators to extract features of the image. Horizontal projection is performed on the text image, and then line segments are identified by the peaks in the horizontal projection. Threshold is applied to divide the text image into segments. False lines are eliminated using another threshold. Vertical histogram projections are used for the line segments and decomposed into words using threshold and further decomposed to characters. This approach provides best performance based on the experimental results such as Detection rate DR (98%) and Recognition Accuracy RA (98%).

Our website is actively being updated, and changes may occur frequently. Please clear your browser cache if needed. For feedback or error reporting, please email [email protected]
×

This Page is Under Development

We are currently updating this article page for a better experience.

Request Access

Please fill out the form below to request access to this research paper. Your request will be reviewed by the editorial or author team.
X

Quote and Order Details

Contact Person

Invoice Address

Notes or Comments

This is the heading

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.

High-quality academic research articles on global topics and journals.

Character Segmentation for Telugu Image Document using Multiple Histogram Projections

N.Anupama
N.Anupama
Ch.Rupa
Ch.Rupa Department of Computer Science and Systems Engineering Andhra University
Prof E.Sreenivasa Reddy
Prof E.Sreenivasa Reddy Acharya Nagarjuna University,Guntur, AP, India.

Research Journals