A Frame Work for Text Mining using Learned Information Extraction System

α
Sathish Kuppani
Sathish Kuppani
σ
M.Vasavi
M.Vasavi
α Sri Venkateswara University Sri Venkateswara University

Send Message

To: Author

A Frame Work for Text Mining using Learned Information Extraction System

Article Fingerprint

ReserarchID

CSTSDE4CLAT

A Frame Work for Text Mining using Learned Information Extraction System Banner

AI TAKEAWAY

Connecting with the Eternal Ground
  • English
  • Afrikaans
  • Albanian
  • Amharic
  • Arabic
  • Armenian
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Bosnian
  • Bulgarian
  • Catalan
  • Cebuano
  • Chichewa
  • Chinese (Simplified)
  • Chinese (Traditional)
  • Corsican
  • Croatian
  • Czech
  • Danish
  • Dutch
  • Esperanto
  • Estonian
  • Filipino
  • Finnish
  • French
  • Frisian
  • Galician
  • Georgian
  • German
  • Greek
  • Gujarati
  • Haitian Creole
  • Hausa
  • Hawaiian
  • Hebrew
  • Hindi
  • Hmong
  • Hungarian
  • Icelandic
  • Igbo
  • Indonesian
  • Irish
  • Italian
  • Japanese
  • Javanese
  • Kannada
  • Kazakh
  • Khmer
  • Korean
  • Kurdish (Kurmanji)
  • Kyrgyz
  • Lao
  • Latin
  • Latvian
  • Lithuanian
  • Luxembourgish
  • Macedonian
  • Malagasy
  • Malay
  • Malayalam
  • Maltese
  • Maori
  • Marathi
  • Mongolian
  • Myanmar (Burmese)
  • Nepali
  • Norwegian
  • Pashto
  • Persian
  • Polish
  • Portuguese
  • Punjabi
  • Romanian
  • Russian
  • Samoan
  • Scots Gaelic
  • Serbian
  • Sesotho
  • Shona
  • Sindhi
  • Sinhala
  • Slovak
  • Slovenian
  • Somali
  • Spanish
  • Sundanese
  • Swahili
  • Swedish
  • Tajik
  • Tamil
  • Telugu
  • Thai
  • Turkish
  • Ukrainian
  • Urdu
  • Uzbek
  • Vietnamese
  • Welsh
  • Xhosa
  • Yiddish
  • Yoruba
  • Zulu

Abstract

Text mining is a very exciting research area as it tries to discover knowledge from unstructured texts. These texts can be found on a computer desktop, intranets and the internet. The aim of this paper is to give an overview of text mining in the contexts of its techniques, application domains and the most challenging issue. The Learned Information Extraction (LIE) is about locating specific items in natural-language documents. This paper presents a framework for text mining, called DTEX (Discovery Text Extraction), using a learned information extraction system to transform text into more structured data which is then mined for interesting relationships. The initial version of DTEX integrates an LIE module acquired by an LIE learning system, and a standard rule induction module. In addition, rules mined from a database extracted from a corpus of texts are used to predict additional information to extract from future documents, thereby improving the recall of the underlying extraction system. Applying these techniques best results are presented to a corpus of computer job announcement postings from an Internet newsgroup.

References

37 Cites in Article
  1. R Agrawal,R Srikant (1994). Fast algorithms for mining association rules.
  2. R Baeza-Yates,B Ribeiro-Neto (1999). Modern Information RetrLIEval.
  3. S Basu,R Mooney,K Pasupuleti,J Ghosh (2001). Evaluating the novelty of text-mined rules using lexical knowledge.
  4. M Berry (2003). Third IEEE International Conference on Data Mining.
  5. M Califf (1999). Papers from the Sixteenth National Conference on Artificial Intelligence (AAAI-99) Workshop on Machine Learning for Information Extraction.
  6. M Califf,R Mooney (1999). Relational learning of pattern-match rules for information extraction.
  7. C Cardlie (1997). Empirical methods in information extraction.
  8. C Cardlie,R Mooney (1999). Machine learning and natural language (Introduction to special issue on natural language learning).
  9. F Ciravegna,N Kushmerick (2003). Papers from the 14th European Conference on Machine Learning(ECML-2003) and the 7th European Conference on Principles and Practice of Knowledge Discovery in Databases(PKDD-2003) Workshop on Adaptive Text Extraction and Mining.
  10. W Cohen (1995). Fast effective rule induction.
  11. W Cohen (1996). Learning to classify English text with ILP methods.
  12. W Cohen (2003). Improving a page classifLIEr with anchor extraction and link analysis.
  13. (1998). Proceedings of the Seventh Message Understanding Evaluation and Conference (MUC-98).
  14. Ronen Feldman,Moshe Fresko,Yakkov Kinar,Yehuda Lindell,Orly Liphstat,Martin Rajman,Yonatan Schler,Oren Zamir (1998). Text mining at the term level.
  15. D Freitag,N Kushmerick (2000). Boosted wrapper induction.
  16. R Ghani,A Fano (2002). Using text mining to infer semantic attirbutes for retail data mining.
  17. R Ghani,R Jones,D Mladenic´,K Nigam,S Slattery (2016). Data mining on symbolic knowledge Year.
  18. C A Frame (2000). Work for Text Mining using Learned Information Extraction System extracted from the Web.
  19. M Grobelnik (2001). Proceedings of LIEEE International Conference on Data Mining (ICDM2001) Workshop on Text Mining (TextDM'2001).
  20. M Grobelnik (2003). Proceedings of the Eighteenth International Joint Conference on Artificial Intelligence(IJCAI-2003) Workshop on Text Mining and Link Analysis (TextLink-2003).
  21. J Han,M Kamber (2000). Data Mining: Concepts and Techniques.
  22. Marti Hearst (1999). Untangling text data mining.
  23. Marti Hearst (2003). Text Data Mining.
  24. N Kushmerick (2001). Proceedings of the Seventeenth International Joint Conference on Artificial Intelligence (IJCAI-2001) Workshop on Adaptive Text Extraction and Mining.
  25. Stanley Loh,Leandro Wives,José De Oliveira (2000). Concept-based knowledge discovery in texts extracted from the Web.
  26. A Mccallum,D Jensen (2003). A note on the unification of information extraction and data mining using conditional-probability, relational models.
  27. K Mccallum,Nigam (1998). A comparison of event models for naive Bayes text classification.
  28. D Mladenic´ (2000). Proceedings of the Sixth International Conference on Knowledge Discovery and Data Mining (KDD-2000) Workshop on Text Mining.
  29. R Mooney,L Roy (2000). Content-based book recommending using learning for text categorization.
  30. U Nahm,R Mooney (2000). A mutually beneficial integration of data mining and information extraction.
  31. U Nahm,R Mooney (2000). Using information extraction to aid the discovery of prediction rules from texts.
  32. Un Nahm,Raymond Mooney (2001). Mining soft-matching association rules.
  33. U Nahm,R Mooney (2002). Mining soft-matching association rules.
  34. J Plierre Mining knowledge from text collections using automatically generated metadata.
  35. (2002). Practical Aspects of Knowledge Management.
  36. J Quinlan C4.5: Programs for Machine Learning.
  37. Morgan Kaufmann (1993). Unknown Title.

Funding

No external funding was declared for this work.

Conflict of Interest

The authors declare no conflict of interest.

Ethical Approval

No ethics committee approval was required for this article type.

Data Availability

Not applicable for this article.

How to Cite This Article

Sathish Kuppani. 2016. \u201cA Frame Work for Text Mining using Learned Information Extraction System\u201d. Global Journal of Computer Science and Technology - C: Software & Data Engineering GJCST-C Volume 16 (GJCST Volume 16 Issue C3): .

Download Citation

Issue Cover
GJCST Volume 16 Issue C3
Pg. 41- 50
Journal Specifications

Crossref Journal DOI 10.17406/gjcst

Print ISSN 0975-4350

e-ISSN 0975-4172

Keywords
Classification
GJCST-C Classification: I.2.4, D.3.3
Version of record

v1.2

Issue date

July 1, 2016

Language
en
Experiance in AR

Explore published articles in an immersive Augmented Reality environment. Our platform converts research papers into interactive 3D books, allowing readers to view and interact with content using AR and VR compatible devices.

Read in 3D

Your published article is automatically converted into a realistic 3D book. Flip through pages and read research papers in a more engaging and interactive format.

Article Matrices
Total Views: 7450
Total Downloads: 1862
2026 Trends
Related Research

Published Article

Text mining is a very exciting research area as it tries to discover knowledge from unstructured texts. These texts can be found on a computer desktop, intranets and the internet. The aim of this paper is to give an overview of text mining in the contexts of its techniques, application domains and the most challenging issue. The Learned Information Extraction (LIE) is about locating specific items in natural-language documents. This paper presents a framework for text mining, called DTEX (Discovery Text Extraction), using a learned information extraction system to transform text into more structured data which is then mined for interesting relationships. The initial version of DTEX integrates an LIE module acquired by an LIE learning system, and a standard rule induction module. In addition, rules mined from a database extracted from a corpus of texts are used to predict additional information to extract from future documents, thereby improving the recall of the underlying extraction system. Applying these techniques best results are presented to a corpus of computer job announcement postings from an Internet newsgroup.

Our website is actively being updated, and changes may occur frequently. Please clear your browser cache if needed. For feedback or error reporting, please email [email protected]

Request Access

Please fill out the form below to request access to this research paper. Your request will be reviewed by the editorial or author team.
X

Quote and Order Details

Contact Person

Invoice Address

Notes or Comments

This is the heading

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.

High-quality academic research articles on global topics and journals.

A Frame Work for Text Mining using Learned Information Extraction System

M.Vasavi
M.Vasavi
Sathish Kuppani
Sathish Kuppani Sri Venkateswara University

Research Journals