Semantic Clustering of Genomic Documents using GO Terms as Feature Set

Article ID

CSTSDE6IBVU

Dr. B.L.Shivakumar

V.Bhuvaneswari, Bharathiar University

Abstract

Biological databases generate huge volumes of genomic and proteomic data. Researchers use the sequence information to find similarities among genes and proteins and to retrieve other related information. A genomic sequence database holds a large number of attributes as annotations that describe the sequences, represented in XML format. A proper mechanism is needed to group these documents for information retrieval. Data mining techniques such as clustering and classification can be used to group the documents. The objective of this paper is to analyze the set of keywords that can serve as features for grouping the documents semantically. The paper focuses on clustering genomic documents based on both structural and content similarity. Structural similarity is computed from the structural paths of the documents, and semantic similarity is then computed for the structurally similar documents. We propose a methodology that clusters genomic documents using sequence attributes, without using the sequence data itself. The sequence attributes are analyzed using filter-based feature selection methods to find the relevant feature set for grouping similar documents. Based on the attribute ranking, we cluster similar documents using an all-keywords based approach (KBA) and a GO terms based approach (GOTA). The clusters produced by the two approaches are validated by inferring their biological meaning using Gene Ontology. The results indicate that the all-keywords based approach grouped documents according to the semantic meaning of Gene Ontology terms. The GO terms based approach grouped a larger number of semantically relevant documents without considering any other keywords, which reduces the complexity of the attributes considered. We claim that GO terms alone can be used as the feature set to group genomic documents with high similarity.
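The idea of grouping annotation documents by GO terms alone can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not name a similarity measure or clustering algorithm, so Jaccard similarity over GO identifier sets, single-link grouping, the 0.5 threshold, and the XML element names (`entry`, `dbReference`, `keyword`) are all assumptions chosen for illustration.

```python
import re
import xml.etree.ElementTree as ET

# GO identifiers have the form "GO:" followed by seven digits.
GO_ID = re.compile(r"GO:\d{7}")


def go_terms(xml_text):
    """Collect every GO identifier found in element text or attribute values."""
    root = ET.fromstring(xml_text)
    terms = set()
    for elem in root.iter():
        values = [elem.text or ""] + list(elem.attrib.values())
        for value in values:
            terms.update(GO_ID.findall(value))
    return terms


def jaccard(a, b):
    """Jaccard similarity of two sets (assumed measure, not from the paper)."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def cluster(docs, threshold=0.5):
    """Single-link grouping: two documents share a cluster when a chain of
    pairwise similarities >= threshold connects them."""
    names = list(docs)
    features = {name: go_terms(docs[name]) for name in names}
    parent = {name: name for name in names}

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for i, a in enumerate(names):
        for b in names[i + 1:]:
            if jaccard(features[a], features[b]) >= threshold:
                parent[find(a)] = find(b)

    groups = {}
    for name in names:
        groups.setdefault(find(name), []).append(name)
    return sorted(sorted(group) for group in groups.values())


# Hypothetical annotation documents; the element names are illustrative only.
example_docs = {
    "doc1": '<entry><dbReference type="GO" id="GO:0005524"/>'
            '<dbReference type="GO" id="GO:0004672"/></entry>',
    "doc2": '<entry><dbReference type="GO" id="GO:0005524"/>'
            '<dbReference type="GO" id="GO:0004672"/>'
            '<keyword>Kinase</keyword></entry>',
    "doc3": '<entry><dbReference type="GO" id="GO:0003677"/></entry>',
}

if __name__ == "__main__":
    print(cluster(example_docs))
```

In this sketch `doc1` and `doc2` fall into one cluster because they carry identical GO term sets, even though `doc2` has an extra keyword; `doc3` shares no GO term with either and stays alone. Non-GO keywords are ignored entirely, which mirrors the reduced attribute set the abstract argues for.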


How to Cite

V.Bhuvaneswari. 2012. “Semantic Clustering of Genomic Documents using GO Terms as Feature Set”. Global Journal of Computer Science and Technology – C: Software & Data Engineering GJCST-C Volume 12 (GJCST Volume 12 Issue C10): 13-19.

Journal Specifications

Crossref Journal DOI 10.17406/gjcst

Print ISSN 0975-4350

e-ISSN 0975-4172

GJCST Volume 12 Issue C10
Pg. 13-19

Submission Received December 13, 2011
Accepted December 31, 2011
Published January 15, 2012

Keywords

Attributes, Feature Set, GO Terms, Semantic Clustering, XML

Article Matrices

Total Score: 107

Country: India

Subject: Global Journal of Computer Science and Technology - C: Software & Data Engineering

Authors: Dr. B.L.Shivakumar, V.Bhuvaneswari (PhD/Dr. count: 1)

Publish Date: 2012 06, Thu
