An Effective XML Keyword Search with User Search Intention over XML Documents

α
Mr. Pradeep Kumar Reddy Gade
Mr. Pradeep Kumar Reddy Gade
α Gurunanak Engineering College

Send Message

To: Author

An Effective XML Keyword Search with User Search Intention over XML Documents

Article Fingerprint

ReserarchID

W47HY

An Effective XML Keyword Search with User Search Intention over XML Documents Banner

AI TAKEAWAY

Connecting with the Eternal Ground
  • English
  • Afrikaans
  • Albanian
  • Amharic
  • Arabic
  • Armenian
  • Azerbaijani
  • Basque
  • Belarusian
  • Bengali
  • Bosnian
  • Bulgarian
  • Catalan
  • Cebuano
  • Chichewa
  • Chinese (Simplified)
  • Chinese (Traditional)
  • Corsican
  • Croatian
  • Czech
  • Danish
  • Dutch
  • Esperanto
  • Estonian
  • Filipino
  • Finnish
  • French
  • Frisian
  • Galician
  • Georgian
  • German
  • Greek
  • Gujarati
  • Haitian Creole
  • Hausa
  • Hawaiian
  • Hebrew
  • Hindi
  • Hmong
  • Hungarian
  • Icelandic
  • Igbo
  • Indonesian
  • Irish
  • Italian
  • Japanese
  • Javanese
  • Kannada
  • Kazakh
  • Khmer
  • Korean
  • Kurdish (Kurmanji)
  • Kyrgyz
  • Lao
  • Latin
  • Latvian
  • Lithuanian
  • Luxembourgish
  • Macedonian
  • Malagasy
  • Malay
  • Malayalam
  • Maltese
  • Maori
  • Marathi
  • Mongolian
  • Myanmar (Burmese)
  • Nepali
  • Norwegian
  • Pashto
  • Persian
  • Polish
  • Portuguese
  • Punjabi
  • Romanian
  • Russian
  • Samoan
  • Scots Gaelic
  • Serbian
  • Sesotho
  • Shona
  • Sindhi
  • Sinhala
  • Slovak
  • Slovenian
  • Somali
  • Spanish
  • Sundanese
  • Swahili
  • Swedish
  • Tajik
  • Tamil
  • Telugu
  • Thai
  • Turkish
  • Ukrainian
  • Urdu
  • Uzbek
  • Vietnamese
  • Welsh
  • Xhosa
  • Yiddish
  • Yoruba
  • Zulu

Abstract

The extreme success of web search engines makes keyword search the most popular search model for ordinary users. Keyword search on XML is a user friendly way to query XML databases since it allows users to pose queries without the knowledge of complex query languages and the database schema. The three main challenges faces in XML keyword search: 1) Identify the user search intention, i.e., identify the XML node types that users want to search for and search via. 2) Resolve keyword ambiguity problems: a keyword can appear as both a tag name and a text value of some node; a keyword can appear as the text values of different XML node types and carry different meanings; a keyword can appear as the tag name of different XML node types with different meanings. 3) As the search results are sub trees of the XML documents, new scoring function is needed to estimate its relevance to a given query. However, existing methods cannot resolve these challenges, thus return low result quality in term of query relevance. In this paper, we propose an IR-style approach which basically utilizes the statistics of underlying XML data to address these challenges. We first propose specific guidelines that a search engine should meet in both search intention identification and relevance oriented rankingfor search results over XML documents. Then, based on theseguidelines, we design novel formulae to identify the search fornodes and search via nodes of a query, and present a novelXML TF*IDF ranking strategy to rank the individual matches ofall possible search intentions over XML documents.

References

21 Cites in Article
  1. Sihem Amer-Yahia,Laks Lakshmanan,Shashank Pandit (2004). FleXPath.
  2. David Carmel,Yoelle Maarek,Matan Mandelbrod,Yosi Mass,Aya Soffer (2003). Searching XML documents via XML fragments.
  3. Sara Cohen,Yaron Kanza,Benny Kimelfeld,Yehoshua Sagiv (2005). Interconnection semantics for keyword search in XML.
  4. N Fuhr,K Großjohann (2001). XIRQL: A Query Language for Information Retrieval in XML Documents.
  5. R Jones,B Rey,O Madani,W Greiner (2006). Generating Query Substitutions.
  6. V Kacholia,S Pandit,S Chakrabarti,S Sudarshan,R Desai,H Karambelkar (2005). Bidirectional Expansion for Keyword Search on Graph Databases.
  7. V Hristidis,N Koudas,Y Papakonstantinou,D Srivastava (2006). Keyword Proximity Search in XML Trees.
  8. V Hristidis,Y Papakonstantinou,A Balmin (2003). Keyword Proximity Search on XML Graphs.
  9. Hao He,Haixun Wang,Jun Yang,Philip Yu (2007). BLINKS.
  10. M Ley (2009). Unknown Title.
  11. Guoliang Li,Jianhua Feng,Jianyong Wang,Lizhu Zhou (2007). Effective keyword search for valuable lcas over xml documents.
  12. G Li,B Ooi,J Feng,J Wang,L Zhou (2008). Ease: Efficient and Adaptive Keyword Search on Unstructured, Semi-Structured and Structured Data.
  13. Y Li,C Yu,H Jagadish (2004). Schema-Free XQuery.
  14. Z Liu,Y Chen (2007). Identifying Meaningful Return Information for XML Keyword Search.
  15. Ziyang Liu,Yi Cher (2008). Reasoning and identifying relevant matches for XML keyword search.
  16. G Salton,M Mcgill (1986). Introduction to Modern Information Retrieval.
  17. A Schmidt,M Kersten,M Windhouwer (2001). Querying XML Documents Made Easy: Nearest Concept Queries.
  18. C Sun,C Chan,A Goenka (2007). Multiway SLCA-Based Keyword Search in XML Data.
  19. Anja Theobald,Gerhard Weikum (2002). The Index-Based XXL Search Engine for Querying XML Data with Relevance Ranking.
  20. Yu Xu,Yannis Papakonstantinou (2005). Efficient keyword search for smallest LCAs in XML databases.
  21. Zhifeng Bao,Jiaheng Lu,Tok Ling,Senior Member,Ieee,Bo Chen (2010). Towards an Effective XML Keyword Search.

Funding

No external funding was declared for this work.

Conflict of Interest

The authors declare no conflict of interest.

Ethical Approval

No ethics committee approval was required for this article type.

Data Availability

Not applicable for this article.

How to Cite This Article

Mr. Pradeep Kumar Reddy Gade. 1970. \u201cAn Effective XML Keyword Search with User Search Intention over XML Documents\u201d. Unknown Journal GJCST Volume 11 (GJCST Volume 11 Issue 16): .

Download Citation

Issue Cover
GJCST Volume 11 Issue 16
Pg. 41- 45
Journal Specifications
Version of record

v1.2

Issue date

September 7, 2011

Language
en
Experiance in AR

Explore published articles in an immersive Augmented Reality environment. Our platform converts research papers into interactive 3D books, allowing readers to view and interact with content using AR and VR compatible devices.

Read in 3D

Your published article is automatically converted into a realistic 3D book. Flip through pages and read research papers in a more engaging and interactive format.

Article Matrices
Total Views: 20314
Total Downloads: 10932
2026 Trends
Related Research

Published Article

The extreme success of web search engines makes keyword search the most popular search model for ordinary users. Keyword search on XML is a user friendly way to query XML databases since it allows users to pose queries without the knowledge of complex query languages and the database schema. The three main challenges faces in XML keyword search: 1) Identify the user search intention, i.e., identify the XML node types that users want to search for and search via. 2) Resolve keyword ambiguity problems: a keyword can appear as both a tag name and a text value of some node; a keyword can appear as the text values of different XML node types and carry different meanings; a keyword can appear as the tag name of different XML node types with different meanings. 3) As the search results are sub trees of the XML documents, new scoring function is needed to estimate its relevance to a given query. However, existing methods cannot resolve these challenges, thus return low result quality in term of query relevance. In this paper, we propose an IR-style approach which basically utilizes the statistics of underlying XML data to address these challenges. We first propose specific guidelines that a search engine should meet in both search intention identification and relevance oriented rankingfor search results over XML documents. Then, based on theseguidelines, we design novel formulae to identify the search fornodes and search via nodes of a query, and present a novelXML TF*IDF ranking strategy to rank the individual matches ofall possible search intentions over XML documents.

Our website is actively being updated, and changes may occur frequently. Please clear your browser cache if needed. For feedback or error reporting, please email [email protected]

Request Access

Please fill out the form below to request access to this research paper. Your request will be reviewed by the editorial or author team.
X

Quote and Order Details

Contact Person

Invoice Address

Notes or Comments

This is the heading

Lorem ipsum dolor sit amet, consectetur adipiscing elit. Ut elit tellus, luctus nec ullamcorper mattis, pulvinar dapibus leo.

High-quality academic research articles on global topics and journals.

An Effective XML Keyword Search with User Search Intention over XML Documents

Mr. Pradeep Kumar Reddy Gade
Mr. Pradeep Kumar Reddy Gade Gurunanak Engineering College

Research Journals