Search Swinburne Research Bank
Please use this identifier to cite or link to this item: http://hdl.handle.net/1959.3/158791
- Title
- XClean: providing valid spelling suggestions for XML keyword queries
- Author(s)
- Lu, Yifei; Wang, Wei; Li, Jianxin; Liu, Chengfei
- Abstract
- An important facility to aid keyword search on XML data is suggesting alternative queries when user queries contain typographical errors. Query suggestion thus can improve users’ search experience by avoiding returning empty result or results of poor qualities. In this paper, we study the problem of effectively and efficiently providing quality query suggestions for keyword queries on an XML document. We illustrate certain biases in previous work and propose a principled and general framework, XClean, based on the state-of-the-art language model. Compared with previous methods, XClean can accommodate different error models and XML keyword query semantics without losing rigor. Algorithms have been developed that compute the top-k suggestions efficiently. We performed an extensive experiment study using two large-scale real datasets. The experiment results demonstrate the effectiveness and efficiency of the proposed methods.
- Publication type
- Conference paper
- Research centre
- Swinburne University of Technology
- Source
- Proceedings of the 27th International Conference on Data Engineering (ICDE 2011), Hannover, Germany, 11-16 April 2011, pp. 661-672
- Publication year
- 2011
- FOR Code(s)
- 0804 Data Format
- Keyword(s)
- Keyword searching; Typographical errors; XClean; XML queries
- Publisher
- IEEE
- ISBN
- 9781424489589, 142448958X
- Publisher URL
- http://dx.doi.org/10.1109/ICDE.2011.5767847
- Copyright
- Copyright © 2011 IEEE. The accepted manuscript of the paper is reproduced here in accordance with the copyright policy of the publisher. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE.
- Research Projects
-
XML views of relational databases: semantics and update problems, Australian Research Council grant number DP0878405
Effective and efficient keyword search for relevant entities over XML data, Australian Research Council grant number DP110102407
Keyword search in structured and semi-structured databases, Australian Research Council grant number DP0987273
Efficient exact similarity join, Australian Research Council grant number DP0881779
- Full text

- Peer reviewed



