Predicate-based Filtering of XPath Expressions

Shuang Huo and Hans-Arno Jacobsen.

In IEEE International Conference on Data Engineering (ICDE), page 53, Atlanta, GA, April 2006.
Acceptance rate: 20%. Number of submissions: 456.

Abstract

The XML/XPath filtering problem has found wide-spread interest. In this paper, we propose a novel algorithm for solving it. Our approach encodes XPath expressions (XPEs) as ordered sets of predicates and translates XML documents into sets of tuples, which are evaluated over these predicates. Predicates representing overlapping portions of XPEs are stored and processed once, thus fully exploiting potential overlap in XPEs. We experimentally evaluate the performance of our algorithm, demonstrating its scalability to millions of XPEs, with matching performance in the millisecond range. We show interesting trade-offs to alternative approaches.

Download


Tags: content-based matching, content-based publish/subscribe, topss, xpath query, xml routing, x-topss