Fuzzy querying of semistructured data

Campi, A.; Guinea, S.; Spoletini, Paola

Querying XML data is a well-explored topic thanks to powerful query languages such as XPath and XQuery. Both were designed to support the evaluation of binary predicates, which can be proven to be a limited approach to effective querying of XML data. In this paper, a fuzzy extension of the XPath query language is proposed. Its goal is to achieve more flexible querying through vague queries, which can be expressed exploiting fuzzy predicates and fuzzy connectives. We also provide an elegant definition of structure relaxation and primitive operators to span the space of relaxations. Finally we propose an approach to the fuzzy matching of XML trees: XPath provides a deep-equal function that can be used to assess whether two sequences are recursively equal. This can be restrictive, therefore we provide an extension named deep-similar to assess whether the sequences are similar both in content and in structure. We also provide the user with ranking functions to define how the results should be ranked and presented.