Object Identification Quality

Mattis Neiling(Freie Universität Berlin), Steffen Jurk(Freie Universität Berlin), Hans‐J. Lenz(Brandenburg University of Technology Cottbus-Senftenberg), Felix Naumann
edoc Publication server (Humboldt University of Berlin)
January 1, 2003
Cited by 14Open Access
Full Text

Abstract

Research and industry has tackled the object identification problem of data integration in many different ways. This paper presents a framework, that allows the evaluation of competing approaches. To this end, complexity measures and data characteristics are introduced, which reflect the hardness of a given object identification problem. All characteristics can be estimated by use of simple SQL queries and simple calculations. Following the principle of benchmark definitions we specify a test framework. It consists of a test database and its characteristics, quality criteria, and a test specification. Adequate measures needed for the correctness criterion of the benchmark are given. A running example of the Berlin Online Apartment-Advertisements database (BOA) illustrates the approach. The BOA-database is freely available at www.wiwiss.fu-berlin.de/lenz/boa/.


Related Papers

No related papers found

Powered by citation graph analysis