Effectiveness Bounds for Non-Exhaustive Schema Matching Systems

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, Academic, peer-review


Abstract

Semantic validation of the effectiveness of a schema matching system is traditionally performed by comparing system-generated mappings with those of human evaluators. The human effort required for validation quickly becomes prohibitive in large-scale environments. The performance of a matching system, however, is determined not solely by the quality of the mappings, but also by the efficiency with which it can produce them. Improving efficiency quickly leads to a trade-off between efficiency and effectiveness. Establishing or obtaining a large test collection for measuring this trade-off is often a severe obstacle. In this paper, we present a technique for determining lower and upper bounds for effectiveness measures for a certain class of schema matching system improvements in order to lower the required validation effort. Effectiveness bounds for a matching system improvement are derived solely from a comparison of the answer sets of the improved and original matching systems. The technique was developed in the context of improving efficiency in XML schema matching, but we believe it to be more generally applicable to other retrieval systems facing scalability problems.
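As an illustration only, the following sketch shows how bounds of this kind can follow from answer-set sizes alone. It is not the paper's actual derivation. It assumes that the improved (non-exhaustive) system returns a subset of the original system's answer set, that the number of correct answers in the original set is already known, and that the total number of relevant mappings is known for recall. All function and parameter names are hypothetical.

```python
def correct_count_bounds(original_size, original_correct, improved_size):
    """Bound the number of correct answers kept by the improved system.

    Assumption (not from the source): the improved answer set is a subset
    of the original answer set, so it can only drop answers, never add them.
    """
    if not 0 <= original_correct <= original_size:
        raise ValueError("original_correct must lie in [0, original_size]")
    if not 0 <= improved_size <= original_size:
        raise ValueError("improved_size must lie in [0, original_size]")

    dropped = original_size - improved_size
    # Worst case: every dropped answer was a correct one.
    lower = max(0, original_correct - dropped)
    # Best case: every dropped answer was an incorrect one.
    upper = min(original_correct, improved_size)
    return lower, upper


def effectiveness_bounds(original_size, original_correct, improved_size,
                         total_relevant):
    """Turn the count bounds into (lower, upper) precision and recall."""
    if total_relevant <= 0:
        raise ValueError("total_relevant must be positive")
    lower, upper = correct_count_bounds(
        original_size, original_correct, improved_size)
    if improved_size == 0:
        precision = (0.0, 0.0)  # convention chosen here for an empty answer set
    else:
        precision = (lower / improved_size, upper / improved_size)
    recall = (lower / total_relevant, upper / total_relevant)
    return {"precision": precision, "recall": recall}


# Hypothetical numbers: 100 original answers, 40 of them correct,
# 70 answers kept by the improved system, 50 relevant mappings overall.
bounds = effectiveness_bounds(100, 40, 70, 50)
```

Under these assumptions no human judgment of the improved system's answers is needed to obtain the interval; only the sizes of the two answer sets are compared. The interval tightens as fewer answers are dropped.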
Original language: Undefined
Title of host publication: Proceedings of the 22nd International Conference on Data Engineering Workshops (ICDEW'06)
Place of Publication: Los Alamitos, CA, USA
Publisher: IEEE
Pages: 83
Number of pages: 10
ISBN (Print): 0-7695-2571-7
Publication status: Published - 7 Apr 2006
Event: 22nd International Conference on Data Engineering, ICDE 2006 - Atlanta, United States
Duration: 3 Apr 2006 to 8 Apr 2006
Conference number: 22

Publication series

Publisher: IEEE Computer Society Press
Number: 2

Workshop

Workshop: 22nd International Conference on Data Engineering, ICDE 2006
Abbreviated title: ICDE
Country/Territory: United States
City: Atlanta
Period: 3/04/06 to 8/04/06

Keywords

  • IR-66511
  • METIS-238228
  • DB-IR: INFORMATION RETRIEVAL
  • DB-SDI: SCHEMA AND DATA INTEGRATION
  • DB-PRJBF: BELLFLOWER
  • EWI-7540
