by language constructs, and furthermore, clustering strategies to deal with transitivity conflicts are proposed. In [12], Hernández et al. propose the sliding window approach for similarity-based duplicate identification, applicable where a neighborhood-conserving key can be derived, and describe ef...