A comparative study of duplicate record detection techniques