• News and feature updates
  • Tutorials and Tips and tricks
  • General discussions and opinions

De-duping gets a make over

June 25, 2010 |  by  |  Site Updates

As of version 3.0.0

Just one update to the site to mention – but a good one!

  • New De-Dupe Merged Export

When you export a De-Dupe matching project, 2 results files are now created for you to download.

File 1.) This is your main cleansed de-duped  list, containing records that were not matched and the merged enhanced derived record from each de-duped group of matches.

File 2.) The “Master Grouping” file. This file contains all the matched/duplicate records grouped together by a unique Master Group ID.  It allows you to see those records that were matched and grouped together.

Let me explain a bit about the merged/enhanced de-duped records in File 1.

For each group of matched records, which are in essence the duplicates you found and selected using the Match Visualiser, we create one master record for that group. This master record is derived from the most common values found within the group on a column by column basis.

So if some of the matched records didn’t have a value for some columns and other records in that same group did, then the most common value for that column would be used. This creates a more complete record to use.

In the case of an address Match2Lists will choose the most complete address from one of your de-duped records and use this address, this avoids issues with fictitious addresses being created.

Screenshot showing the 2 results files ready to download

Screenshot showing the 2 results files ready to download

Use it, try it out and let us know what you think. Leave feedback here or send us as email, we love reading feedback from our users.

Until the next time, happy matching.

P.S. The next upgrade will allow you to define your own logic to determine how to choose which value to incorporate in your master record.

 

Leave a Reply

copyright ©2008-2012 Match2Lists Ltd