Date:         Fri, 18 Jul 1997 11:22:53 +1000
Sender:       "SAS(r) Discussion" <SAS-L@UGA.CC.UGA.EDU>
Subject:      Re: LINKPro System -Reply -Reply
Comments: To:
Content-Type: text/plain

This is a bit off-topic, but probabilistic record linkage (which includes tasks such as customer list de-duplication) is probably of interest to many SAS users.

Richard Hockey notes:

>>> Richard Hockey <>
We have done a lot of successful linkage work using the original Links macros which we have extended to encompass all aspects of probabilistic record linkage. My impression is that LInksPro now includes a lot of this extra stuff. The advantage of the Links macros are that they run under any OS running SAS and they are extremely flexible/configurable. Automatch (mentioned below) on the other hand is not. It is a virtual blackbox system.
<<<

I have to take issue with the statement that AutoMatch is "a virtual black box". In fact, almost every aspect of the linkage process and its parameters is configurable in AutoMatch, and every aspect of its operation is well documented, both in the manual and in the scientific literature (see Medline record below). AutoMatch also runs on just about every platform that SAS runs on, from PC to mainframe. The only downside to AutoMatch (apart from its reasonable but not totally trivial cost) is the need to export all your data to ASCII files.

There are two other record linkage/de-duplication products I know of: SSA-Names and ScrubMaster. Both of these products are definitely "black boxes" and tend to be offered as "turn-key" solutions. AutoMatch (and no doubt LinkPRO and/or the Links macros) requires iterative development of linkage strategies to get optimal results, although in most circumstances you can get pretty good results with minimal fiddling.

Tim Churches
NSW Health Department
Sydney, Australia
Email:

