LISTSERV at the University of Georgia
Date:   Thu, 18 Aug 2005 08:08:07 -0700
Reply-To:   Toby <tobydunn@HOTMAIL.COM>
Sender:   "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From:   Toby <tobydunn@HOTMAIL.COM>
Organization:   http://groups.google.com
Subject:   Re: a toughy for me
Comments:   To: sas-l@uga.edu
In-Reply-To:   <1124377400.012951.63390@g43g2000cwa.googlegroups.com>
Content-Type:   text/plain; charset="iso-8859-1"

Tina,

Take a look at the fuzzy-matching functions that SAS provides, such as SOUNDEX and SPEDIS.
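As a rough illustration of what those functions return, here is a minimal sketch. The dataset and variable names (names, sample_name, db_name) and the sample rows are made up for the example; they are not Tina's actual data.

```sas
/* Hypothetical test data: one sample name against three database spellings */
data names;
   length sample_name db_name $40;
   infile datalines dsd;
   input sample_name $ db_name $;
   datalines;
Walt Disney,Walt Disney Co.
Walt Disney,Disney (Walt)
Walt Disney,Walt Disny Co
;
run;

data scored;
   set names;
   /* First blank-delimited word of each name */
   first_sample = scan(sample_name, 1, ' ');
   first_db     = scan(db_name, 1, ' ');
   /* 1 when the two first words share the same SOUNDEX code, else 0 */
   same_sound = (soundex(first_sample) = soundex(first_db));
   /* SPEDIS(query, keyword): 0 is an exact match, larger is further apart */
   dist = spedis(upcase(first_db), upcase(first_sample));
run;

proc print data=scored;
run;
```

Note that SPEDIS is asymmetric, so the order of the two arguments matters, and any cutoff on dist would have to be chosen by inspecting the real data.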

Toby Dunn

Hi,

I have a tough problem, at least for me, and would really appreciate any help you can offer.

I want to pull some data from a database into my dataset. I have a unique 8-digit company id (co_id) to do the matching.

After matching on 8 digits, first 7 digits, and first 6 digits of 'co_id', I still have ~2,000 firms unmatched out of ~9,000.

As the next step, I want to match on the first 5 digits of 'co_id' and require that the matching firm's name contain the first word of the sample firm's company name (coname_1stPart).

For example, if the sample firm is Walt Disney, I want to pull any firm in the database that has the same first 5 digits of 'co_id' and whose name contains the word 'Walt' (coname_1stPart). I do not want to do exact matching (a.coname=b.coname), since the database may list the company name as 'Walt Disney Co.' or 'Disney (Walt)', etc.

I want to do something like "if index(b.coname, a.coname_1stPart)". But the tricky part is that I do not know the value of a.coname_1stPart beforehand.
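One possible way to express this, as a sketch only: INDEX accepts a variable as its second argument, so the search word does not have to be known in advance and can be taken from each row during a join. The table names sample_firms and db_firms are hypothetical, and the code assumes co_id is stored as a character variable in both tables.

```sas
/* Sketch only: sample_firms and db_firms are assumed table names.      */
/* Assumes co_id is character; a numeric co_id would first need to be   */
/* converted, for example with put(co_id, z8.).                         */
proc sql;
   create table matched as
   select a.co_id  as sample_co_id,
          a.coname as sample_coname,
          b.co_id  as db_co_id,
          b.coname as db_coname
   from sample_firms as a, db_firms as b
   /* Same first 5 characters of the company id */
   where substr(a.co_id, 1, 5) = substr(b.co_id, 1, 5)
     /* Database name contains the sample firm's first word;            */
     /* STRIP drops trailing blanks that would otherwise block a match, */
     /* UPCASE makes the comparison case-insensitive                    */
     and index(upcase(b.coname), upcase(strip(a.coname_1stPart))) > 0;
quit;
```

Because INDEX looks for a substring, a short first word can also match inside longer, unrelated words, so the resulting pairs would still need to be reviewed.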

Any suggestions?

Thanks very much.

Tina

