LISTSERV at the University of Georgia
Date:         Tue, 12 Mar 2002 16:10:49 -0500
Reply-To:     Charles Patridge <Charles_S_Patridge@PRODIGY.NET>
Sender:       "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From:         Charles Patridge <Charles_S_Patridge@PRODIGY.NET>
Subject:      Re: address field - cleaning and separating
Comments: To: Courtney Cook <ccook@MACROINT.COM>

Dear Courtney,

If you do find such code, please let me know <grin> as I would be very interested to see it.

I know there are commercial products, including SI's Data Quality / Data Clean product using Proc Match and Proc Schema, and some of them do try to break addresses into their basic elements.

It may be possible to develop such routines provided the data follows some logical pattern, but in my humble opinion, and in my experience, addresses come in many flavors and colors (mostly dirty).

I believe the routines of mine that you mentioned do not break an address down into its basic elements, nor do they try to. I developed them the way I did precisely because addresses from various sources do not follow any regular pattern. However, some of those routines will go a long way toward making the data a bit more consistent, which may make it possible to develop further routines to help you out.

I would be interested in knowing why you need to turn addresses into their basic elements. I assume it is so you can match addresses against other files, which may also need to be standardized.

Sorry I could not make my routines easier to follow, but dealing with nasty data sometimes requires nasty code <grin>.

Let me know what you find out. Thanks.

Charles Patridge
Email: Charles_S_Patridge@prodigy.net

