LISTSERV at the University of Georgia
Menubar Imagemap
Home Browse Manage Request Manuals Register
Previous (more recent) messageNext (less recent) messagePrevious (more recent) in topicNext (less recent) in topicPrevious (more recent) by same authorNext (less recent) by same authorPrevious page (June 2007, week 4)Back to main SAS-L pageJoin or leave SAS-L (or change settings)ReplyPost a new messageSearchProportional fontNon-proportional font
Date:         Wed, 27 Jun 2007 15:31:51 -0700
Reply-To:     jeff@ARROWMODEL.COM
Sender:       "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From:         jeff@ARROWMODEL.COM
Organization: http://groups.google.com
Subject:      Geo clustering
Comments: To: sas-l@uga.edu
Content-Type: text/plain; charset="iso-8859-1"

Hello,

I'm trying to find geographical patterns.

The data is a large mailbase of prospects (several million observations). All were mailed a solicitation, some responded, most didn't. For each prospect I have the ZIP code and hence can look up approximate latitude and longitude. I can also calculate the distance in miles between any two ZIP codes.

I'd like to identify areas with high and low response rate that are sufficiently large and stable, for example, by grouping individual ZIP codes into relatively few large clusters (maybe 2 to 5).

I started by grouping the data by ZIP code, calculated response rate for each ZIP, and then did hierarchical clustering. The results were not very good, partly because in some ZIPs there were few responders, and partly because the clusters turned out too round.

I suspect there must be a better way, but what is it? Maybe Kohonen's self-organizing maps?

Thank you,

Jeff Zanooda http://arrowmodel.com


Back to: Top of message | Previous page | Main SAS-L page