LISTSERV at the University of Georgia
Menubar Imagemap
Home Browse Manage Request Manuals Register
Previous messageNext messagePrevious in topicNext in topicPrevious by same authorNext by same authorPrevious page (April 1997)Back to main SPSSX-L pageJoin or leave SPSSX-L (or change settings)ReplyPost a new messageSearchProportional fontNon-proportional font
Date:   Fri, 18 Apr 1997 20:07:05 GMT
Reply-To:   Richard F Ulrich <wpilib+@PITT.EDU>
Sender:   "SPSSX(r) Discussion" <SPSSX-L@UGA.CC.UGA.EDU>
From:   Richard F Ulrich <wpilib+@PITT.EDU>
Organization:   University of Pittsburgh
Subject:   Re: cluster analysis with mixed data

Purnima Chawla (pchawla@ETS.ORG) wrote: : Hello, everyone. : I'm trying to do a cluster analysis based on how people have responded to : a survey. Some of the questions in the survey required a Y/N response : and thus yield binomial data. Others required a Likert type response; : still others were designed like multiple choice questions but are graded : and can be interpreted as ordinal data.

: My question is this: how should the proximities be calculated if some of : the data are binomial and some are ordinal? Also, does SPSS do this?

How? carefully!

a) SPSS Proximities has a number of possibilities for scoring distances or proximities for binomial data. It is possible to use a subset of variables, and create a Distance-score from dichotomies; then use that as just one among another set of variables.

b) the default (without STANDARDIZE) means that a variable scored 0-100 would have 10 or 100 times the weight as a variable scored 1-10. Et cetera.

c) the choice of VARIABLES going into your distances is what the weighting will be computed over; if there are 10 trivial variables and one important one, the distances will be not depend much on the important one....

Rich Ulrich, biostatistician wpilib+@pitt.edu http://www.pitt.edu/~wpilib/index.html Univ. of Pittsburgh


Back to: Top of message | Previous page | Main SPSSX-L page