Date: Wed, 3 Apr 2002 22:30:42 +0100
Reply-To: "R. Allan Reese" <R.A.Reese@gri.hull.ac.uk>
Sender: "SPSSX(r) Discussion" <SPSSX-L@LISTSERV.UGA.EDU>
From: "R. Allan Reese" <R.A.Reese@gri.hull.ac.uk>
Subject: Re: Duplicate question
In-Reply-To: <3CAB6655.1EFA1FC1@fibertel.com.ar>
Content-Type: TEXT/PLAIN; charset=US-ASCII
Just to add a grace note to Hector's response, it is sensible to SORT the
cases first: a duplicate ID is possibly a mistyping, so it may be in the
wrong order and not adjacent to the case it duplicates. I have just used
the same idea to add jitter so that colocated points become visible on a
scatterplot. Only the "duplicate" points need be moved:
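* Sort so that identical (x, y) pairs become adjacent cases.
sort cases by x y.
* Flag a case as a duplicate when it matches the preceding case.
* Starting from 0 keeps the first case (where lag is missing) unflagged.
compute dup = 0.
if (x eq lag(x) and y eq lag(y)) dup = 1.
* Shift flagged points by a random offset between -0.5 and 0.5.
compute jit = uniform(1) - 0.5.
compute xj = x + dup*jit.
compute yj = y - dup*jit.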
(BTW: I didn't paste this syntax, so apologies for any typos.)
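A possible follow-up, offered only as a sketch: once dup, xj and yj exist
(the names are the ones used in the syntax above), one could count the
flagged cases and then plot the jittered coordinates instead of the raw
ones. This assumes the jitter range of plus or minus 0.5 is small relative
to the spread of x and y; if it is not, the offset would need rescaling.

* Count how many cases were flagged as duplicates.
frequencies variables = dup.
* Plot the jittered coordinates rather than the original x and y.
graph /scatterplot(bivar) = xj with yj.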
R. Allan Reese Email: r.a.reese@gri.hull.ac.uk