|
I am sorry that I did not include Data Null in my previous Thank You
message. I appreciate it that Data Null pointed me to to the location in
SAS menu regarding the nodup/nodupkey options.
Libin
-----Original Message-----
From: SAS(r) Discussion [mailto:SAS-L@LISTSERV.UGA.EDU] On Behalf Of
data _null_;
Sent: Friday, October 27, 2006 11:55 AM
To: SAS-L@LISTSERV.UGA.EDU
Subject: Re: Difference between nodup and nodupkey in sort procedure
I was just quoting the documentation. I don't care about the
difference real or perceived.
On 10/26/06, Jim Groeneveld <jim2stat@yahoo.co.uk> wrote:
> Hi Data Null,
>
> In that case, sorting on all variables, you could as well use
NODUPKEY.
> So for what instances is NODUPRECS then intended specifically?
>
> Regards - Jim.
> --
> Jim Groeneveld, Netherlands
> Statistician, SAS consultant
> home.hccnet.nl/jim.groeneveld
>
> On Wed, 25 Oct 2006 15:41:30 -0400, data _null_; <datanull@GMAIL.COM>
wrote:
>
> >Check the documentation in particular this passage.
> >
> >Because NODUPRECS checks only consecutive observations, some
> >nonconsecutive duplicate observations might remain in the output data
> >set. You can remove all duplicates with this option by sorting on all
> >variables
> >
> >On 10/25/06, Xu Libin <Libin.Xu@irs.gov> wrote:
> >> I thought that nodup option in proc sort get rid of duplicate
records
> >> and nodupkey get rid of duplicates of the by variable. When I ran
the
> >> below syntax,
> >>
> >> Proc sort data=old out=new nodup;
> >> By id;
> >> Run;
> >>
> >> About 760 cases were deleted. But I was told that they are not
duplicate
> >> records. At least one variable has different values. Can anyone on
the
> >> list provide an explanation for this? Thanks.
> >>
> >> Libin
> >>
>
|