|
I was just quoting the documentation. I don't care about the
difference real or perceived.
On 10/26/06, Jim Groeneveld <jim2stat@yahoo.co.uk> wrote:
> Hi Data Null,
>
> In that case, sorting on all variables, you could as well use NODUPKEY.
> So for what instances is NODUPRECS then intended specifically?
>
> Regards - Jim.
> --
> Jim Groeneveld, Netherlands
> Statistician, SAS consultant
> home.hccnet.nl/jim.groeneveld
>
> On Wed, 25 Oct 2006 15:41:30 -0400, data _null_; <datanull@GMAIL.COM> wrote:
>
> >Check the documentation in particular this passage.
> >
> >Because NODUPRECS checks only consecutive observations, some
> >nonconsecutive duplicate observations might remain in the output data
> >set. You can remove all duplicates with this option by sorting on all
> >variables
> >
> >On 10/25/06, Xu Libin <Libin.Xu@irs.gov> wrote:
> >> I thought that nodup option in proc sort get rid of duplicate records
> >> and nodupkey get rid of duplicates of the by variable. When I ran the
> >> below syntax,
> >>
> >> Proc sort data=old out=new nodup;
> >> By id;
> >> Run;
> >>
> >> About 760 cases were deleted. But I was told that they are not duplicate
> >> records. At least one variable has different values. Can anyone on the
> >> list provide an explanation for this? Thanks.
> >>
> >> Libin
> >>
>
|