| Date: | Tue, 20 Sep 2011 15:55:52 -0500 |
| Reply-To: | Joe Matise <snoopy369@GMAIL.COM> |
| Sender: | "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU> |
| From: | Joe Matise <snoopy369@GMAIL.COM> |
| Subject: | Re: Remove duplicate rows |
| In-Reply-To: | <DCCDE0D83A1D0E43BB7001C29ED5B5CA06320635@EX10MAIL1.westat.com> |
| Content-Type: | text/plain; charset=ISO-8859-1 |
|---|
What's the difference between
proc sort data=mess;
by _all_;
run;
proc sort data=mess noduprec dupout=mess_duplicates_removed;
by _all_;
run;
and
proc sort data=mess nodupkey dupout=mess_duplicates_removed;
by _all_;
run;
in one step?
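
One way to check this empirically is to run all the variants against a small test data set and compare the results. The sketch below is only an illustration: the data set name mess comes from the thread, while the variables, values, and output data set names are made up. The reasoning it tests is that BY _ALL_ puts every variable in the sort key, so identical records end up adjacent after a single sort and NODUPKEY compares the whole record. The option is spelled noduprecs here; the thread writes it as noduprec.

```sas
/* Hypothetical test data: the variables and values are invented
   for illustration and contain two fully duplicated records. */
data mess;
  input id x $;
  datalines;
2 b
1 a
2 b
1 a
3 c
;
run;

/* Variant 1: one step, NODUPKEY. With BY _ALL_ the key is the whole record. */
proc sort data=mess out=one_step_key nodupkey;
  by _all_;
run;

/* Variant 2: one step, NODUPRECS. The sort itself places identical
   records next to each other before duplicates are dropped. */
proc sort data=mess out=one_step_rec noduprecs;
  by _all_;
run;

/* Variant 3: the two-step approach proposed in the thread. */
proc sort data=mess out=sorted;
  by _all_;
run;

proc sort data=sorted out=two_step noduprecs;
  by _all_;
run;

/* Compare the one-step NODUPKEY result against the other two. */
proc compare base=one_step_key compare=one_step_rec;
run;

proc compare base=one_step_key compare=two_step;
run;
```

If the reasoning holds, both PROC COMPARE steps should report no differences. The out= options are there only so that mess is not overwritten between runs; the snippets in the thread sort mess in place.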
-Joe
On Tue, Sep 20, 2011 at 3:28 PM, Michael Raithel
<michaelraithel@westat.com> wrote:
> Dear SAS-L-ers,
>
> Toby posted the following to Richard's interesting question:
>
> > Richard....
> >
> > I could be mistaken here but somewhere I remembered when you use
> > noduprec you have to sort it first by all the variables and then sort
> > it again with the noduprec as the duplicate records have to be
> > sequential in the data set.
> >
> Toby, Bingo; I was thinking the exact same thing! I was going to suggest
> (using Richard's example):
>
> proc sort data=mess;
> by _all_;
> run;
>
> proc sort data=mess noduprec dupout=mess_duplicates_removed;
> by _all_;
> run;
>
> So, now we have a nomination and a second. Perhaps the motion passes.
> (Man, I've been living in the Washington, DC area for way too long)!
>
> Toby, best of luck in all your SAS endeavors!
>
> Take Care!
>
> ----MMMMIIIIKKKKEEEE
> (aka Michael A. Raithel)
>