LISTSERV at the University of Georgia
Menubar Imagemap
Home Browse Manage Request Manuals Register
Previous (more recent) messageNext (less recent) messagePrevious (more recent) in topicNext (less recent) in topicPrevious (more recent) by same authorNext (less recent) by same authorPrevious page (October 2006, week 4)Back to main SAS-L pageJoin or leave SAS-L (or change settings)ReplyPost a new messageSearchProportional fontNon-proportional font
Date:   Fri, 27 Oct 2006 11:49:50 -0400
Reply-To:   Xu Libin <Libin.Xu@IRS.GOV>
Sender:   "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From:   Xu Libin <Libin.Xu@IRS.GOV>
Subject:   Re: Difference between nodup and nodupkey in sort procedure
Comments:   To: "Choate, Paul@DDS" <pchoate@DDS.CA.GOV>
In-Reply-To:   <A4F0DBF1F84D4F46A0E6D56405D3511989E675@HHAEXMB03.rf01.itservices.ca.gov>
Content-Type:   text/plain; charset="US-ASCII"

Thanks to the advice of Paul, Arthur, and Mittapalli, particularly your suggestions on dupout option, I was able to confirm that the data set does have identical records. I feel confident now to talk to my data provider concerning the issue. I appreciate very much your timely assistanceo the matter.

Libin

-----Original Message----- From: Choate, Paul@DDS [mailto:pchoate@DDS.CA.GOV] Sent: Wednesday, October 25, 2006 6:47 PM To: Xu Libin; SAS-L@LISTSERV.UGA.EDU Subject: RE: Difference between nodup and nodupkey in sort procedure

Libin-

In v9 there is DUPOUT= option on the PROC SORT statement: "Specify the output data set to which duplicate observations are written"

hth

Paul Choate DDS Data Extraction (916) 654-2160

-----Original Message----- From: SAS(r) Discussion [mailto:SAS-L@LISTSERV.UGA.EDU] On Behalf Of Xu Libin Sent: Wednesday, October 25, 2006 1:43 PM To: SAS-L@LISTSERV.UGA.EDU Subject: Re: Difference between nodup and nodupkey in sort procedure

I tried to figure out a way to isolate those deleted cases so that I can compare, but don't know how this can be done.

Libin

-----Original Message----- From: SAS(r) Discussion [mailto:SAS-L@LISTSERV.UGA.EDU] On Behalf Of Arthur Tabachneck Sent: Wednesday, October 25, 2006 4:05 PM To: SAS-L@LISTSERV.UGA.EDU Subject: Re: Difference between nodup and nodupkey in sort procedure

Libin,

Can you post some sample data where this happens? If the file isn't sorted by all variables I can see nodup NOT getting rid of duplicates, but I've never seen it delete records that actually don't match on all variables.

Art -------- On Wed, 25 Oct 2006 15:29:05 -0400, Xu Libin <Libin.Xu@IRS.GOV> wrote:

>I thought that nodup option in proc sort get rid of duplicate records >and nodupkey get rid of duplicates of the by variable. When I ran the >below syntax, > >Proc sort data=old out=new nodup; > By id; >Run; > >About 760 cases were deleted. But I was told that they are not duplicate >records. At least one variable has different values. Can anyone on the >list provide an explanation for this? Thanks. > >Libin


Back to: Top of message | Previous page | Main SAS-L page