|
> From: Myra
>
> I have a dataset with over 83000 records. About 2200 of those
> observations are duplicates. How can I find duplicate
> records and remove them? Could I output the duplicates into
> another dataset?
you'll want to squint as you read and comprehend the difference
between the nodupkey and noduprecs
RTFM: NODUPKEY option
Proc Sort statement
PROC Sort data = Lib._83000
out = Work._81000
%*either;
nodupkey
%*or;
noduprecs
%*and last:;
DUPOUT = SAS-data-set
;
by <IdList>;
hth
Ron Fehd the macro maven CDC Atlanta GA USA RJF2 at cdc dot gov
|