Date: Thu, 26 Jun 2008 10:26:24 -0700
Reply-To: SRAgodude@GMAIL.COM
Sender: "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From: SRAgodude@GMAIL.COM
Organization: http://groups.google.com
Subject: losing records in large data set
Content-Type: text/plain; charset=ISO-8859-1
Hi,
I have a large data set with about 24 million records. To verify this,
when I run
proc contents data = cp_june26;
run;
I get:
The CONTENTS Procedure
Data Set Name WORK.CP_JUNE26
Observations 24354148
Member Type DATA
Variables 49
Engine V9
Indexes 0
... etc.
If I then run
data cp_for_partition ;
set cp_june26;
run;
the log says:
NOTE: There were 2500000 observations read from the data set
WORK.CP_JUNE26.
NOTE: The data set WORK.CP_FOR_PARTITION has 2500000 observations and
49 variables.
NOTE: DATA statement used (Total process time):
real time 1:08.52
cpu time 5.82 seconds
So my question is: how can I go from 24 million records to only 2.5
million records?