Date: Wed, 2 Jul 2008 12:56:57 -0400
Sender: "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From: Nat Wooding <Nathaniel.Wooding@DOM.COM>
Subject: Re: RE Dataset size increasing a lot
Content-Type: text/plain; charset="ISO-8859-1"
Have you compared the widths of the character variables in the two files?
Environmental Specialist III
Dominion, Environmental Biology
4111 Castlewood Rd
Richmond, VA 23234
Phone:804-271-5313, Fax: 804-271-2977
Sent by: "SAS(r) SAS-L@LISTSERV.UGA.EDU
Re: RE Dataset size increasing a
Please respond to
Thanks Patrice but the files are not compressed...
--- El mié 2-jul-08, firstname.lastname@example.org
De: email@example.com <firstname.lastname@example.org>
Asunto: RE Dataset size increasing a lot
Fecha: miércoles, 2 de julio de 2008, 1:16 pm
It sounds like the original dataset (Data1) is compressed...
In order to find out, open the original dataset in a viewtable, go to the
menu "Data" and select "Table Attributes". From there, choose the
"Advanced" tab and look for (check) into the "Attributes (compressed)"
Hope if will help you out.
Patrice B. :)
P.S.; In order to create a sas dataset in compress mode, you could either
use "data work.data2 (compress = yes);" or specify the compress option like
the following "options compress = yes" just before you execute your code.
The first option is the one I would rather use since the compression will
only apply to this dataset. The second option will affect all the datasets
following the options or until you switch it back to "options compress =
"SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU> a écrit sur 2008-07-02
> Hi list.
> Playing with a dataset I find this issue:
> Original dataset: data1. (Size: 76745 kb). 2000000 obs, 6 vars.
> I made a copy in my hard disk of the same dataset with the name data2.
> Using the following code:
> proc sql;
> delete * from data1;
> I got a file with 0 obs, 6 vars and the same size.
> Then I insert the data2 into the data1 to see data1 size with this code:
> proc sql;
> insert into data1
> select *
> from data2;
> The result is a data1 with 2000000 obs. and 6 vars (the same as
> data2) but with the size doubled.
> Taking a look in the explorer I went to the datasets properties and
> found in the details the following:
> For data1: Deleted Rows: 2000000.
> Number of dataset pages: 38372.
> For data2: Deleted Rows: 0
> Number of dataset pages: 19186.
> Can anybody help me about dealing with this issue? Because in a real
> scenario the dataset sizes is going to increase and increase in
> every delete and insert of rows.
> ¡Buscá desde tu celular!
> Yahoo! oneSEARCH ahora está en Claro
¡Buscá desde tu celular!
Yahoo! oneSEARCH ahora está en Claro
CONFIDENTIALITY NOTICE: This electronic message contains
information which may be legally confidential and/or privileged and
does not in any case represent a firm ENERGY COMMODITY bid or offer
relating thereto which binds the sender without an additional
express written confirmation to that effect. The information is
intended solely for the individual or entity named above and access
by anyone else is unauthorized. If you are not the intended
recipient, any disclosure, copying, distribution, or use of the
contents of this information is prohibited and may be unlawful. If
you have received this electronic transmission in error, please
reply immediately to the sender that you have received the message
in error, and delete it. Thank you.