LISTSERV at the University of Georgia
Date:   Mon, 3 Oct 2005 13:58:06 -0500
Reply-To:   "Nick ." <ni14@MAIL.COM>
Sender:   "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From:   "Nick ." <ni14@MAIL.COM>
Subject:   SURVEYSELECT_how to select samples
Content-Type:   text/plain; charset="iso-8859-1"

Hello, I am familiar with PROC SURVEYSELECT, but here is the question I need help with. I have a dataset of, say, 100,000 records. I want to split it RANDOMLY, WITHOUT REPLACEMENT, into two groups (or, to be more general, two or more groups). So I will have two groups: one will be used to build the predictive model and the other to validate it. I know how to do this with data steps, but I am almost certain there is a way in SURVEYSELECT to cut a data set into two or more randomly split (without replacement) groups. No duplicates here!
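
For reference, here is a minimal sketch of one way such a split might be done. METHOD=SRS draws a simple random sample without replacement, and OUTALL keeps every input record in the output with a 0/1 Selected flag, so a data step can route the records into two non-overlapping groups. The dataset names (WORK.ALL_RECORDS, WORK.FLAGGED, WORK.BUILD, WORK.VALIDATE), the 50% rate, and the seed value are assumptions made up for illustration.

```sas
/* Flag a simple random sample (without replacement) of half the records. */
/* OUTALL keeps all input records and adds Selected = 1 or 0.             */
proc surveyselect data=work.all_records  /* hypothetical input dataset */
                  out=work.flagged
                  method=srs
                  samprate=0.5           /* assumed 50/50 split */
                  seed=20051003          /* arbitrary seed, for repeatability */
                  outall;
run;

/* Route each record to exactly one group, so there are no duplicates. */
data work.build work.validate;
    set work.flagged;
    if Selected = 1 then output work.build;
    else output work.validate;
run;
```

Because every record is either selected or not, the two output datasets together contain each input record exactly once.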

As a side note, what if I had the same question as above but wanted to do it WITH REPLACEMENT? I am just curious about this; it is not something I need right now. And what does "with replacement" mean? I tried some code from the archives WITH REPLACEMENT, but I don't get any duplicates in my sample. I thought replacement meant that one record may be chosen more than once. How do you tell SURVEYSELECT to do this? I should get duplicates when I do this, right?
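
On the with-replacement question, a minimal sketch follows. METHOD=URS requests unrestricted random sampling, which is sampling with replacement. By default the output dataset holds one row per distinct selected record, with the number of times it was drawn in the NumberHits variable, which may be why a with-replacement sample appears to have no duplicates. Adding OUTHITS writes one row per selection instead, so a record drawn twice shows up as two rows. The dataset names, the sample size of 1,000, and the seed are assumptions made up for illustration.

```sas
/* Sample with replacement; OUTHITS writes one output row per hit,   */
/* so a record selected more than once appears as duplicate rows.    */
proc surveyselect data=work.all_records  /* hypothetical input dataset */
                  out=work.urs_sample
                  method=urs
                  n=1000                 /* assumed sample size */
                  seed=20051003          /* arbitrary seed, for repeatability */
                  outhits;
run;
```

Without OUTHITS, checking for values of NumberHits greater than 1 in the output would show which records were drawn more than once.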

Thanks.

NICK
