LISTSERV at the University of Georgia
Menubar Imagemap
Home Browse Manage Request Manuals Register
Previous messageNext messagePrevious in topicNext in topicPrevious by same authorNext by same authorPrevious page (June 2004, week 1)Back to main SAS-L pageJoin or leave SAS-L (or change settings)ReplyPost a new messageSearchProportional fontNon-proportional font
Date:   Thu, 3 Jun 2004 09:56:23 -0700
Reply-To:   Nathan Fong <nmfong@HOTMAIL.COM>
Sender:   "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From:   Nathan Fong <nmfong@HOTMAIL.COM>
Subject:   Re: stepwise linear regression modeling
Content-Type:   text/plain; charset="iso-8859-1"

An easy way to do this is to use PROC GLMMOD, which builds your regression matrix the way PROC GLM would, building dummy variables, interactions, etc.

Glad I saw this post. Right now I'm working on a model with high overall fit, and using some selection model seemed like a good way to cut out some fat. I'll look into your oft-repeated advice and see if PLS is appropriate.

BTW - on a complete tangent - I find it hard to follow threads in the archives, because when people change the subject line when they reply it breaks the thread. Is there a FAQ or welcome message that tells people not to do this?

On Wed, 2 Jun 2004 15:40:51 -0700, David L. Cassell <cassell.david@epamail.epa.gov> wrote: > > > "Andrel, Jocelyn" <jandrel@WVMI.ORG> wrote: > > Is there a Proc where you can use both interaction terms and a > selection option (like SELECTION = STEPWISE) for linear regression? > Proc Reg has the selection option, and Proc GLM can handle interaction > terms, but I haven't found a Proc that does both (similar to what Proc > Logistic can do). > > Your best option is: do not do stepwise regression. > > Seriously. I have written pages on this issue in SAS-L before. > (You can bore yourself to tears by looking up my rants in the > SAS-L archives at http://www.listserv.uga.edu/archives/sas-l.html > if you want to. Just search for the keyword 'stepwise'.) > Particularly when you are working with interaction terms, which > by definition will be highly correlated with other variables > in your model, stepwise regression can do bad things. You have > no guarantee that you will get the right term, instead of some > higher-order term which happens to be correlated. > > Second option: Build your own interaction terms. They're just > your first-order terms, multiplied together. But be warned of > the incipient risks. And decide up front how you are going to > interpret an 'interaction' term which is the product of two > continuous variables. What exactly is that supposed to mean in > the real world? That is why you see this feature in PROC GLM but > not in PROC REG. > > [Side note: I don't like the fact that stepwise selection on > big heaping wads of interaction terms is even theoretically > possible in PROC LOGISTIC. It opens up several big steaming > cans of slimy worms that I would just as soon avoid.] > > HTH, > David > -- > David Cassell, CSC > Cassell.David@epa.gov > Senior computing specialist > mathematical statistician >


Back to: Top of message | Previous page | Main SAS-L page