LISTSERV at the University of Georgia
Menubar Imagemap
Home Browse Manage Request Manuals Register
Previous messageNext messagePrevious in topicNext in topicPrevious by same authorNext by same authorPrevious page (December 2006, week 1)Back to main SAS-L pageJoin or leave SAS-L (or change settings)ReplyPost a new messageSearchProportional fontNon-proportional font
Date:         Fri, 1 Dec 2006 10:45:16 -0500
Reply-To:     Wensui Liu <liuwensui@GMAIL.COM>
Sender:       "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From:         Wensui Liu <liuwensui@GMAIL.COM>
Subject:      Re: Decision Tree refuses to grow
Comments: To: amw5gster@gmail.com
In-Reply-To:  <1164819018.146780.300870@h54g2000cwb.googlegroups.com>
Content-Type: text/plain; charset=ISO-8859-1; format=flowed

what tree algorithm are you using?

I don't know how tree algorithm works in sas/ Eminer. but split method of some tree algorith doesn't depends on p-value at all, such as CART.

My suggestion is to check the default setting of your program. A study indicates that analysis result is highly sensitive to the default setting of software, which is pretty interesting.

On 11/29/06, amw5gster@gmail.com <amw5gster@gmail.com> wrote: > Howdy, > > Silly question that's likely to show I'm overlooking something simple, > but I'm stumped. I have a dset of approx 8M observations and I'm > trying to grow an EMiner decision tree on a binary target variable. > There are about 20 independent variables, mostly interval (dates), but > some nominal, a few binary and one ordinal. The proportion of true > events is about 12%. I have not set any prior probabilities, nor > profit/cost values. > > The tree runs, but returns no splits. It just won't grow. I've tried > dropping the signif value to .00001, using upwards of 11 maximum > branches and my max depth to 10. I also tried having the tree build on > as few as 2 IVs. > > I was able to build a tree when I took a sample of 100K records and > forced the %age of true events in the sample to be 50%. Naturally I > don't want to misrepresent the proportion, and I figured that 12% > wasn't terribly rare for a d-tree. > > Am I outright doing something wrong or is this expected behavior? >

-- WenSui Liu (http://spaces.msn.com/statcompute/blog) Senior Decision Support Analyst Cincinnati Children Hospital Medical Center


Back to: Top of message | Previous page | Main SAS-L page