LISTSERV at the University of Georgia
Menubar Imagemap
Home Browse Manage Request Manuals Register
Previous messageNext messagePrevious in topicNext in topicPrevious by same authorNext by same authorPrevious page (November 2001, week 1)Back to main SAS-L pageJoin or leave SAS-L (or change settings)ReplyPost a new messageSearchProportional fontNon-proportional font
Date:   Fri, 2 Nov 2001 17:25:12 -0800
Reply-To:   Dale McLerran <dmclerra@MY-DEJA.COM>
Sender:   "SAS(r) Discussion" <SAS-L@LISTSERV.UGA.EDU>
From:   Dale McLerran <dmclerra@MY-DEJA.COM>
Subject:   Re: Interpretting NLMIXED (vs. GEE)
Comments:   To: sean_meldrum@urmc.rochester.edu
Content-Type:   text/plain

Sean Meldrum wrote: >Dale, > >Thank you very much for your responses to my posts on SAS-L. I'm really >happy to get some feedback, advice and opinions on this stuff. As always, >your posts have been very helpful and thought provoking to me. > >I appreciate your comments on GEE. I am comfortable using SUDAAN, though >I'd hoped to start using GENMOD more. I didn't realize that GENMOD fits >only cumulative logit for polytomous variables, though I guess I could get >around that if I need to. I am not concerned about whether the data will >run with GEE - we have 625 subjects (clusters) with up to 3 repeated >measures each - more than 90% have all three observations. >

GEE is designed just for this type of data. Asymptotics will apply nicely here.

> >I would appreciate a copy of the bootstrap macro you mentioned, if it's not >too much trouble - I'm curious. >

I'll send that under separate cover to the list.

> >Thank you for the pitfalls of non-linear modeling that you mention. I've >found hints of the same problems in the literature - specifically troubles >getting the models to converge. Several places have suggested feeding SAS >initial parameter estimates from a conventional logistic regression, but >that doesn't seem to be a guaranteed method. I'm also quite concerned about >comments I've heard about the stability of the models, which you also echo. >Along those same lines, there is very little published advice that I've been >able to find regarding model fitting techniques for these types of models. > >But I'm still perplexed about 'population odds' and 'individual odds'. I've >always been under the impression that the parameters from a conventional >logistic regression represent individual odds - have I been wrong??? It >certainly seems that is how we present the results to people. I mean, after >we do studies we take the parameter estimates and interpret them by saying >things like "On average, if a male age 50-60 stops smoking his risk of >stroke in the next year will decrease by XX%." This sounds like an >individual odds...? Is there something about the parameter estimates from a >non-linear regression that make them uninterprettable outside of the context >of the regression themselves? >

When you fit a logistic regression model to data which are a simple random sample, then there is no difference between individual and population interpretations of odds ratios. The problem arises when you have correlated data. I will do my best with the figure below to demonstrate individual vs population average interpretations.

P(y) | 1111111112222222ppppp33 | 1111 2222pppppp333 | 111 222 ppp 333 | 11 22 pp 33 | 11 22 pp 33 | 11 22 p 33 | 1 2 pp 3 | 1 2 pp 3 | 11 22pp 33 | 11 22p 33 | 1 pp 3 | 1 p 3 | 11 pp2 33 | 11 p22 33 | 11 pp22 33 | 11 p 22 33 | 11 p 22 33 | 11 pp 22 33 | 11 ppp 22 33 | 111 ppp 222 333 | 1111ppppp2222 3333 |11111222222223333333333 ----------------------------------------------------------- X

This figure shows the cumulative probability function for three individuals (identified by values 1, 2, or 3) as they might be found from a conditional logistic regression model. The three probability plots are parallel. Given a certain cumulative probability value, say 0.10, occuring at X(1), X(2), or X(3), then the cumulative probability function at X(1)+d, X(2)+d, and X(3)+d will be the same. What causes the difference between the three curves is that each subject has a different intercept, which shift the curve left or right along the X axis. Also shown on this plot is a population average cumulative probability value as obtained, say, from a GEE. The population average model is identified by the letter p. This population average cumulative probability does not increase at the same rate as the subject specific cumulative probabilities. The population average model flattens out the curve. This implies a lower odds ratio, which is what you typically obtain when you fit a logistic regression model employing GEE as opposed to an NLMIXED or GLIMMIX. Note that by averaging over the subject- specific cumulative probabilities, you can obtain the population average cumulative probability. So, by fitting the subject-specific model, you can subsequently obtain a population average interpretation if you desire. You cannot turn this around and infer a subject- specific effect from a population average effect.

But then the question is what is the level for which you want to make some inference, individual subjects or the population? If you want to make a policy statement that modification of fat intake by such and such an amount in the population will have this certain reduction in population probability of prostate cancer, then you want to employ a population average model. If you are a doctor advising a patient, then fat modification by the same amount will have a different (larger) benefit. This is where you need to make your decision: who is going to use this information? Will individuals be counseled, or will the results be disseminated for policy making?

> >Sorry to keep pounding at the same question - I'm just feeling very dense >this week. Pretty soon I'm just going to have to make a decision and go >with it, and marginal models certainly would be a legitimate way to go. >(And they are bound to be easier for me to figure out!) But before I go >down that road I want to make sure that I'm giving the analysis the best >shot I can - and the chance to learn a new technique just whets my >appetite. > >Thank you again. I hope that others may have some more comments to add, >too. > >Sean Meldrum >University of Rochester >

--------------------------------------- Dale McLerran Fred Hutchinson Cancer Research Center mailto: dmclerra@fhcrc.org Ph: (206) 667-2926 Fax: (206) 667-5977 ---------------------------------------

------------------------------------------------------------ --== Sent via Deja.com ==-- http://www.deja.com/


Back to: Top of message | Previous page | Main SAS-L page