UNIVERSITY OF CALIFORNIA, LOS ANGELES
Department of Economics

Economics 143 (Cameron) - Applied Regression Analysis

SHAZAM code [wls_sim.sha]



* SHAZAM code downloaded from UCLA Econ 143 (CAMERON) WebSite: 
* HTML file called e143sh26.htm, and should have 
*  been downloaded as wls_sim.sha

set nocolor
* We will each create our own sample from a known "population"
* with a known "population regression function," (PRF).  By construction, 
* these data are heteroscedastic.  The true population error variance 
* is approximately proportional to the squared value of the 
* explanatory variable.  

popsize:300
sample 1 [popsize]

* CREATE SOME "POPULATION" DATA:

* Let non-profit organization revenues (r) be distributed uniformly between
* $500,000 and $1,500,000.

genr r=500+uni(1000)

* Let fund-raising expenditures or "development" costs (d) be given by
* a known "data-generating process" plus an error u that depends on revenue.
* The standard deviation of the PRF regression error is "sig" times r.
* Thus, E[u-squared] is proportional to r-squared.

sig:.05
genr u=[sig]*r*nor(1)

genr d=5 + 0.2*r + u 

plot d r

ols d r / resid=e

genr e2=e*e
plot e2 r
genr r2=r*r

*
* For your particular data set (everyone's will be different), is
*  the squared regression error more closely associated with r
*  or with r2?  Which would you expect, given the way the data
*  were constructed?
ols e2 r
ols e2 r2

*
* Assume that the regression  ols e2 r2  has the highest R-squared value.
* Note that we expect it to, since that was the way the data set was
* generated.  It is possible that, for your particular sample, this true
* underlying relationship will not be clear from the sample you have drawn.
*
* If your data do not reveal the true underlying form of
* the heteroscedasticity in the PRF, you have learned that even our best
* efforts with a sample of data will sometimes not reveal the TRUE form
* of heteroscedasticity afflicting our data.  Here, we know the true form,
* but in any real-life application, we will not.
*
* Now, multiply both sides of the population regression function by 1/r.
* This means that d becomes (d/r), the constant term becomes (1/r), and
* the r term becomes (r/r)=1 (the new constant).  
* The next question is:  If we do a regression of the transformed 
* dependent variable on the transformed explanatory
* variables (where the "variable" is now 1/r), what do we get?

* First, generate the transformed variables:
*    td= transformed d
*    tconst= transformed constant term
*    tr= transformed r, now just a constant equal to 1
* We will use OLS td tconst tr / noconstant to prevent an automatic constant


genr td=d/r
genr tconst=1/r
genr tr=1

*
* First plot transformed (td) against transformed constant (tconst)
* Does this more closely satisfy the maintained hypotheses for OLS?

plot td tconst

* Next, run the regression through these points and NOTE the point estimates

ols td tr tconst / noconstant

*
* Now create the appropriate weighting variable and see if you get the same 
* results as by explicit transformation of all variables and weighted 
* least squares

genr wt=1/(r*r)

ols d r / weight=wt

stop




Updated: 10:32 PM 11/18/98; Prepared by: Trudy Ann Cameron; Site Index