
Ed230B Assignment #1 - Simple Regression
Part I: Consider the following results from a set of data containing 100 pairs of observations:
| Midterm (X) | Final (Y)
|
| Mean | Xbar=30 | Ybar=100
|
| Standard Deviation | sx=3.5 | sy=10
|
| Correlation Coefficient | rxy=0.65
|
From the information given above compute the following, showing all formulas and work:
Part II: Run a simple regression on the computer using the file
data.reg1 or in Stata
use http://www.gseis.ucla.edu/courses/data/reg1
- Circle and label each of the terms computed in Part I. Calculate and list any term you cannot find in the computer output.
- Plot standardized residuals. Indicate outliers, if any.
- Produce each of the following bivariate scatter plots (vertical axis listed first):
Include your comments for each of these plots with respect to possible violations of assumptions.
- Dependent Variable (Y) vs. Independent Variable (X)
- Residuals vs Predicted (fitted) Y
- Residuals vs X
- Residuals vs id (indexplot)
Note: You may use any form of residuals for the plots in (b) and (c) that you wish (raw
residuals, standardized, Studentized, etc.).
UCLA Department of Education
Phil Ender, 4Jan99