A simple random sample of 35 world-ranked chess players provides the following statistics: Number of hours of study per day:  ,  Yearly winnings:  ,  Correlation:   Based on this information, what is the resulting linear regression equation?
Winnings = 4850 Hours + 178,000

Winnings = 6300 Hours + 169,000

Winnings = 31,200 Hours + 14,550

Winnings = 32,300 Hours + 7750

Winnings = 42,000 Hours - 52,400
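
A regression line can be built from summary statistics alone: the slope is the correlation times the ratio of the standard deviations (response over explanatory), and the line passes through the point of means. The sketch below is only an illustration. The function and parameter names are hypothetical, and no values from the question are filled in.

```python
def regression_from_summary(mean_x, sd_x, mean_y, sd_y, r):
    """Return (slope, intercept) of the least squares line y = slope * x + intercept.

    mean_x, sd_x: mean and standard deviation of the explanatory variable
    mean_y, sd_y: mean and standard deviation of the response variable
    r:            correlation between the two variables
    """
    slope = r * sd_y / sd_x              # b = r * s_y / s_x
    intercept = mean_y - slope * mean_x  # line passes through (mean_x, mean_y)
    return slope, intercept
```

Here hours of study would be the explanatory variable and yearly winnings the response.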

A scatterplot of a company's revenues versus time indicates a possible exponential relationship.   A linear regression on  against  gives  with .  Which of the following are valid conclusions? I.  On average, revenue goes up 0.82 thousand dollars (or \$820) per year. II.  The predicted revenue in year 2005 is approximately 59 million dollars. III.  53% of the variation in revenue can be explained by variation in time.
I

II

III

I and III

None of the above are valid conclusions.

Suppose the regression line for a set of data, , passes through the point .  If  and  are the sample means of the x and y values, respectively, then
Consider the following residual plot: Which of the following scatterplots could have resulted in the above residual plot?  (The y-axis scales are the same in the scatterplots as in the residual plot.)
None of these could result in the given residual plot.

Which of the following is a correct conclusion based on the residual plot displayed?
The line overestimates the data.

The line underestimates the data.

It is not appropriate to fit a line to these data since there is clearly no correlation between the variables.

The data are not related.

None of these

A simple random sample of years and earnings was organized into pairs (time in years, earnings in \$1,000s).  The scatterplot appears exponential and the transformation  is applied to the data.  Linear regression is done on the transformed data and the following results are given: Which of the following is a valid conclusion?
The earnings gained after 12 years are approximately 5.8759.

The earnings gained after 12 years are approximately 356,345.

The earnings will increase by 0.464 thousand dollars each year.

The original investment was \$307.90.

None of these is valid.
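
When a line is fitted to transformed data, its predictions are on the transformed scale and have to be converted back before they can be read as earnings. The sketch below assumes the transformation was the natural logarithm and that 5.8759 is the fitted value at 12 years. Neither assumption is confirmed by the text above.

```python
import math

# Assumption: the model is ln(earnings) = a + b * years, and 5.8759 is the
# fitted value of ln(earnings) at 12 years (earnings in $1,000s).
predicted_ln_earnings = 5.8759

# Undo the natural-log transformation to return to the original scale.
earnings_thousands = math.exp(predicted_ln_earnings)

print(f"Predicted earnings: about {earnings_thousands:.3f} thousand dollars")
```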

The heart disease death rates per 100,000 people in the United States for certain years, as reported by the National Center for Health Statistics, were:

Year        1950   1960   1970   1975   1980
Death rate  307.6  286.2  253.6  217.8  202.0

Which of the following is a correct interpretation of the slope of the best-fitting straight line for the above data?
The heart disease rate per 100,000 people has been dropping about 3.627 per year.

The baseline heart disease rate is 7386.87.

The regression line explains 96.28% of the variation in heart disease death rates over the years.

The regression line explains 98.12% of the variation in heart disease death rates over the years.

The heart disease death rate will be zero in the year 2036.
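
The figures quoted in these choices can be checked by fitting the least squares line to the five data points directly. This is a minimal sketch using the standard formulas. The variable names are illustrative, and the 1960 value is read as 286.2.

```python
years = [1950, 1960, 1970, 1975, 1980]
rates = [307.6, 286.2, 253.6, 217.8, 202.0]  # deaths per 100,000 (1960 read as 286.2)

n = len(years)
mean_x = sum(years) / n
mean_y = sum(rates) / n

# Sums of squares and cross-products about the means.
sxx = sum((x - mean_x) ** 2 for x in years)
syy = sum((y - mean_y) ** 2 for y in rates)
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, rates))

slope = sxy / sxx                     # change in death rate per year
intercept = mean_y - slope * mean_x   # fitted value at year 0 (far outside the data)
r_squared = sxy ** 2 / (sxx * syy)    # share of variation explained by the line

print(f"slope = {slope:.3f}, intercept = {intercept:.2f}, r^2 = {r_squared:.4f}")
```

Only the slope describes a rate of change per year. The intercept and r-squared are different quantities, even when their numerical values are computed correctly.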

Which of the following is the quantity that is minimized by the least squares regression process?
Which of the following is a true statement?
The higher the correlation coefficient, the steeper the line of best fit.

The correlation coefficient has the same sign as the y-intercept of the least squares regression line.

A low correlation coefficient indicates a weak relationship between the two variables.

Two sets of bivariate data can have approximately equal correlation coefficients but very different scatterplots.

All of these are true.
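
One way to test statements about how correlation and slope relate is to rescale one variable and see which quantity changes. The sketch below uses a small made-up data set. The helper name and the numbers are purely illustrative and do not come from the question.

```python
import math

def slope_and_r(xs, ys):
    """Return (least squares slope, correlation coefficient) for paired data."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx, sxy / math.sqrt(sxx * syy)

xs = [1, 2, 3, 4, 5]
ys = [2, 1, 4, 3, 5]                 # made-up illustrative data
ys_scaled = [10 * y for y in ys]     # same data with y measured in smaller units

slope_a, r_a = slope_and_r(xs, ys)
slope_b, r_b = slope_and_r(xs, ys_scaled)

print(f"original: slope = {slope_a:.2f}, r = {r_a:.2f}")
print(f"rescaled: slope = {slope_b:.2f}, r = {r_b:.2f}")
```

Changing the units of y multiplies the slope by the same factor and leaves the correlation coefficient unchanged.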

If the correlation coefficient of a bivariate set of data  is , then which of the following is true?
The variables   and   are linearly related.

The correlation coefficient of the set   is also  .

The correlation coefficient of the set   is  .

The correlation coefficient of the set   is

None of these.

