Reply
 
LinkBack Thread Tools Search this Thread Display Modes
  #1   Report Post  
Posted to microsoft.public.excel.worksheet.functions
external usenet poster
 
Posts: 2,069
Default multivariate regression

I am trying to teach myself the multivariate regression function in
toolsdata analysisregression. The following dummy variables I made up are
'significant' by the F-test but...what is a layman's explanation for the
Coefficients and Residuals you get? Is there a good Excel page that explains
to 'us' dummies the Summary Output page?

AGE (X1) PACKS/MONTH (X2) LUNG CANCER RISK (Y)
30 25 0.4
40 30 1.2
50 25 1.7
60 37 3.5
70 42 6.7
80 39 9.9

Thanks!!!
  #2   Report Post  
Posted to microsoft.public.excel.worksheet.functions
external usenet poster
 
Posts: 837
Default multivariate regression

A quibble with the terminology in your subject line: "multivariate
regression" is usually taken to mean regression with multiple response (y)
variables per observation. Your example with a single response variable but
multiple predictors is more commonly called "multiple regression" or
"multivariable regression".

The coefficients section says that your model estimates that
Y = -6.92404 + 0.176712*X1 + 0.033481*X2

Note that this model predicts a negative risk of cancer for non-smoking
babies; you should always beware of extrapolating beyond your data!

Next, notice the p-values for these individual coefficients. While the
overall model is mildly significant, no single predictor is; although age
exhibits the most evidence for inclusion in the model. It is possible for
observational data to have correlated predictors that make it difficult to
separate which variables are responsible for the response; but your data set
is small and regression on age alone appears significant while regression on
smoking level alone does not.

In particular, your data gives no evidence of a dose response relationship
between the amount smoked and the risk of lung cancer. This could be due to
a number of reasons including
- There may be no dose response relationship (unlikely given external
evidence).
- Your data set is too small relative to the inherent variability to show
much or avoid artifacts.
- All of your data is from relatively heavy smokers (~ 1 pack per day or
more) where a little more or less may not make that much difference. Note
especially that prediction for light nor non-smokers of any age would be
extrapolation beyond the data.
- There may be other important predictor variables that are not constant
for these subjects but are not provided in the data set.
....

Residuals are the discrepancy between predicted values and actual values.
If you had more data, then you would want to plot them against each of your
predictors. If the model were adequate, you should not see any obvious
patterns in those plots.

This barely scratches the surface of the topic. I highly recommend taking a
class or at least reading a good text on regression, such as "Applied
Regression Analysis" by Draper and Smith.

Jerry

"John" wrote:

I am trying to teach myself the multivariate regression function in
toolsdata analysisregression. The following dummy variables I made up are
'significant' by the F-test but...what is a layman's explanation for the
Coefficients and Residuals you get? Is there a good Excel page that explains
to 'us' dummies the Summary Output page?

AGE (X1) PACKS/MONTH (X2) LUNG CANCER RISK (Y)
30 25 0.4
40 30 1.2
50 25 1.7
60 37 3.5
70 42 6.7
80 39 9.9

Thanks!!!

Reply
Thread Tools Search this Thread
Search this Thread:

Advanced Search
Display Modes

Posting Rules

Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On


Similar Threads
Thread Thread Starter Forum Replies Last Post
multivariate histograms Masum Charts and Charting in Excel 0 March 25th 08 08:20 AM
The multiple regression tool - linear or non-linear regression? Statistical interest Excel Discussion (Misc queries) 2 February 6th 08 11:01 PM
Is correlation matrix univariate or multivariate? Dora Smith Excel Worksheet Functions 1 February 6th 07 04:26 AM
how to do multivariate analysis on excel? pleasehelp Excel Worksheet Functions 1 December 9th 06 11:52 AM
Multivariate statistics paulinoluciano Excel Worksheet Functions 1 December 28th 05 02:00 PM


All times are GMT +1. The time now is 04:13 AM.

Powered by vBulletin® Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
Copyright ©2004-2024 ExcelBanter.
The comments are property of their posters.
 

About Us

"It's about Microsoft Excel"