ExcelBanter

ExcelBanter (https://www.excelbanter.com/)
-   Excel Worksheet Functions (https://www.excelbanter.com/excel-worksheet-functions/)
-   -   How do I run a regression on data that is not numerical? (https://www.excelbanter.com/excel-worksheet-functions/112338-how-do-i-run-regression-data-not-numerical.html)

NG

How do I run a regression on data that is not numerical?
 
I am using Microsoft Excel 2003. I have been running regressions on
numerical data and am curious to know how to run one if part of my data is
non-numerical such as gender or race.

ExcelBanter AI

Answer: How do I run a regression on data that is not numerical?
 
Unfortunately, Microsoft Excel 2003 does not have the capability to run regressions on non-numerical data. Regression analysis requires numerical data in order to calculate the relationship between variables. However, there are a few workarounds you can try:
  1. Convert non-numerical data to numerical data: If you have categorical data such as gender or race, you can assign numerical values to each category. For example, you can assign 1 to male and 2 to female, or 1 to white, 2 to black, and so on. This will allow you to run a regression analysis, but keep in mind that the numerical values you assign do not have any inherent meaning.
  2. Use a different statistical softwa There are many statistical software packages available that can handle non-numerical data, such as R or SAS. These programs have more advanced regression analysis capabilities and can handle a wider range of data types.
  3. Consider a different type of analysis: If you cannot convert your non-numerical data to numerical data, you may need to consider a different type of analysis that is more appropriate for your data. For example, you could use a chi-squared test to analyze the relationship between two categorical variables.

Jerry W. Lewis

How do I run a regression on data that is not numerical?
 
Where only two values are possible (as with gender) then you use a single
variable with +1 for one gender and -1 for the other. Extending to more than
two values is possible, but non-trivial.

Alternately, if you have Excel 2003 or later, you can create an indicator
variable (0 or 1) for each possible non-numeric value. This approach
directly permits more than 2 possible values.

Jerry

"NG" wrote:

I am using Microsoft Excel 2003. I have been running regressions on
numerical data and am curious to know how to run one if part of my data is
non-numerical such as gender or race.


NG

How do I run a regression on data that is not numerical?
 
Can you explain the Excel 2003 or later indicator variables a little more? I
have four non-numerical values for race.

Thanks for the information on gender. I was using 1 for men and 2 for
females.

"Jerry W. Lewis" wrote:

Where only two values are possible (as with gender) then you use a single
variable with +1 for one gender and -1 for the other. Extending to more than
two values is possible, but non-trivial.

Alternately, if you have Excel 2003 or later, you can create an indicator
variable (0 or 1) for each possible non-numeric value. This approach
directly permits more than 2 possible values.

Jerry

"NG" wrote:

I am using Microsoft Excel 2003. I have been running regressions on
numerical data and am curious to know how to run one if part of my data is
non-numerical such as gender or race.


Mike Middleton

How do I run a regression on data that is not numerical?
 
NG -

For four levels of a categorical variable, e.g., A or B or C or D, use three
indicator variables. Select one level as the base case, e.g., A, and the
value of each indicator variable (B, C, D) shows whether an observation is B
or not B, C or not C, etc. For an observation with level A, the value of all
three indicator variables is zero. The regression coefficients measure how
different B,C,D are from the base case A, on the average.

I use the same approach of gender, e.g., 0 for male and 1 for female, in
which case the regression coefficent for the gender indicator shows how
females differ from males, on the average.

- Mike
http://www.mikemiddleton.com

"NG" wrote in message
...
Can you explain the Excel 2003 or later indicator variables a little more?
I
have four non-numerical values for race.

Thanks for the information on gender. I was using 1 for men and 2 for
females.

"Jerry W. Lewis" wrote:

Where only two values are possible (as with gender) then you use a single
variable with +1 for one gender and -1 for the other. Extending to more
than
two values is possible, but non-trivial.

Alternately, if you have Excel 2003 or later, you can create an indicator
variable (0 or 1) for each possible non-numeric value. This approach
directly permits more than 2 possible values.

Jerry

"NG" wrote:

I am using Microsoft Excel 2003. I have been running regressions on
numerical data and am curious to know how to run one if part of my data
is
non-numerical such as gender or race.





All times are GMT +1. The time now is 12:02 PM.

Powered by vBulletin® Copyright ©2000 - 2024, Jelsoft Enterprises Ltd.
ExcelBanter.com