Median calculation of grouped data.
"Philippe L. Balmanno" wrote in message
...
"Bruce" wrote in message
...
Hi PHilippe: Thanks for responding. My data currently looks like this:
A B C D E
3 5 6 3 2
where (A, B, C, D, E) correspond to the number of responses I get for
ratings of (1, 2, 3, 4, 5), respectively. For example, a "6" in column C
indicates that I have 6 responses of a rating "3".
The median for this "grouped" data is 3 as there are 8 responses below
the 3
rating and 8 above. But doing an "=MEDIAN(A1:E1) would give me the
median
value of the group (2,3,3,5,6), when what I actually want is the that of
the
groups(1,1,2,2,2,3,3,3,4,4,4,4,4,5,5,5,5,5,5). Does this make sense?
But
how to do it???
-Bruce
--
Bruce
"Philippe L. Balmanno" wrote:
"Philippe L. Balmanno" wrote in message
...
"Bruce" wrote in message
...
Greetings all: All I want to do is calculate the median of grouped
data.
I
have five cols. of data corresponding to a 1:5 Likert scale. For
example: 3
ones, 5 twos, 3 threes, 6 fours, 2 fives. I have 19 values, so the
10th
ranked value would be the median. The 10th value resides in the
"threes"
column, so that's my median value. But how to do this via a
function?
Do I expand these cells like this?:
=MEDIAN(1,1,1,2,2,2,2,2,3,3,3,4,4,4,4,4,4,5,5), which I don't know
how to
do,
by the way, or is there a simpler way?
Many thanks in advance...
--
Bruce
Well if you have your values in a row columns A1:S1 then
if this was placed in T1: =MEDIAN(A1:S1) It would bring up your
Median.
If your numbers were in a column like in A1:A19 then =MEDIAN(A1:A19)
but
=MEDIAN(A1:E6) wouldn't make sense unless you had an array.
5 columns (6) 4's
so your data would look like this {=MEDIAN(A1:E4)}
A B C D E
1 1 1 2 2
2 2 2 3 3
3 4 4 4 4
4 4 5 5
Because each column corresponds to a value, what you want then is the max
value in the row
A B C D E
1 3 5 6 3 2
=MAX(A1:E1) would return 6 where Column C corresponds to the rating of 3
and there are 6
You should consider the way you are recording your data. If the entire data
set has a chance of multiple equal ratings, I wouldn't compress it as much
as you have. The reason is what if for some fluke you come up with:
A B C D E
1 3 6 6 3 2
where would your mode be? At the rating of 2 or 3? If this possibility
isn't an issue then forget I mentioned it. If it is then you're better off
disigning a Frequency Distribution table and using Tools/Data
Analysis/Descriptive Statistics like this: (noter the mode and the max and
you will see why I chose =MAX(A1:E1)
Rating Frequency
1 3
2 5
3 6
4 3
5 2
Rating Frequency
Mean 3 Mean 3.8
Standard Error 0.707106781 Standard Error 0.734846923
Median 3 Median 3
Mode #N/A Mode 3
Standard Deviation 1.58113883 Standard Deviation 1.643167673
Sample Variance 2.5 Sample Variance 2.7
Kurtosis -1.2 Kurtosis -1.687242798
Skewness 0 Skewness 0.518420528
Range 4 Range 4
Minimum 1 Minimum 2
Maximum 5 Maximum 6
Sum 15 Sum 19
Count 5 Count 5
|