ExcelBanter - Regression results

ExcelBanter (https://www.excelbanter.com/)

- Excel Programming (https://www.excelbanter.com/excel-programming/)

- - Regression results (https://www.excelbanter.com/excel-programming/359568-re-regression-results.html)

Regression results

Hi Jerry,

I usually have somewhere around 375 data points, which goes up by 1 each
week, as I am tracking a stock closing, but only on a weekly basis. Yes, it
is the 6th degree polynomial that I add to the graph. I tried to figure out
the individual data points represented by the graph, using the formula
provided by the graph, but the "point" was not very accurate and I am given
the understanding that this was due to rounding errors and the precion of
Excel, prior to Office 2003.

Maybe this has all changed? I had also run across a third party add-on that
was claiming to have increased the accuracy, but did not purchase it and have
not heard anyone else mention it or vouch for it. But if I could find a way
to figure out the individual points represented by the graph, I would be
interested in doing that.

Thanks for your help.

--
David

"Jerry W. Lewis" wrote:

If by "6th degree", you mean a 6th degree polynomial, then you should be
conserned about whether you have sampled a wide enough range of data to be
able to reliably estimate the parameters. Even with independent parameters
and/or adequate data range, validation of the formula could be interesting.
Remember the famous quote of von Neumann "With four parameters I can fit an
elephant, and with five I can make him wiggle his trunk."

Jerry

"David" wrote:

I just started reading your thread and have become interested. I was at one
time trying to create data points from an anaysis of Stock closiings, that
are represented by a regression (6th degree) line. I was not able to use the
formula supplied in previous versions of Excel, prior to Ver 2003. But I
would be interested in trying this again, since i have the new version. I
would like to actually create data points as respesented by the regression
line, fi that is possible?

Thanks,
--
David

"Mitch" wrote:

I need to run regression analyses (linear & nonlinear) on many data sets. I
have observed that the resulting equations generated in a chart trend line
and that generated by the linest() function are frequently very different. I
have read Tushar Mehta's explanation of the problem, but need advice on which
is more accurate.

Regression results

How widely spaced are these 375 x-values?

If the x-values are 1,2,3,... then fitting a 6th degree polynomial is a very
difficult problem numerically (condition number ~3E31). The direct algorithm
used by LINEST prior to 2003 would likely produce meaningless results. The
chart trendline and LINEST might be accurate to a few figures, but the
problem could be challenging for them as well. To reliably fit a problem
this numerically difficult, you might need a package that uses quadruple
precision (I don't know of any statistics programs that do) or arbitrary
precision (cf.
http://groups.google.com/group/micro...fdea49d5c999a7
). You could probably do this in double precision by fitting orthogonal
polynomials, assuming that the x-values themselves are really accurate enough
to be worth the effort.

Your mention of "the formula provided by the graph" raises another issue:
while the chart trendline (unchanged with 2003) has always been quite good
numerically (better than almost all dedicated statistical packages, except
where they fit by orthogonal polynomials), by default very few figures of
this high quality fit get displayed on the graph. You need to right click on
the equation and change the numeric format to display scientific notation
with 14 decimal places.

Jerry

"David" wrote:

Hi Jerry,

I usually have somewhere around 375 data points, which goes up by 1 each
week, as I am tracking a stock closing, but only on a weekly basis. Yes, it
is the 6th degree polynomial that I add to the graph. I tried to figure out
the individual data points represented by the graph, using the formula
provided by the graph, but the "point" was not very accurate and I am given
the understanding that this was due to rounding errors and the precion of
Excel, prior to Office 2003.

Maybe this has all changed? I had also run across a third party add-on that
was claiming to have increased the accuracy, but did not purchase it and have
not heard anyone else mention it or vouch for it. But if I could find a way
to figure out the individual points represented by the graph, I would be
interested in doing that.

Thanks for your help.

--
David

"Jerry W. Lewis" wrote:

If by "6th degree", you mean a 6th degree polynomial, then you should be
conserned about whether you have sampled a wide enough range of data to be
able to reliably estimate the parameters. Even with independent parameters
and/or adequate data range, validation of the formula could be interesting.
Remember the famous quote of von Neumann "With four parameters I can fit an
elephant, and with five I can make him wiggle his trunk."

Jerry

"David" wrote:

I just started reading your thread and have become interested. I was at one
time trying to create data points from an anaysis of Stock closiings, that
are represented by a regression (6th degree) line. I was not able to use the
formula supplied in previous versions of Excel, prior to Ver 2003. But I
would be interested in trying this again, since i have the new version. I
would like to actually create data points as respesented by the regression
line, fi that is possible?

Thanks,
--
David

Regression results

Hi Again Jerry,
What I am recording are weekly closes of stocks and the precision displayed
is to the 100ths. And it is graphed based on the time line of a week. I do
not know what orthogonal
polynomials are, so I would not know how to implement that, but the 6th
degree polynomial that is displayed appears very good to me. My understanding
is that it is based on a least squares and that the numbers are so large that
Excel has a difficult time calculating out as far as the equation requires?
Thansk again for your help.
--
David

"Jerry W. Lewis" wrote:

How widely spaced are these 375 x-values?

If the x-values are 1,2,3,... then fitting a 6th degree polynomial is a very
difficult problem numerically (condition number ~3E31). The direct algorithm
used by LINEST prior to 2003 would likely produce meaningless results. The
chart trendline and LINEST might be accurate to a few figures, but the
problem could be challenging for them as well. To reliably fit a problem
this numerically difficult, you might need a package that uses quadruple
precision (I don't know of any statistics programs that do) or arbitrary
precision (cf.
http://groups.google.com/group/micro...fdea49d5c999a7
). You could probably do this in double precision by fitting orthogonal
polynomials, assuming that the x-values themselves are really accurate enough
to be worth the effort.

Your mention of "the formula provided by the graph" raises another issue:
while the chart trendline (unchanged with 2003) has always been quite good
numerically (better than almost all dedicated statistical packages, except
where they fit by orthogonal polynomials), by default very few figures of
this high quality fit get displayed on the graph. You need to right click on
the equation and change the numeric format to display scientific notation
with 14 decimal places.

Jerry

"David" wrote:

Hi Jerry,

I usually have somewhere around 375 data points, which goes up by 1 each
week, as I am tracking a stock closing, but only on a weekly basis. Yes, it
is the 6th degree polynomial that I add to the graph. I tried to figure out
the individual data points represented by the graph, using the formula
provided by the graph, but the "point" was not very accurate and I am given
the understanding that this was due to rounding errors and the precion of
Excel, prior to Office 2003.

Maybe this has all changed? I had also run across a third party add-on that
was claiming to have increased the accuracy, but did not purchase it and have
not heard anyone else mention it or vouch for it. But if I could find a way
to figure out the individual points represented by the graph, I would be
interested in doing that.

Thanks for your help.

--
David

"Jerry W. Lewis" wrote:

If by "6th degree", you mean a 6th degree polynomial, then you should be
conserned about whether you have sampled a wide enough range of data to be
able to reliably estimate the parameters. Even with independent parameters
and/or adequate data range, validation of the formula could be interesting.
Remember the famous quote of von Neumann "With four parameters I can fit an
elephant, and with five I can make him wiggle his trunk."

Jerry

"David" wrote:

I just started reading your thread and have become interested. I was at one
time trying to create data points from an anaysis of Stock closiings, that
are represented by a regression (6th degree) line. I was not able to use the
formula supplied in previous versions of Excel, prior to Ver 2003. But I
would be interested in trying this again, since i have the new version. I
would like to actually create data points as respesented by the regression
line, fi that is possible?

Thanks,
--
David

Regression results

Presumably the stock closes are what you want to predict. That is largely
irrelevant for determining the numeric difficulty of the problem. The issue
is what you are trying to use to predict them (a 6th degree polynomial in
what?), and whether you have sampled a wide enough range to have any hope of
accurately estimating those coefficients.

As I also tried to communicate, the numerical problems are not an Excel
issue, per se, since the accuracy of the Excel chart trendline and Excel
2003's LINEST function is comparable to that of dedicated statistics
programs. Accurately fitting 6th degree polynomials is often very difficult
numerically.

Potentially even more problematic is the question of whether accurately
fitting this 6th degree polynomial would tell you anything useful. Unless
you have some theoretical basis for expecting that the form of your model (a
6th degree polynomial in whatever) is right, then accurately fitting the past
gives no assurance that the model will accurately predict the future, even in
the near term. The more empirical parameters you have to include to fit the
past, greater the chance that the fitted model will have little or no
predictive power (recall the von Neumann quote).

Jerry

"David" wrote:

Hi Again Jerry,
What I am recording are weekly closes of stocks and the precision displayed
is to the 100ths. And it is graphed based on the time line of a week. I do
not know what orthogonal
polynomials are, so I would not know how to implement that, but the 6th
degree polynomial that is displayed appears very good to me. My understanding
is that it is based on a least squares and that the numbers are so large that
Excel has a difficult time calculating out as far as the equation requires?
Thansk again for your help.
--
David

"Jerry W. Lewis" wrote:

How widely spaced are these 375 x-values?

If the x-values are 1,2,3,... then fitting a 6th degree polynomial is a very
difficult problem numerically (condition number ~3E31). The direct algorithm
used by LINEST prior to 2003 would likely produce meaningless results. The
chart trendline and LINEST might be accurate to a few figures, but the
problem could be challenging for them as well. To reliably fit a problem
this numerically difficult, you might need a package that uses quadruple
precision (I don't know of any statistics programs that do) or arbitrary
precision (cf.
http://groups.google.com/group/micro...fdea49d5c999a7
). You could probably do this in double precision by fitting orthogonal
polynomials, assuming that the x-values themselves are really accurate enough
to be worth the effort.

Your mention of "the formula provided by the graph" raises another issue:
while the chart trendline (unchanged with 2003) has always been quite good
numerically (better than almost all dedicated statistical packages, except
where they fit by orthogonal polynomials), by default very few figures of
this high quality fit get displayed on the graph. You need to right click on
the equation and change the numeric format to display scientific notation
with 14 decimal places.

Jerry

"David" wrote:

Hi Jerry,

I usually have somewhere around 375 data points, which goes up by 1 each
week, as I am tracking a stock closing, but only on a weekly basis. Yes, it
is the 6th degree polynomial that I add to the graph. I tried to figure out
the individual data points represented by the graph, using the formula
provided by the graph, but the "point" was not very accurate and I am given
the understanding that this was due to rounding errors and the precion of
Excel, prior to Office 2003.

Maybe this has all changed? I had also run across a third party add-on that
was claiming to have increased the accuracy, but did not purchase it and have
not heard anyone else mention it or vouch for it. But if I could find a way
to figure out the individual points represented by the graph, I would be
interested in doing that.

Thanks for your help.

--
David

"Jerry W. Lewis" wrote:

If by "6th degree", you mean a 6th degree polynomial, then you should be
conserned about whether you have sampled a wide enough range of data to be
able to reliably estimate the parameters. Even with independent parameters
and/or adequate data range, validation of the formula could be interesting.
Remember the famous quote of von Neumann "With four parameters I can fit an
elephant, and with five I can make him wiggle his trunk."

Jerry

"David" wrote:

I just started reading your thread and have become interested. I was at one
time trying to create data points from an anaysis of Stock closiings, that
are represented by a regression (6th degree) line. I was not able to use the
formula supplied in previous versions of Excel, prior to Ver 2003. But I
would be interested in trying this again, since i have the new version. I
would like to actually create data points as respesented by the regression
line, fi that is possible?

Thanks,
--
David

Regression results

Hi Jerry,

I am not trying to predict, I am trying to find the specific data points
associated with a 6th degree polynomial. I have the two things only a Date
and a Close. These are compiled on a weekly basis for a stock each Monday.
Today I will add a close and a date to a stock after the market closes.
Nothing more or less.

I sometimes chart that stock, very simple, the dates and the closings. I can
add to that chart a 6th degree poly. But I do not know what the data points
are that are plotted and that is what I would like to know. I usually have
about 5 years worth of data, but I commonly only plot 2 years worth of data.
The data I use is actual closes for the stock on each Monday. After adding
the 6th degree poly, I would like to know where the line is on the chart for
any specific Monday and would like to add a column to the simple File that
says "Poly Close."

Date Close
1/1/1999 10.00
.... ...
.... ...
4/24/06 15.00
(Above is what I have)

(What I would like is)
Date Close PloyClose
1/1/1999 10.00 9.88
.... ... ...
.... ... ....
.... ... ...
4/24/06 15.00 14.65

--
David

"Jerry W. Lewis" wrote:

Presumably the stock closes are what you want to predict. That is largely
irrelevant for determining the numeric difficulty of the problem. The issue
is what you are trying to use to predict them (a 6th degree polynomial in
what?), and whether you have sampled a wide enough range to have any hope of
accurately estimating those coefficients.

As I also tried to communicate, the numerical problems are not an Excel
issue, per se, since the accuracy of the Excel chart trendline and Excel
2003's LINEST function is comparable to that of dedicated statistics
programs. Accurately fitting 6th degree polynomials is often very difficult
numerically.

Potentially even more problematic is the question of whether accurately
fitting this 6th degree polynomial would tell you anything useful. Unless
you have some theoretical basis for expecting that the form of your model (a
6th degree polynomial in whatever) is right, then accurately fitting the past
gives no assurance that the model will accurately predict the future, even in
the near term. The more empirical parameters you have to include to fit the
past, greater the chance that the fitted model will have little or no
predictive power (recall the von Neumann quote).

Jerry

"David" wrote:

Hi Again Jerry,
What I am recording are weekly closes of stocks and the precision displayed
is to the 100ths. And it is graphed based on the time line of a week. I do
not know what orthogonal
polynomials are, so I would not know how to implement that, but the 6th
degree polynomial that is displayed appears very good to me. My understanding
is that it is based on a least squares and that the numbers are so large that
Excel has a difficult time calculating out as far as the equation requires?
Thansk again for your help.
--
David

"Jerry W. Lewis" wrote:

How widely spaced are these 375 x-values?

If the x-values are 1,2,3,... then fitting a 6th degree polynomial is a very
difficult problem numerically (condition number ~3E31). The direct algorithm
used by LINEST prior to 2003 would likely produce meaningless results. The
chart trendline and LINEST might be accurate to a few figures, but the
problem could be challenging for them as well. To reliably fit a problem
this numerically difficult, you might need a package that uses quadruple
precision (I don't know of any statistics programs that do) or arbitrary
precision (cf.
http://groups.google.com/group/micro...fdea49d5c999a7
). You could probably do this in double precision by fitting orthogonal
polynomials, assuming that the x-values themselves are really accurate enough
to be worth the effort.

Your mention of "the formula provided by the graph" raises another issue:
while the chart trendline (unchanged with 2003) has always been quite good
numerically (better than almost all dedicated statistical packages, except
where they fit by orthogonal polynomials), by default very few figures of
this high quality fit get displayed on the graph. You need to right click on
the equation and change the numeric format to display scientific notation
with 14 decimal places.

Jerry

"David" wrote:

Hi Jerry,

I usually have somewhere around 375 data points, which goes up by 1 each
week, as I am tracking a stock closing, but only on a weekly basis. Yes, it
is the 6th degree polynomial that I add to the graph. I tried to figure out
the individual data points represented by the graph, using the formula
provided by the graph, but the "point" was not very accurate and I am given
the understanding that this was due to rounding errors and the precion of
Excel, prior to Office 2003.

Maybe this has all changed? I had also run across a third party add-on that
was claiming to have increased the accuracy, but did not purchase it and have
not heard anyone else mention it or vouch for it. But if I could find a way
to figure out the individual points represented by the graph, I would be
interested in doing that.

Thanks for your help.

--
David

"Jerry W. Lewis" wrote:

If by "6th degree", you mean a 6th degree polynomial, then you should be
conserned about whether you have sampled a wide enough range of data to be
able to reliably estimate the parameters. Even with independent parameters
and/or adequate data range, validation of the formula could be interesting.
Remember the famous quote of von Neumann "With four parameters I can fit an
elephant, and with five I can make him wiggle his trunk."

Jerry

"David" wrote:

I just started reading your thread and have become interested. I was at one
time trying to create data points from an anaysis of Stock closiings, that
are represented by a regression (6th degree) line. I was not able to use the
formula supplied in previous versions of Excel, prior to Ver 2003. But I
would be interested in trying this again, since i have the new version. I
would like to actually create data points as respesented by the regression
line, fi that is possible?

Thanks,
--
David

Regression results

I have been studying the market with different statistical functions for many
years ; binomial law, standard deviation, normal law, etc. I've also
downloaded many programs trying to understand the behavior of the market and
it's not easy, just predicting the direction of the market is a task even if
at the end the risk/reward could generate lot's of profits or losses; how
many people brokers, individuals had predicted the black Monday in 1987 ?
Anyways why a 6th degree, why not 5th or 7th degree ? I believe you would
extract a minimum or a maximum like in a second degree equation (-b/2a for x)
if negative maximum (downslide) and if positive (uptrend) so this would give
i believe a range over time ????? i do hope i'm making sense here......!!!!
this is order normal conditions but what if in 5 days the barrel of oil would
go up or down by $3.00, what if the $us would trade at 115 yens, if interest
rates would increase/decrease of 1%, if Microsoft or whatever stock on the
Dow would increase/decrease his earnings of $1 how would the market react ????

So far one of the best thing i came over was a multiple regression (LINEST
function) not only you can play with the amount of periods, variables but
also you can play with the standard deviation of the equation and better yet,
you can see what impact each variable does in your equation. Believe me
currencies have a lot of impact for example the price of oil is in $US
however when converting into foreign currencies that price could be
attractive/expensive, same thing for gold, interest rates, options on
indexes, etc.

I remember in college doing a paper on a simple regression unemployment rate
and gallons of beer sold, it was at 0.9 over a long period period of term
(more then 30 months) it was logical that people since they dont work stayed
home and drank beer however people had lower income !!!! now if you would
join a third variable the correlation would increase or decrease a lot,
however if a fourth variable would be had up the correlation would stabilise
around 0.6. Even if you would change the 3rd or the 4th one, it would stay
around 0.6.

it seems also that you have a hard time with small numbers probably because
when using the date, Excel uses a number instead; 39000 represents Oct
10/2006, use Oct 10/1906 and it will recalculate it at 2475, so small numbers
will come lot less and reformat it just to see the date without the year.
Also dont forget to use the format scientific because if you have a variable
like 1.5666 E-25 it takes a mighty big number just to came to 0,01 (power
22-23-24).

Since June 2005, by using multiple regressions i have a compounded annual
rate of return of 16.05% wich would place me in Canada among in the first
best quartile of managers even if my worksheet uses 60 megs and i believe if
i fine tuned variables, periods, ANOVA, decision trees, study sectors, have
the time also to compare forecasts and results i would out perform the
general trend of the market.

On the uptrend it's easy to buy stocks and out perform the index but it
comes harder and harder to say; time to take to my profits and run away and
stay away, on the side line until a stock reaches a buttom and buy it back
when it reaches the buttom unless you're into put options.

last week i have sold some units of a Japaneese fund that tracks the Nikkei
index that i had bought in October 2006 at $5.5 and sold them at $6.80
because the probability of reaching well over $7 was decreasing but on the
other hand the probability of increases in the price of gold, oil, bonds,
high-tech, small caps was slowly increasing even if the market was high so
with the income i have reinvested 50% into a fund that tracks the canadian
index and kept 50% on the sideline in case if.........

"David" wrote:

Hi Jerry,

I am not trying to predict, I am trying to find the specific data points
associated with a 6th degree polynomial. I have the two things only a Date
and a Close. These are compiled on a weekly basis for a stock each Monday.
Today I will add a close and a date to a stock after the market closes.
Nothing more or less.

I sometimes chart that stock, very simple, the dates and the closings. I can
add to that chart a 6th degree poly. But I do not know what the data points
are that are plotted and that is what I would like to know. I usually have
about 5 years worth of data, but I commonly only plot 2 years worth of data.
The data I use is actual closes for the stock on each Monday. After adding
the 6th degree poly, I would like to know where the line is on the chart for
any specific Monday and would like to add a column to the simple File that
says "Poly Close."

Date Close
1/1/1999 10.00
... ...
... ...
4/24/06 15.00
(Above is what I have)

(What I would like is)
Date Close PloyClose
1/1/1999 10.00 9.88
... ... ...
... ... ....
... ... ...
4/24/06 15.00 14.65

--
David

"Jerry W. Lewis" wrote:

Presumably the stock closes are what you want to predict. That is largely
irrelevant for determining the numeric difficulty of the problem. The issue
is what you are trying to use to predict them (a 6th degree polynomial in
what?), and whether you have sampled a wide enough range to have any hope of
accurately estimating those coefficients.

As I also tried to communicate, the numerical problems are not an Excel
issue, per se, since the accuracy of the Excel chart trendline and Excel
2003's LINEST function is comparable to that of dedicated statistics
programs. Accurately fitting 6th degree polynomials is often very difficult
numerically.

Potentially even more problematic is the question of whether accurately
fitting this 6th degree polynomial would tell you anything useful. Unless
you have some theoretical basis for expecting that the form of your model (a
6th degree polynomial in whatever) is right, then accurately fitting the past
gives no assurance that the model will accurately predict the future, even in
the near term. The more empirical parameters you have to include to fit the
past, greater the chance that the fitted model will have little or no
predictive power (recall the von Neumann quote).

Jerry

"David" wrote:

Hi Again Jerry,
What I am recording are weekly closes of stocks and the precision displayed
is to the 100ths. And it is graphed based on the time line of a week. I do
not know what orthogonal
polynomials are, so I would not know how to implement that, but the 6th
degree polynomial that is displayed appears very good to me. My understanding
is that it is based on a least squares and that the numbers are so large that
Excel has a difficult time calculating out as far as the equation requires?
Thansk again for your help.
--
David

"Jerry W. Lewis" wrote:

How widely spaced are these 375 x-values?

If the x-values are 1,2,3,... then fitting a 6th degree polynomial is a very
difficult problem numerically (condition number ~3E31). The direct algorithm
used by LINEST prior to 2003 would likely produce meaningless results. The
chart trendline and LINEST might be accurate to a few figures, but the
problem could be challenging for them as well. To reliably fit a problem
this numerically difficult, you might need a package that uses quadruple
precision (I don't know of any statistics programs that do) or arbitrary
precision (cf.
http://groups.google.com/group/micro...fdea49d5c999a7
). You could probably do this in double precision by fitting orthogonal
polynomials, assuming that the x-values themselves are really accurate enough
to be worth the effort.

Your mention of "the formula provided by the graph" raises another issue:
while the chart trendline (unchanged with 2003) has always been quite good
numerically (better than almost all dedicated statistical packages, except
where they fit by orthogonal polynomials), by default very few figures of
this high quality fit get displayed on the graph. You need to right click on
the equation and change the numeric format to display scientific notation
with 14 decimal places.

Jerry

"David" wrote:

Hi Jerry,

I usually have somewhere around 375 data points, which goes up by 1 each
week, as I am tracking a stock closing, but only on a weekly basis. Yes, it
is the 6th degree polynomial that I add to the graph. I tried to figure out
the individual data points represented by the graph, using the formula
provided by the graph, but the "point" was not very accurate and I am given
the understanding that this was due to rounding errors and the precion of
Excel, prior to Office 2003.

Maybe this has all changed? I had also run across a third party add-on that
was claiming to have increased the accuracy, but did not purchase it and have
not heard anyone else mention it or vouch for it. But if I could find a way
to figure out the individual points represented by the graph, I would be
interested in doing that.

Thanks for your help.

--
David

"Jerry W. Lewis" wrote:

If by "6th degree", you mean a 6th degree polynomial, then you should be
conserned about whether you have sampled a wide enough range of data to be
able to reliably estimate the parameters. Even with independent parameters
and/or adequate data range, validation of the formula could be interesting.
Remember the famous quote of von Neumann "With four parameters I can fit an
elephant, and with five I can make him wiggle his trunk."

Jerry

"David" wrote:

I just started reading your thread and have become interested. I was at one
time trying to create data points from an anaysis of Stock closiings, that
are represented by a regression (6th degree) line. I was not able to use the
formula supplied in previous versions of Excel, prior to Ver 2003. But I
would be interested in trying this again, since i have the new version. I
would like to actually create data points as respesented by the regression
line, fi that is possible?

Thanks,
--
David

Regression results

Hi Again,

Yes many variables that can be looked at, but my analysis is fairly simple.
It is an experiment and work in progress. I still remain interested in find
the "data" points. I bottom fish now, but it appears there might be a better
way, especially in an up trend. Some of the downs are easy to spot, this type
of market is much harder I think.
--
David

"The Gourou" wrote:

I have been studying the market with different statistical functions for many
years ; binomial law, standard deviation, normal law, etc. I've also
downloaded many programs trying to understand the behavior of the market and
it's not easy, just predicting the direction of the market is a task even if
at the end the risk/reward could generate lot's of profits or losses; how
many people brokers, individuals had predicted the black Monday in 1987 ?
Anyways why a 6th degree, why not 5th or 7th degree ? I believe you would
extract a minimum or a maximum like in a second degree equation (-b/2a for x)
if negative maximum (downslide) and if positive (uptrend) so this would give
i believe a range over time ????? i do hope i'm making sense here......!!!!
this is order normal conditions but what if in 5 days the barrel of oil would
go up or down by $3.00, what if the $us would trade at 115 yens, if interest
rates would increase/decrease of 1%, if Microsoft or whatever stock on the
Dow would increase/decrease his earnings of $1 how would the market react ????

So far one of the best thing i came over was a multiple regression (LINEST
function) not only you can play with the amount of periods, variables but
also you can play with the standard deviation of the equation and better yet,
you can see what impact each variable does in your equation. Believe me
currencies have a lot of impact for example the price of oil is in $US
however when converting into foreign currencies that price could be
attractive/expensive, same thing for gold, interest rates, options on
indexes, etc.

I remember in college doing a paper on a simple regression unemployment rate
and gallons of beer sold, it was at 0.9 over a long period period of term
(more then 30 months) it was logical that people since they dont work stayed
home and drank beer however people had lower income !!!! now if you would
join a third variable the correlation would increase or decrease a lot,
however if a fourth variable would be had up the correlation would stabilise
around 0.6. Even if you would change the 3rd or the 4th one, it would stay
around 0.6.

it seems also that you have a hard time with small numbers probably because
when using the date, Excel uses a number instead; 39000 represents Oct
10/2006, use Oct 10/1906 and it will recalculate it at 2475, so small numbers
will come lot less and reformat it just to see the date without the year.
Also dont forget to use the format scientific because if you have a variable
like 1.5666 E-25 it takes a mighty big number just to came to 0,01 (power
22-23-24).

Since June 2005, by using multiple regressions i have a compounded annual
rate of return of 16.05% wich would place me in Canada among in the first
best quartile of managers even if my worksheet uses 60 megs and i believe if
i fine tuned variables, periods, ANOVA, decision trees, study sectors, have
the time also to compare forecasts and results i would out perform the
general trend of the market.

On the uptrend it's easy to buy stocks and out perform the index but it
comes harder and harder to say; time to take to my profits and run away and
stay away, on the side line until a stock reaches a buttom and buy it back
when it reaches the buttom unless you're into put options.

last week i have sold some units of a Japaneese fund that tracks the Nikkei
index that i had bought in October 2006 at $5.5 and sold them at $6.80
because the probability of reaching well over $7 was decreasing but on the
other hand the probability of increases in the price of gold, oil, bonds,
high-tech, small caps was slowly increasing even if the market was high so
with the income i have reinvested 50% into a fund that tracks the canadian
index and kept 50% on the sideline in case if.........

"David" wrote:

Hi Jerry,

I am not trying to predict, I am trying to find the specific data points
associated with a 6th degree polynomial. I have the two things only a Date
and a Close. These are compiled on a weekly basis for a stock each Monday.
Today I will add a close and a date to a stock after the market closes.
Nothing more or less.

I sometimes chart that stock, very simple, the dates and the closings. I can
add to that chart a 6th degree poly. But I do not know what the data points
are that are plotted and that is what I would like to know. I usually have
about 5 years worth of data, but I commonly only plot 2 years worth of data.
The data I use is actual closes for the stock on each Monday. After adding
the 6th degree poly, I would like to know where the line is on the chart for
any specific Monday and would like to add a column to the simple File that
says "Poly Close."

Date Close
1/1/1999 10.00
... ...
... ...
4/24/06 15.00
(Above is what I have)

(What I would like is)
Date Close PloyClose
1/1/1999 10.00 9.88
... ... ...
... ... ....
... ... ...
4/24/06 15.00 14.65

--
David

"Jerry W. Lewis" wrote:

Presumably the stock closes are what you want to predict. That is largely
irrelevant for determining the numeric difficulty of the problem. The issue
is what you are trying to use to predict them (a 6th degree polynomial in
what?), and whether you have sampled a wide enough range to have any hope of
accurately estimating those coefficients.

As I also tried to communicate, the numerical problems are not an Excel
issue, per se, since the accuracy of the Excel chart trendline and Excel
2003's LINEST function is comparable to that of dedicated statistics
programs. Accurately fitting 6th degree polynomials is often very difficult
numerically.

Potentially even more problematic is the question of whether accurately
fitting this 6th degree polynomial would tell you anything useful. Unless
you have some theoretical basis for expecting that the form of your model (a
6th degree polynomial in whatever) is right, then accurately fitting the past
gives no assurance that the model will accurately predict the future, even in
the near term. The more empirical parameters you have to include to fit the
past, greater the chance that the fitted model will have little or no
predictive power (recall the von Neumann quote).

Jerry

"David" wrote:

Hi Again Jerry,
What I am recording are weekly closes of stocks and the precision displayed
is to the 100ths. And it is graphed based on the time line of a week. I do
not know what orthogonal
polynomials are, so I would not know how to implement that, but the 6th
degree polynomial that is displayed appears very good to me. My understanding
is that it is based on a least squares and that the numbers are so large that
Excel has a difficult time calculating out as far as the equation requires?
Thansk again for your help.
--
David

"Jerry W. Lewis" wrote:

How widely spaced are these 375 x-values?

If the x-values are 1,2,3,... then fitting a 6th degree polynomial is a very
difficult problem numerically (condition number ~3E31). The direct algorithm
used by LINEST prior to 2003 would likely produce meaningless results. The
chart trendline and LINEST might be accurate to a few figures, but the
problem could be challenging for them as well. To reliably fit a problem
this numerically difficult, you might need a package that uses quadruple
precision (I don't know of any statistics programs that do) or arbitrary
precision (cf.
http://groups.google.com/group/micro...fdea49d5c999a7
). You could probably do this in double precision by fitting orthogonal
polynomials, assuming that the x-values themselves are really accurate enough
to be worth the effort.

Your mention of "the formula provided by the graph" raises another issue:
while the chart trendline (unchanged with 2003) has always been quite good
numerically (better than almost all dedicated statistical packages, except
where they fit by orthogonal polynomials), by default very few figures of
this high quality fit get displayed on the graph. You need to right click on
the equation and change the numeric format to display scientific notation
with 14 decimal places.

Jerry

"David" wrote:

Hi Jerry,

I usually have somewhere around 375 data points, which goes up by 1 each
week, as I am tracking a stock closing, but only on a weekly basis. Yes, it
is the 6th degree polynomial that I add to the graph. I tried to figure out
the individual data points represented by the graph, using the formula
provided by the graph, but the "point" was not very accurate and I am given
the understanding that this was due to rounding errors and the precion of
Excel, prior to Office 2003.

Maybe this has all changed? I had also run across a third party add-on that
was claiming to have increased the accuracy, but did not purchase it and have
not heard anyone else mention it or vouch for it. But if I could find a way
to figure out the individual points represented by the graph, I would be
interested in doing that.

Thanks for your help.

--
David

"Jerry W. Lewis" wrote:

If by "6th degree", you mean a 6th degree polynomial, then you should be
conserned about whether you have sampled a wide enough range of data to be
able to reliably estimate the parameters. Even with independent parameters
and/or adequate data range, validation of the formula could be interesting.
Remember the famous quote of von Neumann "With four parameters I can fit an
elephant, and with five I can make him wiggle his trunk."

Jerry

"David" wrote:

I just started reading your thread and have become interested. I was at one
time trying to create data points from an anaysis of Stock closiings, that
are represented by a regression (6th degree) line. I was not able to use the
formula supplied in previous versions of Excel, prior to Ver 2003. But I
would be interested in trying this again, since i have the new version. I
would like to actually create data points as respesented by the regression
line, fi that is possible?

Thanks,
--
David

Regression results

Folks ... a quick question - can you do a multi-variate non-linear regression
in Excel? ...the trendline clearly does that with one variable. But what if
you have multiple variables?

"David" wrote:

Hi Again,

Yes many variables that can be looked at, but my analysis is fairly simple.
It is an experiment and work in progress. I still remain interested in find
the "data" points. I bottom fish now, but it appears there might be a better
way, especially in an up trend. Some of the downs are easy to spot, this type
of market is much harder I think.
--
David

"The Gourou" wrote:

I have been studying the market with different statistical functions for many
years ; binomial law, standard deviation, normal law, etc. I've also
downloaded many programs trying to understand the behavior of the market and
it's not easy, just predicting the direction of the market is a task even if
at the end the risk/reward could generate lot's of profits or losses; how
many people brokers, individuals had predicted the black Monday in 1987 ?
Anyways why a 6th degree, why not 5th or 7th degree ? I believe you would
extract a minimum or a maximum like in a second degree equation (-b/2a for x)
if negative maximum (downslide) and if positive (uptrend) so this would give
i believe a range over time ????? i do hope i'm making sense here......!!!!
this is order normal conditions but what if in 5 days the barrel of oil would
go up or down by $3.00, what if the $us would trade at 115 yens, if interest
rates would increase/decrease of 1%, if Microsoft or whatever stock on the
Dow would increase/decrease his earnings of $1 how would the market react ????

So far one of the best thing i came over was a multiple regression (LINEST
function) not only you can play with the amount of periods, variables but
also you can play with the standard deviation of the equation and better yet,
you can see what impact each variable does in your equation. Believe me
currencies have a lot of impact for example the price of oil is in $US
however when converting into foreign currencies that price could be
attractive/expensive, same thing for gold, interest rates, options on
indexes, etc.

I remember in college doing a paper on a simple regression unemployment rate
and gallons of beer sold, it was at 0.9 over a long period period of term
(more then 30 months) it was logical that people since they dont work stayed
home and drank beer however people had lower income !!!! now if you would
join a third variable the correlation would increase or decrease a lot,
however if a fourth variable would be had up the correlation would stabilise
around 0.6. Even if you would change the 3rd or the 4th one, it would stay
around 0.6.

it seems also that you have a hard time with small numbers probably because
when using the date, Excel uses a number instead; 39000 represents Oct
10/2006, use Oct 10/1906 and it will recalculate it at 2475, so small numbers
will come lot less and reformat it just to see the date without the year.
Also dont forget to use the format scientific because if you have a variable
like 1.5666 E-25 it takes a mighty big number just to came to 0,01 (power
22-23-24).

Since June 2005, by using multiple regressions i have a compounded annual
rate of return of 16.05% wich would place me in Canada among in the first
best quartile of managers even if my worksheet uses 60 megs and i believe if
i fine tuned variables, periods, ANOVA, decision trees, study sectors, have
the time also to compare forecasts and results i would out perform the
general trend of the market.

On the uptrend it's easy to buy stocks and out perform the index but it
comes harder and harder to say; time to take to my profits and run away and
stay away, on the side line until a stock reaches a buttom and buy it back
when it reaches the buttom unless you're into put options.

last week i have sold some units of a Japaneese fund that tracks the Nikkei
index that i had bought in October 2006 at $5.5 and sold them at $6.80
because the probability of reaching well over $7 was decreasing but on the
other hand the probability of increases in the price of gold, oil, bonds,
high-tech, small caps was slowly increasing even if the market was high so
with the income i have reinvested 50% into a fund that tracks the canadian
index and kept 50% on the sideline in case if.........

"David" wrote:

Hi Jerry,

I am not trying to predict, I am trying to find the specific data points
associated with a 6th degree polynomial. I have the two things only a Date
and a Close. These are compiled on a weekly basis for a stock each Monday.
Today I will add a close and a date to a stock after the market closes.
Nothing more or less.

I sometimes chart that stock, very simple, the dates and the closings. I can
add to that chart a 6th degree poly. But I do not know what the data points
are that are plotted and that is what I would like to know. I usually have
about 5 years worth of data, but I commonly only plot 2 years worth of data.
The data I use is actual closes for the stock on each Monday. After adding
the 6th degree poly, I would like to know where the line is on the chart for
any specific Monday and would like to add a column to the simple File that
says "Poly Close."

Date Close
1/1/1999 10.00
... ...
... ...
4/24/06 15.00
(Above is what I have)

(What I would like is)
Date Close PloyClose
1/1/1999 10.00 9.88
... ... ...
... ... ....
... ... ...
4/24/06 15.00 14.65

--
David

"Jerry W. Lewis" wrote:

Presumably the stock closes are what you want to predict. That is largely
irrelevant for determining the numeric difficulty of the problem. The issue
is what you are trying to use to predict them (a 6th degree polynomial in
what?), and whether you have sampled a wide enough range to have any hope of
accurately estimating those coefficients.

As I also tried to communicate, the numerical problems are not an Excel
issue, per se, since the accuracy of the Excel chart trendline and Excel
2003's LINEST function is comparable to that of dedicated statistics
programs. Accurately fitting 6th degree polynomials is often very difficult
numerically.

Potentially even more problematic is the question of whether accurately
fitting this 6th degree polynomial would tell you anything useful. Unless
you have some theoretical basis for expecting that the form of your model (a
6th degree polynomial in whatever) is right, then accurately fitting the past
gives no assurance that the model will accurately predict the future, even in
the near term. The more empirical parameters you have to include to fit the
past, greater the chance that the fitted model will have little or no
predictive power (recall the von Neumann quote).

Jerry

"David" wrote:

Hi Again Jerry,
What I am recording are weekly closes of stocks and the precision displayed
is to the 100ths. And it is graphed based on the time line of a week. I do
not know what orthogonal
polynomials are, so I would not know how to implement that, but the 6th
degree polynomial that is displayed appears very good to me. My understanding
is that it is based on a least squares and that the numbers are so large that
Excel has a difficult time calculating out as far as the equation requires?
Thansk again for your help.
--
David

"Jerry W. Lewis" wrote:

How widely spaced are these 375 x-values?

If the x-values are 1,2,3,... then fitting a 6th degree polynomial is a very
difficult problem numerically (condition number ~3E31). The direct algorithm
used by LINEST prior to 2003 would likely produce meaningless results. The
chart trendline and LINEST might be accurate to a few figures, but the
problem could be challenging for them as well. To reliably fit a problem
this numerically difficult, you might need a package that uses quadruple
precision (I don't know of any statistics programs that do) or arbitrary
precision (cf.
http://groups.google.com/group/micro...fdea49d5c999a7
). You could probably do this in double precision by fitting orthogonal
polynomials, assuming that the x-values themselves are really accurate enough
to be worth the effort.

Your mention of "the formula provided by the graph" raises another issue:
while the chart trendline (unchanged with 2003) has always been quite good
numerically (better than almost all dedicated statistical packages, except
where they fit by orthogonal polynomials), by default very few figures of
this high quality fit get displayed on the graph. You need to right click on
the equation and change the numeric format to display scientific notation
with 14 decimal places.

Jerry

"David" wrote:

Hi Jerry,

I usually have somewhere around 375 data points, which goes up by 1 each
week, as I am tracking a stock closing, but only on a weekly basis. Yes, it
is the 6th degree polynomial that I add to the graph. I tried to figure out
the individual data points represented by the graph, using the formula
provided by the graph, but the "point" was not very accurate and I am given
the understanding that this was due to rounding errors and the precion of
Excel, prior to Office 2003.

Maybe this has all changed? I had also run across a third party add-on that
was claiming to have increased the accuracy, but did not purchase it and have
not heard anyone else mention it or vouch for it. But if I could find a way
to figure out the individual points represented by the graph, I would be
interested in doing that.

Thanks for your help.

--
David

"Jerry W. Lewis" wrote:

If by "6th degree", you mean a 6th degree polynomial, then you should be
conserned about whether you have sampled a wide enough range of data to be
able to reliably estimate the parameters. Even with independent parameters
and/or adequate data range, validation of the formula could be interesting.
Remember the famous quote of von Neumann "With four parameters I can fit an
elephant, and with five I can make him wiggle his trunk."

Jerry

"David" wrote:

I just started reading your thread and have become interested. I was at one
time trying to create data points from an anaysis of Stock closiings, that
are represented by a regression (6th degree) line. I was not able to use the
formula supplied in previous versions of Excel, prior to Ver 2003. But I
would be interested in trying this again, since i have the new version. I
would like to actually create data points as respesented by the regression
line, fi that is possible?

Thanks,
--
David