Simple formula for calculating R-Squared between 2 stocks
Hello Experts, I'm a Delphi programmer writing a bit of a financial analysis program for myself and I'm having trouble with the calculation of R-Squared between the daily closing prices of 2 financial instruments. I thought I found a good formula on the net for doing it, but it doesn't yield sensible results. I'm no advanced statistics experts, but doesn't R-Squared have to be a value between 0 and 1? The formula I used yields values like 7.11 and 10.67... not making much sense. Here is the formula:
Well, some definitions would put the correlation coefficient "r" in the range from -1 > +1.
So, yes, r squared would be in the range 0 > +1.
This much tracks.
If your "formula" yields other values then one would have to know the formula and the data being put into it wouldn't one?
0
shawn857Author Commented:
Thanks Fred, yes I've come across that too on the net that the r-squared is no more than the Correlation Coefficient squared. That's simple enough - I'm already calculation correctly the Correlation Coefficient... but on some sites it describes r-squared differently and gives a different calculation. It's said that r-squared indicates of the "reliability" of the correlation coefficient... but to me, simply taking the CorrCoef and squaring it doesn't really offer any more insight. So I guess I'm just a little confused...
Thanks
Shawn
P.S: I'm not sure what you mean by "This much tracks"...?
I meant by "this much tracks" is that we agree on terms as I've re-stated them.
But how in the world do you get the numbers you get? You haven't revealed the expressions being used. That does *not* track.
0
There are many ways to learn to code these days. From coding bootcamps like Flatiron School to online courses to totally free beginner resources. The best way to learn to code depends on many factors, but the most important one is you. See what course is best for you.
That's the one that yielded the crazy numbers. But really - if RSquared is nothing more than Correlation Coefficient (...which I'm already calculating) squared, then I don't think I'm going to bang my head against the wall about this any more!
Then I guess we might look at some real numbers because the correlation coefficient can't be greater than 1.0 in magnitude. I don't see anything wrong with the expressions.
While it may seem a bit simplisitic: Here is a web page where one can calculate some cases and envision how it should go;
Sometimes there are scaling factors that need to be taken into account.
When you do a help on Covariance (x,y)) and (Variance(x) in your programming language, what precisely are the equations that these functions boil down to. I just went through an exercise at work where in one language, the scaling factor were different than in a library.
0
shawn857Author Commented:
Guys I'm awfully sorry - I just found a grievous grievous error in my program I'm writing. This was responsible for the crazy RSquared numbers. I corrected this andt he formula I mentioned a few posts earlier seems to be now yielding sensible results. Sorry for the false alarm!
Cheers
Shawn
0
shawn857Author Commented:
Thank you!
0
Question has a verified solution.
Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.
At Springboard, we know how to get you a job in data science. With Springboard’s Data Science Career Track, you’ll master data science with a curriculum built by industry experts. You’ll work on real projects, and get 1-on-1 mentorship from a data scientist.
So, yes, r squared would be in the range 0 > +1.
This much tracks.
If your "formula" yields other values then one would have to know the formula and the data being put into it wouldn't one?