08.03.2008 at 05:36PM PDT, ID: 23617810
Calculating skewness and kurtosis

Asked by mjcoyne in Perl Programming Language, Math & Science, Probability & Statistics

The attached code produces the following output:

   Excel results:

   Standard Deviation  11.03035112
   Skewness             0.363338937
   Kurtosis            -0.070497874

   My results:

   Standard Deviation  11.0303511188038
   Skewness             0.314893745627018
   Kurtosis            -0.595823617828494

As you can see, my results for skewness and kurtosis are off from those produced by Excel (or SPSS 16 for Windows, for that matter) given the same data.

I've taken the equations for skewness and kurtosis from http://www.itl.nist.gov/div898/handbook/eda/section3/eda35b.htm, and I *think* I've reproduced them faithfully in Perl.

Can anyone spot the problem?
#!/usr/bin/perl -w
use strict;
 
my @data = qw(
    91.3698
    76.3382
    74.5692
    85.2957
    99.0112
    86.9900
    70.7837
    72.8340
    78.1644
    77.7472
    66.0627
    59.7810
    68.4793
    78.6103
    59.8621
);
 
my $excel = <<END;
   Excel results:
 
   Standard Deviation  11.03035112
   Skewness             0.363338937
   Kurtosis            -0.070497874
 
END
 
my $std_dev = sample_sd(\@data);
my $skewness = skewness(\@data);
my $kurtosis = kurtosis(\@data);
 
my $results = <<END;
   My results:
 
   Standard Deviation  $std_dev
   Skewness             $skewness
   Kurtosis            $kurtosis
 
END
 
print $excel, $results;
 
sub sample_sd {
    my $arrayref = shift;
    my ($sum, $sum_sq);
    foreach (@$arrayref) {
        $sum += $_;
    }
    my $mean = $sum/@$arrayref;
    foreach (@$arrayref) {
        $sum_sq += ($_ - $mean) ** 2;
    }
    return sqrt($sum_sq / $#{$arrayref}); # (N-1)
}
 
sub skewness {
    my $arrayref = shift;
    my ($sum, $sum_cubes);
    foreach (@$arrayref) {
        $sum += $_;
    }
    my $mean = $sum/@$arrayref;
    my $std_dev = sample_sd($arrayref);
    foreach (@$arrayref) {
        $sum_cubes += ($_ - $mean) ** 3;
    }
    return $sum_cubes / ($#{$arrayref} * ($std_dev ** 3));
}
 
sub kurtosis {
    my $arrayref = shift;
    my ($sum, $sum_fourth);
    foreach (@$arrayref) {
        $sum += $_;
    }
    my $mean = $sum/@$arrayref;
    my $sd = sample_sd($arrayref);
    foreach (@$arrayref) {
        $sum_fourth += ($_ - $mean) ** 4;
    }
    return $sum_fourth / ($#{$arrayref} * ($sd ** 4)) - 3;
}
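
One possible explanation, offered as a sketch and not as the accepted answer: the NIST page gives the plain moment-based definitions, whereas Excel's SKEW and KURT functions (and, as far as I know, SPSS) apply small-sample bias corrections. With z = (x - mean) / s and s the (N-1) standard deviation, the corrected forms are skew = n / ((n-1)(n-2)) * sum(z^3) and kurt = n(n+1) / ((n-1)(n-2)(n-3)) * sum(z^4) - 3(n-1)^2 / ((n-2)(n-3)). The helper names below (sum_z_power, excel_skew, excel_kurt) are made up for illustration and are not from the original post.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Sum of ((x - mean) / s) ** $power, where s is the sample (N-1) standard deviation.
sub sum_z_power {
    my ($data, $power) = @_;
    my $n = @$data;
    my $sum = 0;
    $sum += $_ for @$data;
    my $mean = $sum / $n;
    my $sum_sq = 0;
    $sum_sq += ($_ - $mean) ** 2 for @$data;
    my $sd = sqrt($sum_sq / ($n - 1));
    my $total = 0;
    $total += (($_ - $mean) / $sd) ** $power for @$data;
    return $total;
}

# Bias-corrected sample skewness (the form documented for Excel's SKEW); needs n >= 3.
sub excel_skew {
    my ($data) = @_;
    my $n = @$data;
    return $n / (($n - 1) * ($n - 2)) * sum_z_power($data, 3);
}

# Bias-corrected sample excess kurtosis (the form documented for Excel's KURT); needs n >= 4.
sub excel_kurt {
    my ($data) = @_;
    my $n = @$data;
    return $n * ($n + 1) / (($n - 1) * ($n - 2) * ($n - 3)) * sum_z_power($data, 4)
         - 3 * ($n - 1) ** 2 / (($n - 2) * ($n - 3));
}

# Same data as in the question.
my @data = qw(
    91.3698 76.3382 74.5692 85.2957 99.0112
    86.9900 70.7837 72.8340 78.1644 77.7472
    66.0627 59.7810 68.4793 78.6103 59.8621
);

printf "Skewness %.9f\n", excel_skew(\@data);
printf "Kurtosis %.9f\n", excel_kurt(\@data);
```

On the data above these corrected formulas should land on the Excel figures quoted in the question (about 0.363338937 and -0.070497874), which suggests the original subs are a faithful copy of the NIST formulas and simply compute a different estimator.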
08.03.2008 at 07:07PM PDT, ID: 22149544

About this solution

Zones: Perl Programming Language, Math & Science, Probability & Statistics
Solution Provided By: Adam314
Participating Experts: 1
Solution Grade: A