# Survey (Data Analysis)

Posted on 2013-01-28

Hello Math/Stats Experts:

I'm in the midst of developing a survey/questionnaire (for an academic project). Before continuing the survey development, I'd like to get a better understanding of the analytic methods I could use once I have received the data.

----------Break----------

Background on research:

- I have 8 independent variables (IV).

- I have 1 dependent variable (DV).

- I'd like to test whether there is (or is not) a positive/negative correlation between each independent variable and the dependent variable.

- I will NOT test any possible interactions between the independent variables.

----------Break----------

Background on survey concept:

- For each independent variable and for the dependent variable, I will ask several (maybe 3) questions.

- Most (if not all) questions will use a Likert Scale (values of 1-5). As far as I know, this makes it a "categorical" measurement scale (i.e., "interval" data). [Let me know if you disagree]

- I will have n survey respondents (SR).

----------Break----------

Current survey concept... let's say I have 27 questions... 3 for each of the 8 IVs... and 3 for the single DV.

[Btw, the Likert scale values below are completely made up... I just punched in numbers]

| Q# | Response of SR_sub_1 | Response of SR_sub_2 | Response of SR_sub_n |
|---|---|---|---|
| 1 | 5 | 5 | 4 |
| 2 | 4 | 3 | 5 |
| 3 | 5 | 4 | 4 |
| ... | ... | ... | ... |
| 25 | 3 | 4 | 4 |
| 26 | 2 | 3 | 3 |
| 27 | 4 | 3 | 4 |

----------Break----------

Current concept of data analysis:

- Questions 1:3 pertain to independent variable #1.
  - SR_sub_1 answered them as follows: 5, 4, 5
  - SR_sub_2 answered them as follows: 5, 3, 4
  - SR_sub_n answered them as follows: 4, 5, 4

- Questions 25:27 pertain to the dependent variable.
  - SR_sub_1 answered them as follows: 3, 2, 4
  - SR_sub_2 answered them as follows: 4, 3, 3
  - SR_sub_n answered them as follows: 4, 3, 4

Now, I was thinking to use "index scores" (i.e., averages) for each. If so, I'd have the following data:

- Index score for independent variable #1.
  - SR_sub_1 = average of (5, 4, 5) = 4.67
  - SR_sub_2 = average of (5, 3, 4) = 4.00
  - SR_sub_n = average of (4, 5, 4) = 4.33

- Index score for dependent variable.
  - SR_sub_1 = average of (3, 2, 4) = 3.00
  - SR_sub_2 = average of (4, 3, 3) = 3.33
  - SR_sub_n = average of (4, 3, 4) = 3.67

[Again, the survey respondents' "values" are merely made up right now... their individual values are not important at this moment.]
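To make the index-score idea concrete, here is a minimal Python sketch of the averaging step. The dictionary layout and the names `responses`, `iv1`, `dv`, and `index_scores` are only my assumptions for illustration, and the numbers are the made-up ones from above.

```python
from statistics import fmean

# Made-up responses from the example above. Only the questions shown in the
# post are included: Q1-Q3 (independent variable #1) and Q25-Q27 (dependent
# variable). The keys "iv1" and "dv" are hypothetical labels.
responses = {
    "SR_sub_1": {"iv1": [5, 4, 5], "dv": [3, 2, 4]},
    "SR_sub_2": {"iv1": [5, 3, 4], "dv": [4, 3, 3]},
    "SR_sub_n": {"iv1": [4, 5, 4], "dv": [4, 3, 4]},
}


def index_scores(answers):
    """Average the item responses of each variable into one index score."""
    return {name: round(fmean(items), 2) for name, items in answers.items()}


# One (IV #1 index, DV index) pair per respondent.
scores = {sr: index_scores(answers) for sr, answers in responses.items()}

for sr, s in scores.items():
    print(f"{sr}: IV1 index = {s['iv1']:.2f}, DV index = {s['dv']:.2f}")
```

With the full survey, each respondent would presumably get 9 such index scores (8 IVs plus the DV) instead of the 2 shown here.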

My questions:

- Is the approach of those "index scores" a valid one?

- If yes, do I simply plot the points [4.67, 3.00], [4.00, 3.33], and [4.33, 3.67] in a scatter plot?

- If so, what is the recommended statistical analysis (ANOVA, MDA, Chi-Square) to determine whether or not there is a positive/negative correlation (not causation) between each of the 8 independent variables and the dependent variable?
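For illustration only, this is roughly what I imagine computing for one IV/DV pair: a plain Pearson correlation coefficient on the index scores. Pearson is my own assumption here, not a settled choice. It treats the averaged scores as interval data, and a rank-based coefficient might well be more appropriate for Likert items, which is part of what I'm asking. The helper name `pearson_r` is hypothetical.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and the two sums of squared deviations.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


# Made-up index scores from above, one entry per respondent.
iv1_index = [4.67, 4.00, 4.33]
dv_index = [3.00, 3.33, 3.67]

r = pearson_r(iv1_index, dv_index)
print(f"r = {r:.2f}")  # sign gives the direction, magnitude the strength
```

With only three made-up respondents the number itself means nothing; the same call would be repeated for each of the 8 IVs once real data is in.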

A thousand thanks in advance,

EEH