Topics Covered in this Session
- Definition and Characteristics of Correlational Research
- Appropriateness/Limitations
- Correlation Coefficients/Regression Analysis
Correlational Research
Correlational research is used to explore co-varying relationships between two or more variables. A simple definition of a co-varying relationship is as one variable changes so does the other variable(s). The purpose of correlational research is to:
- To identify variables that relate to one each other (i.e. is there a relationship between family income and grade point average; is there a relationship between part time employment and grade point average);
- To make predictions of one variable from another variable (i.e. can I.Q. test scores be used to predict student achievement; can SAT scores be used to predict college grade point averages);
- To examine possible cause and effect relationships between one variable and another.
A caution has to be advised when considering correlational research and cause and effect. Major researchers such as B.F. Skinner posit that while we can make many conclusions identifying a relationship between one or more variables, establishing cause and effect is very difficult and maybe impossible due to the myriad interactions of many variables in social science research.
In education-based correlational studies, data is frequently collected using standardized measures such as test scores. Report presentations almost always use hypotheses in the form of "No relationship exists between variable X and variable Y." Data analysis using correlation coefficients is generally quantitative. Rather than rich descriptive narrative as we might see in descriptive or ethnographic studies, correlation presentations tend to be succinct relying on statistical analyses of correlation coefficients and regression. Of the various quantitative methodologies, correlational research is among the easiest to design and apply. For this reason, it is popular and frequently used in conjunction with other research methodologies.
Data Sources
- Raw scores such as standardized test scores.
- Measures such as grade point averages.
- Dichotomous data, data which has two possibilities such as male/female or pass/fail.
Research Tools
- Standardized tests are the most common tools for doing correlational studies.
- Direct measurement techniques have also been used for specialized studies such as monitoring student pulse rates to determine stress on test performance.
Procedural Considerations
- Null hypothesis is frequently used.
- Research questions sometimes stated instead of hypotheses.
- Statistics used tend to be measures of relationship such as: Pearson Product-Moment Coefficient, Spearman Rank Order Coefficient, Phi Correlation Coefficient, Regression.
Report Presentation
- Reports are almost always quantitative rather than qualitative presentations.
- Statistical data is provided in the form of correlation coefficients as mentioned above.
Statistical Analysis in Correlational Research
- Correlation is the relationship between two or more variables or sets of data. It is expressed in the form of a coefficient with +1.00 indicating a perfect positive correlation; -1.00 indicating a perfect inverse correlation; 0.00 indicating a complete lack of a relationship.
- Pearson's Product Moment Coefficient (r) is the most often used and most precise coefficient; and generally used with continuous variables.
- Spearman Rank Order Coefficient (p) is a form of the Pearson's Product Moment Coefficient which can be used with ordinal or ranked data.
- Phi Correlation Coefficient is a form of the Pearson's Product Moment Coefficient which can be used with dichotomous variables (i.e. pass/fail, male/female).
- Regression the use of correlation to plot a line illustrating the linear relationship of two variables X and Y. It is based on the slope of the line which is represented by the formula : Y = a + bX where
- Y = dependent variable
- X = independent variable
- b = slope of the line
- a = constant or Y intercept
Regression is used extensively in making predictions based on finding unknown Y values from known X values. (i.e. predicting college GPA from known high school grade point averages.)
- Multiple Regression is the same as regression except that it attempts to predict Y from two or more independent X variables (i.e. predicting college GPA from known high school grade point averages and SAT scores.) The formula for multiple regression is an extension of the linear regression formula:
Y = a + b1 X1 + b2 X2 + ....
FOR MORE INFORMATION ON THE TOPICS COVERED IN THIS SESSION, PLEASE REFER TO
CHAPTER 1 OF A.G. PICCIANO "EDUCATIONAL RESEARCH PRIMER". |