Hypothesis Test for Correlation

Mobile Features AB

Let's look at the hypothesis test for correlation, including the hypothesis test for correlation coefficient, the hypothesis test for negative correlation and the null hypothesis for correlation test.

Get started

Millions of flashcards designed to help you ace your studies

Sign up for free

Achieve better grades quicker with Premium

PREMIUM
Karteikarten Spaced Repetition Lernsets AI-Tools Probeklausuren Lernplan Erklärungen Karteikarten Spaced Repetition Lernsets AI-Tools Probeklausuren Lernplan Erklärungen
Kostenlos testen

Geld-zurück-Garantie, wenn du durch die Prüfung fällst

Review generated flashcards

Sign up for free
You have reached the daily AI limit

Start learning or create your own AI flashcards

StudySmarter Editorial Team

Team Hypothesis Test for Correlation Teachers

  • 9 minutes reading time
  • Checked by StudySmarter Editorial Team
Save Article Save Article
Sign up for free to save, edit & create flashcards.
Save Article Save Article
  • Fact Checked Content
  • Last Updated: 31.08.2022
  • 9 min reading time
Contents
Contents
  • Fact Checked Content
  • Last Updated: 31.08.2022
  • 9 min reading time
  • Content creation process designed by
    Lily Hulatt Avatar
  • Content cross-checked by
    Gabriel Freitas Avatar
  • Content quality checked by
    Gabriel Freitas Avatar
Sign up for free to save, edit & create flashcards.
Save Article Save Article

Jump to a key chapter

    What is the hypothesis test for correlation coefficient?

    When given a sample of bivariate data (data which include two variables), it is possible to calculate how linearly correlated the data are, using a correlation coefficient.

    The product moment correlation coefficient (PMCC) describes the extent to which one variable correlates with another. In other words, the strength of the correlation between two variables. The PMCC for a sample of data is denoted by r, while the PMCC for a population is denoted by ρ.

    The PMCC is limited to values between -1 and 1 (included).

    • Ifr = 1, there is a perfect positive linear correlation. All points lie on a straight line with a positive gradient, and the higher one of the variables is, the higher the other.

    • Ifr = 0, there is no linear correlation between the variables.

    • If r =-1, there is a perfect negative linear correlation. All points lie on a straight line with a negative gradient, and the higher one of the variables is, the lower the other.

    Correlation is not equivalent to causation, but a PMCC close to 1 or -1 can indicate that there is a higher likelihood that two variables are related.

    statistics bivariate data correlation null positive negative graphs StudySmarter Bivariate data with no correlation, positive correlation, and negative correlation

    The PMCC should be able to be calculated using a graphics calculator by finding the regression line of y on x, and hence finding r (this value is automatically calculated by the calculator), or by using the formular=SxySxxSyy, which is in the formula booklet. The closer r is to 1 or -1, the stronger the correlation between the variables, and hence the more closely associated the variables are. You need to be able to carry out hypothesis tests on a sample of bivariate data to determine if we can establish a linear relationship for an entire population. By calculating the PMCC, and comparing it to a critical value, it is possible to determine the likelihood of a linear relationship existing.

    What is the hypothesis test for negative correlation?

    To conduct a hypothesis test, a number of keywords must be understood:

    • Null hypothesis ( H0): the hypothesis assumed to be correct until proven otherwise

    • Alternative hypothesis ( H1): the conclusion made ifH0 is rejected.

    • Hypothesis test: a mathematical procedure to examine a value of a population parameter proposed by the null hypothesis compared to the alternative hypothesis.

    • Test statistic: is calculated from the sample and tested in cumulative probability tables or with the normal distribution as the last part of the significance test.

    • Critical region: the range of values that lead to the rejection of the null hypothesis.

    • Significance level: the actual significance level is the probability of rejectingH0 when it is in fact true.

    The null hypothesis is also known as the 'working hypothesis'. It is what we assume to be true for the purpose of the test, or until proven otherwise.

    The alternative hypothesis is what is concluded if the null hypothesis is rejected. It also determines whether the test is one-tailed or two-tailed.

    A one-tailed test allows for the possibility of an effect in one direction, while two-tailed tests allow for the possibility of an effect in two directions, in other words, both in the positive and the negative directions. Method: A series of steps must be followed to determine the existence of a linear relationship between 2 variables. 1. Write down the null and alternative hypotheses (H0 and H1). The null hypothesis is alwaysρ =0, while the alternative hypothesis depends on what is asked in the question. Both hypotheses must be stated in symbols only (not in words).

    2. Using a calculator, work out the value of the PMCC of the sample data, r .

    3. Use the significance level and sample size to figure out the critical value. This can be found in the PMCC table in the formula booklet.

    4. Take the absolute value of the PMCC and r, and compare these to the critical value. If the absolute value is greater than the critical value, the null hypothesis should be rejected. Otherwise, the null hypothesis should be accepted.

    5. Write a full conclusion in the context of the question. The conclusion should be stated in full: both in statistical language and in words reflecting the context of the question. A negative correlation signifies that the alternative hypothesis is rejected: the lack of one variable correlates with a stronger presence of the other variable, whereas, when there is a positive correlation, the presence of one variable correlates with the presence of the other.

    How to interpret results based on the null hypothesis

    From the observed results (test statistic), a decision must be made, determining whether to reject the null hypothesis or not.

    hypothesis test for correlation probability of observed result studysmarterImage: Repapetilto CC BY-SA 3.0,

    Hypothesis Test for Correlation two-tailed test StudySmarterTwo-tailed test applied to normal distribution. Image: public domain

    Both the one-tailed and two-tailed tests are shown at the 5% level of significance. However, the 5% is distributed in both the positive and negative side in the two-tailed test, and solely on the positive side in the one-tailed test.

    From the null hypothesis, the result could lie anywhere on the graph. If the observed result lies in the shaded area, the test statistic is significant at 5%, in other words, we rejectH0. Therefore,H0 could actually be true but it is still rejected. Hence, the significance level, 5%, is the probability thatH0 is rejected even though it is true, in other words, the probability thatH0 is incorrectly rejected. When H0 is rejected, H1(the alternative hypothesis) is used to write the conclusion.

    We can define the null and alternative hypotheses for one-tailed and two-tailed tests:

    For a one-tailed test:

    • H0: ρ=0 : H1 ρ>0 or
    • H0: ρ=0 : H1 ρ<0

    For a two-tailed test:

    • H0: ρ=0: H1 ρ 0

    Let us look at an example of testing for correlation.

    12 students sat two biology tests: one was theoretical and the other was practical. The results are shown in the table.

    Score in theoretical test, t5971120461712101516
    Score in practical test, p689132098171481718

    a) Find the product moment correlation coefficient for this data, to 3 significant figures.

    b) A teacher claims that students who do well in the theoretical test tend to do well in the practical test. Test this claim at the 0.05 level of significance, clearly stating your hypotheses.

    a) Using a calculator, we find the PMCC (enter the data into two lists and calculate the regression line. the PMCC will appear). r = 0.935 to 3 sign. figures

    b) We are testing for a positive correlation, since the claim is that a higher score in the theoretical test is associated with a higher score in the practical test. We will now use the five steps we previously looked at.

    1. State the null and alternative hypotheses. H0: ρ = 0 and H1: ρ > 0

    2. Calculate the PMCC. From part a), r = 0.935

    3. Figure out the critical value from the sample size and significance level. The sample size, n, is 12. The significance level is 5%. The hypothesis is one-tailed since we are only testing for positive correlation. Using the table from the formula booklet, the critical value is shown to be cv = 0.4973

    4. The absolute value of the PMCC is 0.935, which is larger than 0.4973. Since the PMCC is larger than the critical value at the 5% level of significance, we can reach a conclusion.

    5. Since the PMCC is larger than the critical value, we choose to reject the null hypothesis. We can conclude that there is significant evidence to support the claim that students who do well in the theoretical biology test also tend to do well in the practical biology test.

    Let us look at a second example.

    A tetrahedral die (four faces) is rolled 40 times and 6 'ones' are observed. Is there any evidence at the 10% level that the probability of a score of 1 is less than a quarter?

    The expected mean is 10 =40×14. The question asks whether the observed result (test statistic 6 is unusually low.

    We now follow the same series of steps.

    1. State the null and alternative hypotheses. H0: ρ = 0 and H1: ρ <0.25

    2. We cannot calculate the PMCC since we are only given data for the frequency of 'ones'.

    3. A one-tailed test is required ( ρ < 0.25) at the 10% significance level. We can convert this to a binomial distribution in which X is the number of 'ones' so X~B(40, 0.25), we then use the cumulative binomial tables. The observed value is X = 6. To P(X6 'ones' in 40 rolls)=0.0962.

    4. Since 0.0962, or 9.62% <10%, the observed result lies in the critical region.

    5. We reject and accept the alternative hypothesis. We conclude that there is evidence to show that the probability of rolling a 'one' is less than14

    Hypothesis Test for Correlation - Key takeaways

    • The Product Moment Correlation Coefficient (PMCC), or r, is a measure of how strongly related 2 variables are. It ranges between -1 and 1, indicating the strength of a correlation.
    • The closer r is to 1 or -1 the stronger the (positive or negative) correlation between two variables.
    • The null hypothesis is the hypothesis that is assumed to be correct until proven otherwise. It states that there is no correlation between the variables.
    • The alternative hypothesis is that which is accepted when the null hypothesis is rejected. It can be either one-tailed (looking at one outcome) or two-tailed (looking at both outcomes – positive and negative).
    • If the significance level is 5%, this means that there is a 5% chance that the null hypothesis is incorrectly rejected.

    ImagesOne-tailed test: https://en.wikipedia.org/w/index.php?curid=35569621

    Learn faster with the 0 flashcards about Hypothesis Test for Correlation

    Sign up for free to gain access to all our flashcards.

    Hypothesis Test for Correlation
    Frequently Asked Questions about Hypothesis Test for Correlation

    Is the Pearson correlation a hypothesis test?

    Yes. The Pearson correlation produces a PMCC value, or r  value, which indicates the strength of the relationship between two variables.

    Can we test a hypothesis with correlation?

    Yes. Correlation is not equivalent to causation, however we can test hypotheses to determine whether a correlation (or association) exists between two variables.

    How do you set up the hypothesis test for correlation?

    You need a null (p = 0) and alternative hypothesis. The PMCC, or r value must be calculated, based on the sample data. Based on the significance level and sample size, the critical value can be worked out from a table of values in the formula booklet. Finally the r value and critical value can be compared to determine which hypothesis is accepted.

    Save Article
    How we ensure our content is accurate and trustworthy?

    At StudySmarter, we have created a learning platform that serves millions of students. Meet the people who work hard to deliver fact based content as well as making sure it is verified.

    Content Creation Process:
    Lily Hulatt Avatar

    Lily Hulatt

    Digital Content Specialist

    Lily Hulatt is a Digital Content Specialist with over three years of experience in content strategy and curriculum design. She gained her PhD in English Literature from Durham University in 2022, taught in Durham University’s English Studies Department, and has contributed to a number of publications. Lily specialises in English Literature, English Language, History, and Philosophy.

    Get to know Lily
    Content Quality Monitored by:
    Gabriel Freitas Avatar

    Gabriel Freitas

    AI Engineer

    Gabriel Freitas is an AI Engineer with a solid experience in software development, machine learning algorithms, and generative AI, including large language models’ (LLMs) applications. Graduated in Electrical Engineering at the University of São Paulo, he is currently pursuing an MSc in Computer Engineering at the University of Campinas, specializing in machine learning topics. Gabriel has a strong background in software engineering and has worked on projects involving computer vision, embedded AI, and LLM applications.

    Get to know Gabriel

    Discover learning materials with the free StudySmarter app

    Sign up for free
    1
    About StudySmarter

    StudySmarter is a globally recognized educational technology company, offering a holistic learning platform designed for students of all ages and educational levels. Our platform provides learning support for a wide range of subjects, including STEM, Social Sciences, and Languages and also helps students to successfully master various tests and exams worldwide, such as GCSE, A Level, SAT, ACT, Abitur, and more. We offer an extensive library of learning materials, including interactive flashcards, comprehensive textbook solutions, and detailed explanations. The cutting-edge technology and tools we provide help students create their own learning materials. StudySmarter’s content is not only expert-verified but also regularly updated to ensure accuracy and relevance.

    Learn more
    StudySmarter Editorial Team

    Team Math Teachers

    • 9 minutes reading time
    • Checked by StudySmarter Editorial Team
    Save Explanation Save Explanation

    Study anywhere. Anytime.Across all devices.

    Sign-up for free

    Sign up to highlight and take notes. It’s 100% free.

    Join over 22 million students in learning with our StudySmarter App

    The first learning app that truly has everything you need to ace your exams in one place

    • Flashcards & Quizzes
    • AI Study Assistant
    • Study Planner
    • Mock-Exams
    • Smart Note-Taking
    Join over 22 million students in learning with our StudySmarter App
    Sign up with Email