The p-value and rejecting the null (for one- and two-tail tests)

 
 
The p-value and rejecting the null blog post.jpeg
 
 
 

What is the p-value?

The pp-value (or the observed level of significance) is the smallest level of significance at which you can reject the null hypothesis, assuming the null hypothesis is true.

You can also think about the pp-value as the total area of the region of rejection. Remember that in a one-tailed test, the region of rejection is consolidated into one tail, whereas in a two-tailed test, the rejection region is split between two tails.

Krista King Math.jpg

Hi! I'm krista.

I create online courses to help you rock your math class. Read more.

 

So, as you might expect, calculating the pp-value as the area of the rejection region will be slightly different depending on whether we’re using a two-tailed test or a one-tailed test, and whether the one-tailed test is an upper-tail test or lower-tail test.


Calculating the pp-value

For a one-tailed, lower-tail test

For a one-tailed test, first calculate your zz-test statistic. For a lower-tail test, zz will be negative. Look up the zz-value in a zz-table, and the value you find in the body of the table represents the area under the probability distribution curve to the left of your negative zz-value.

For instance, assume you found z=1.46z=-1.46. In a zz-table, you find

 
table of negative z-values
 

So 0.07210.0721 is the area under the curve to the left of z=1.46z=-1.46, and this is the pp-value also. So p=0.0721p=0.0721.

 
p-value for a negative z-score
 

For a one-tailed, upper-tail test

For a one-tailed test, first calculate your zz-test statistic. For an upper-tail test, zz will be positive. Look up the zz-value in a zz-table, and the value you find in the body of the table represents the area under the probability distribution curve to the left of your positive zz-value.

For instance, assume you found z=1.46z=1.46. In a zz-table, you find

 
table of positive z-values
 

But in an upper-tail test, you’re interested in the area to the right of the zz-value, not the area to the left. To find the area to the right, you need to subtract the value in the zz-table from 11.

10.9279=0.07211-0.9279=0.0721

So 0.07210.0721 is the area under the curve to the right of z=1.46z=1.46, and this is the pp-value also. So p=0.0721p=0.0721.

 
p-value for a positive z-score
 

For a two-tailed test

For a two-tailed test, first calculate your zz-test statistic. For an two-tail test, zz could be either positive or negative. Look up the zz-value in a -table, and the value you find in the body of the table represents the area under the probability distribution curve to the left of your zz-value.

For instance, assume you found z=1.23z=1.23. In a zz-table, you find

 
table of positive z-scores
 

But for a positive zz-value, you’re interested in the area to the right of the zz-value, not the area to the left. To find the area to the right, you need to subtract the value in the zz-table from 11.

10.8907=0.10931-0.8907=0.1093

So 0.10930.1093 is the area under the curve to the right of z=1.23z=1.23. Because this is a two-tail test, the region of rejection is not only the 10.93%10.93\% of area under the upper tail, but also the symmetrical 10.93%10.93\% of area under the lower tail. So we’ll double 0.10930.1093 to get 2(0.1093)=0.21862(0.1093)=0.2186, and this is the pp-value also. So p=0.2186p=0.2186.

 
p-value for a two-tail test
 

How to reject the null hypothesis

The reason we’ve gone through all this work to understand the pp-value is because using a pp-value is a really quick way to decide whether or not to reject the null hypothesis.

Whether or not you should reject H0H_0 can be determined by the relationship between the α\alpha level and the pp-value.

If pαp\leq \alpha, reject the null hypothesis

If p>αp>\alpha, do not reject the null hypothesis

In our earlier examples, we found

p=0.0721p=0.0721 for the lower-tail one-tailed test

p=0.0721p=0.0721 for the upper-tail one-tailed test

p=0.2186p=0.2186 for the two-tailed test

With these in mind, let’s say for instance you set the confidence level of your hypothesis test at 90%90\%, which is the same as setting the α\alpha level at α=0.10\alpha=0.10. In that case,

p=0.0721α=0.10p=0.0721\leq\alpha=0.10

p=0.2186>α=0.10p=0.2186>\alpha=0.10

So we would have rejected the null hypothesis for both one-tailed tests, but we would have failed to reject the null in the two-tailed test. If, however, we’d picked a more rigorous α=0.05\alpha=0.05 or α=0.01\alpha=0.01, we would have failed to reject the null hypothesis every time.

Significance

The significance (or statistical significance) of a test is the probability of obtaining your result by chance. The less likely it is that we obtained a result by chance, the more significant our results.

Hopefully by now it’s not too surprising by now that all of these are equivalent statements:

  • The finding is significant at the 0.010.01 level

  • The confidence level is 99%99\%

  • The Type I error rate is 0.010.01

  • The alpha level is 0.010.01, α=0.01\alpha=0.01

  • The area of the rejection region is 0.010.01

  • The pp-value is 0.010.01, p=0.01p=0.01

  • There’s a 11 in 100100 chance of getting a result as, or more, extreme as this one

The smaller the pp-value, or the smaller the alpha value, or the lower the Type I error rate, and the smaller the region of rejection, the higher the confidence level, and the less likely it is that you got your result by chance.

In other words, an alpha level of 0.100.10 (or a pp-value of 0.100.10, or a confidence level of 90%90\%) is a lower bar to clear. At that significance level, there’s a 11 in 1010 chance that the result we got was just by chance. And therefore there’s a 11 in 1010 chance that we’ll reject the null hypothesis when we really shouldn’t have, thinking that we provided support for the alternative hypothesis when we shouldn’t have.

But a stricter alpha level of 0.010.01 (or a pp-value of 0.010.01, or a confidence level of 99%99\%) is a higher bar to clear. At that significance level, there’s only a 11 in 100100 chance that the result we got was just by chance. And therefore there’s only a 11 in 100100 chance that we’ll reject the null hypothesis when we really shouldn’t have, thinking that we provided support for the alternative hypothesis when we shouldn’t have.

If we find a result that clears the bar we’ve set for ourselves, then we reject the null hypothesis and we say that the finding is significant at the pp-value that we find. Otherwise, we fail to reject the null.

 
 

How to use the p-value to determine whether or not you can reject the null hypothesis


 
Krista King Math Signup.png
 
Probability & Statistics course.png

Take the course

Want to learn more about Probability & Statistics? I have a step-by-step course for that. :)