Joseph V. Casillas, PhD
Rutgers University
Last update: 2025-02-15
More about z-scores
Probability
Test scores are N(80, 5)
If we standardize the scores: N(0, 1)…
Probability
Probability
1. What is the probability that \(Z\) takes values between 0 and 1.9?
The probability that \(Z\) takes values between 0 and 1.9 is
0.4712834
Probability
2. What is the probability that \(Z\) takes values between 0 and 2?
The probability that \(Z\) takes values between 0 and 2 is
0.4772499
Probability
3. What is the probability that \(Z\) takes values between 0 and 2.33?
The probability that \(Z\) takes values between 0 and 2.33 is
0.4900969
Probability
4. Find the probability that \(Z\) takes values bewteen 1 and 2.
The probability that \(Z\) takes values between 1 and 2 is
0.1359051
Probalitiy of (1) = (2) - (3)
Probability
5. What is the probability that \(Z\) > 2?
The probability that \(Z\) > 2 is
0.0227501
Probalitiy of (1) = (2) - (3)
Probability
6. What is the probability that \(Z\) < 2?
The probability that \(Z\) < 2 is
0.9772499
Probalitiy of (1) = 0.5 + (3)
Probability
7. What is the probability that −2 < \(Z\) < 0?
The probability that −2 < \(Z\) < 0 is
0.4772499
Probalitiy of (1) = (2)
Probability
8. What is the probability that −1 < \(Z\) < 2?
The probability that −1 < \(Z\) < 2 is
0.8185946
Probalitiy of (1) = (2) + (3)
Huh?
…if you take many samples from a population, and calculate the averages of each one, the collection of those averages will be normally distributed… and it doesn’t matter what the shape of the source distribution is.
Ex 1 - IQ
The population distribution of IQ in the general public is known to have a mean of 100 with a standard deviation of 15.1
Ex 1 - IQ
Ex 1 - IQ
15 / sqrt(36)
= 2.5.Ex 1 - IQ
\[\color{red}{z} = \frac{\color{blue}{\bar{x}} - \color{green}{\mu}}{\color{purple}{\sigma} \div \color{orange}{\sqrt[]{\color{grey}{n}}}}\]
Where z is derived by calculating the difference between the sample mean and the population mean and dividing it by the population standard deviation divided by the square root of the sample size.
Ex 2 - Exams
Assume that the test scores of a college entrance exam fits a normal distribution. Furthermore, the mean test score is 72, and the standard deviation is 15. What is the percentage of students scoring 84 or more in the exam?
In a trial, the
null hypothesis (H0)
is innocence.
The objective is to see if the evidence (the data) contradicts this hypothesis, supporting an alternative hypothesis (H1), guilt.
NHST
If there is no evidence,
the accused cannot be
found guilty.
We fail to reject H0
If there is evidence of guilt beyond a reasonable doubt, the accused is found guilty.
We reject H0
So what’s a p-value?
p-value: the probability of obtaining
your data, if H0 is TRUE.
…and “significant”?
Significance: obtaining a p-value below
an arbitrary threshold
High p-value: your data are likely with a true null.
Low p-value: your data are unlikely with a true null.What is power?
The probability of (correctly) rejecting
H0 when H1 is true.
Hypothesis testing in context
Reality
|
|||
---|---|---|---|
H0 is true | H0 is false | ||
Decision | Fail to reject H0 | Correct | Type II error |
Decision | Reject H0 | Type I error | Correct |
Which test do I use?
There are many…
name | group | score |
---|---|---|
Juan | g1 | 250 |
Jorge | g1 | 340 |
Jacobo | g2 | 460 |
José | g2 | 200 |
name | test1 | test2 |
---|---|---|
Miguel | 35 | 67 |
Marta | 50 | 46 |
Marisa | 90 | 86 |
Marcos | 78 | 91 |
name | group | score |
---|---|---|
Juan | g1 | 250 |
Jorge | g1 | 340 |
Jacobo | g2 | 460 |
José | g2 | 200 |
# A tibble: 1 × 3
g1 g2 diff
<dbl> <dbl> <dbl>
1 295 330 -35
Welch Two Sample t-test
data: score by group
t = -0.25442, df = 1.2363, p-value = 0.8349
alternative hypothesis: true difference in means between group g1 and group g2 is not equal to 0
95 percent confidence interval:
-1161.468 1091.468
sample estimates:
mean in group g1 mean in group g2
295 330
name | test1 | test2 |
---|---|---|
Miguel | 35 | 67 |
Marta | 50 | 46 |
Marisa | 90 | 86 |
Marcos | 78 | 91 |
# A tibble: 1 × 3
test1 test2 diff
<dbl> <dbl> <dbl>
1 63.2 72.5 -9.25
Paired t-test
data: prd_samp$test1 and prd_samp$test2
t = -1.0785, df = 3, p-value = 0.3598
alternative hypothesis: true mean difference is not equal to 0
95 percent confidence interval:
-36.54539 18.04539
sample estimates:
mean difference
-9.25