📋 Formulas
The formula sheet I typically provide for my students can be found below. You can also click on the tab below to find the relevant formulas for each section of the lecture notes.
Formula Sheet Download Formula Sheet
ˉx=1nΣxi mean
s=√Σ(x−ˉx)2n−1 sample standard deviation
s2 variance
ˆy=b0+b1x estimated regression line
b1=rsysx slope of regression line
b0=¯y−b1¯x y-intercept of regression line
R2=explained variationtotal variation=(r)2⋅100% coefficient of determination
P(A∩B)=P(AandB)=P(A∣B)·P(B)
P(A∪B)=P(A orB)=P(A)+P(B)−P(A and B)
P(A∣B)=P(Aand B)P(B)
Discrete Random Variables
μX=E(X)=Σxi·P(xi) mean of a discrete random variable
σX=√Σ(xi−μX)2·P(xi)=√Σ(x2i·P(xi))−μ2X standard deviation of a discrete random variable
Binomial Random Variables
If X has a binomial distribution with parameters n and p, then
P(X=k)=nCk·pk·(1−p)n−k
μX=E(X)=np mean for a binomial random variable
σX=√np(1−p) standard deviation for a binomial random variable
Normal Random Variables
z=x−μσ direct calculation
x=z(σ)+μ inverse calculation
ˆp∼AN(p,√pqn) if
np and nq≥10
μˆp=p mean of sample proportion
σˆp=√pqn standard deviation of sample proportion
Inferential Statistics
statistic±margin of error=statistic±critical value⋅standard error=ˆp±z∗√ˆpˆqn confidence interval for one proportion
n=(z*)2ˆpˆq(ME)2 sample size required for given CL and ME
z=ˆp−p0√p0q0n test statistic for one proportion
ˉx∼AN(μ,σ√n) if
X∼N or n≥30
μˉx=μ mean of sample mean
σˉx=σ√n standard deviation of sample mean
Inferential Statistics
statistic±margin of error=statistic±critical value⋅standard error=ˉx±t∗s√n confidence interval for one mean
t=ˉx−μ0s√n test statistic for one mean
degree of freedom for t is given by df=n−1
^p1−^p2∼AN(p1−p2,√p1q1n1+p2q2n2) if
np and nq≥10
μ^p1−^p2=p1−p2 mean of difference in sample proportions
σ^p1−^p2=√p1q1n1+p2q2n2 standard deviation of difference in sample proportion
Inferential Statistics
statistic±critical value⋅standard error=^p1−^p2±z∗√^p1^q1n1+^p2^q2n2 confidence interval for difference in proportions
z=^p1−^p2√^pc^qc(1n1+1n2) test statistic for difference in proportions where
p1=p2 is null hypothesis and
^pc=X1+X2n1+n2
Sampling Distribution for Difference of Means
¯x1−¯x2∼AN(μ1−μ2,√σ21n1+σ22n2) if
np and nq≥10
μ¯x1−¯x2=μ1−μ2 mean of difference in sample means
σ¯x1−¯x2=√σ21n1+σ22n2 standard deviation of difference in sample means
Inferential Statistics
statistic±critical value⋅standard error=¯x1−¯x2±t∗√s21n1+s22n2 confidence interval for difference in proportions
t=¯x1−¯x2√s21n1+s22n2) test statistic for difference in proportions where
μ1=μ2
degree of freedom for t is given by super complicated formula
Links to an external site. or approximated by df=min(n1−1,n2−1)
χ2=∑(O−E)2E test statistic for GoF, where O = observed and E = expected
degree of freedom for χ2 is given by
df=k−1 where
k is the number of levels of the categorical variable
Chi-Square Test for Independence / Homogeneity
χ2=∑(O−E)2E test statistic for GoF, where O = observed and E = expected
E=(row total)(col total)grand total
degree of freedom for χ2 is given by
df=(r−1)(c−1) where
r is the number of rows and
c is the number of columns
ANOVA
The formulas are gross... I will add them later.
The formulas for inference on regression are coming soon!