"Master Hypothesis Testing in Field Experiments"
Hypothesis testing in field experiments is a statistical approach to evaluate cause-and-effect relationships by analyzing real-world data to determine the significance of observed outcomes. It involves defining null and alternative hypotheses, designing randomized experiments, collecting data, and using statistical tests like t-tests or regression for result interpretation.
|
||||||||||||||||||
The Path of Statistical InferenceFrom a Small Sample to a Big Conclusion 🎯
The GoalWe want to understand a whole **population**, but we can only study a small **sample**. Statistical inference is the science of using that sample data to make educated guesses about the population. 👥
Population (Everyone) →
🧑🤝🧑
Sample (A Small Group) 🌉
The Bridge: Central Limit TheoremIf we take many random samples and plot their average values, they form a predictable bell curve called a **sampling distribution**. This allows us to use the properties of the normal distribution to make inferences. 📜
The Framework: Hypothesis TestingThis is the formal process for testing a claim. 1️⃣ State HypothesesDefine the Null (H₀, no effect) and Alternative (Hₐ, an effect) hypotheses. 2️⃣ Set the StandardChoose a significance level (α), usually 5% (0.05). 3️⃣ Analyze DataCalculate a test statistic from your sample data. 4️⃣ Make a DecisionCompare your result (p-value) to your standard (α). ⚖️
The VerdictThe **p-value** is the probability of seeing your data if the null hypothesis is true. We compare it to alpha (α) to make a decision. IF p-value ≤ α 💥 Reject the Null Hypothesis (The result is statistically significant) IF p-value > α 🤷 Fail to Reject the Null (The result is not statistically significant) ⚠️
The Risks: ErrorsSince we're dealing with probability, we can make two kinds of mistakes.
📏
The UncertaintyA **confidence interval** gives a range of plausible values for the true population parameter, quantifying the uncertainty around our sample estimate. Sample Mean: 105
99
111
95% Confidence Interval: [99, 111] We are 95% confident the true population mean lies between 99 and 111. |
||||||||||||||||||
2-causal-inference 3-hypothesis-testing 4-covariates 5-one-sided-compliance