Why Standard Errors Change

OLS assumes: $$E[u_i \, u_j] = 0 \quad \text{for } i \neq j$$ But in panel or spatial data, **that is false.** Observations are correlated within: - The same property - The same census tract - The same county - The same time period - The same firm - The same school If you ignore that, your SEs are **too small** → false significance. **Clustering corrects this.** --- ## Intuition If shocks are correlated within group \$g\$: $$u_{ig} = \underbrace{\text{common component}}_{\text{shared within cluster}} + \underbrace{\text{idiosyncratic}}_{\text{independent across obs.}}$$ Then treating observations as independent **overcounts information.** Clustering says: *treat each cluster as the unit of independent variation.* --- ## What Happens When You Change Cluster Level? Suppose you estimate: $$\ln P_{it} = \beta \, \text{Shock}_{it} + \text{FE} + u_{it}$$ Now consider clustering at:

1. Property (PIN)

Allows arbitrary serial correlation within property over time.
SE usually ↑ relative to naive.

2. Census Tract

Allows correlation across different properties in same tract.
If crime shocks spill over spatially, this matters.
SE likely ↑ further.

3. County

Allows even broader spatial correlation.
SE ↑ more.

4. No Clustering

Assumes independence.
SE smallest. Usually wrong.

--- ## Why Larger Clusters Often Increase SE Because: $$\text{Var}(\hat{\beta}) \;\propto\; \frac{1}{G}$$ where \$G\$ = number of independent clusters (not \$N\$). | Cluster Level | # Clusters | SE | |---|---|---| | Property | Many | Smaller | | Tract | Fewer | Larger | | County | Few | Largest | The effective sample size becomes \$G\$, not \$N\$. --- ## Key Principle

You should cluster at the level where the identifying variation is correlated.

--- ## In the Crime–Housing Context The setting uses: - Spatial kernel exposure - Properties within tracts - Crime shocks at neighborhood level Errors are likely correlated within: - **Census tract** - Possibly community area - Possibly time × area So **clustering at tract** is reasonable.