Aronow, Samii & Wang (2020) Design-based inference for spatial experiments with interference, arXiv.

Setting

$N$ intervention points: $\mathcal{N} = \{1, \dots, N\}$ . Each point $i \in \mathcal{N}$ resides in $\mathcal{X} \subset \mathbb{R}^2$ (lattitude and longitude).
An experimental design assigns a treatment $Z_i \in \{0,1\}$ to each intervention point. $\mathbf{Z} = (Z_1, \dots , Z_N)$ . Design-based inference = $\mathbf{Z}$ のみランダム，他の変数はすべて固定値．
$Y_x(\mathbf{z})$ : potential outcome at $x \in \mathcal{X}$ under $\mathbf{Z} = \mathbf{z}$ . Potential outcome は個人ごとでなく location ごとに定める．
Observed outcome:

$Y_x = \sum_{\mathbf{z} \in \mathcal{Z}} Y_x(\mathbf{z})I(\mathbf{Z} = \mathbf{z})$

Let $\mathbf{Y}(\mathbf{z}) = \{Y_x(\mathbf{z})\}_{x \in \mathcal{X}}$ and $\mathbf{Y} = \{Y_x(\mathbf{Z})\}_{x \in \mathcal{X}}$ .
"Circle average" function:

$\displaystyle \mu_i( \mathbf{Y}(\mathbf{z}); d) = \frac{1}{|\{x : d_i(x) = d\}|}\int_{x: d_i(x) = d} Y_x(\mathbf{z})dk$

where $d_i(x) = ||x(i) - x||$ , and $k$ is a suitable measure = 地点 $i$ から距離 $d$ 離れた地点の potential outcome の平均．

Realized circle average は $\mu_i( \mathbf{Y}; d) = \sum_{\mathbf{z} \in \mathcal{Z}} \mu_i( \mathbf{Y}(\mathbf{z}); d) I(\mathbf{Z} = \mathbf{z})$

Defining a marginal spatial effect

Rewrite the potential outcome as $Y_x(z_i, \mathbf{z}_{-i})$
Experimental design parameter: $\alpha$ (e.g., $Z_i = 1$ を割り振る確率)
以下を定義する：

$Y_{ix}(z; \alpha) = \sum_{\mathbf{z}_{-i} \in \mathcal{Z}_{-i}}Y_x(z, \mathbf{z}_{-i})\Pr(\mathbf{Z}_{-i} = \mathbf{z}_{-i} \mid Z_i = z, \alpha)$

Individualistic marginalized response:

$\tau_{ix} = Y_{ix}(1; \alpha) - Y_{ix}(0; \alpha)$

= 地点 $i$ のトリートメントが変化したときの地点 $x$ における response

同様に以下を定義する：

$\mu_i(z; d, \alpha) = \sum_{\mathbf{z}_{-i} \in \mathcal{Z}_{-i}} \mu_i(\mathbf{Y}(z, \mathbf{z}_{-i}); d)\Pr(\mathbf{Z}_{-i} = \mathbf{z}_{-i} \mid Z_i = z, \alpha)$

$\tau_i(d; \alpha) = \mu_i(1; d, \alpha) - \mu_i(0; d, \alpha)$

= 地点 $i$ のトリートメントが変化したときに $i$ から距離 $d$ 離れた地点における平均 reponse

Average marginalized response:

$\displaystyle \tau(d; \alpha) = \frac{1}{N}\sum_{i = 1}^N \tau_i(d; \alpha)$

Bernoulli assignment rule の場合 $(1, \mathbf{z}_{-i})$ と $(0, \mathbf{z}_{-i})$ は同じ確率で起きる．
一方で，トリートメントグループのサイズ (say, $N_1$ ) が決められている random assignment rule の場合， $Y_{ix}(1; \alpha)$ は "残りの" $N_1-1$ 地点のトリートメント assignment で marginalize する一方， $Y_{ix}(0; \alpha)$ は $N_1$ 地点すべてのトリートメント assignment で marginalize している．
ただし，データのサイズ $N$ が十分大きい場合，これらの assignment rule の違いはほとんど無視できる．

Inferential assumptions

Assumption (Bernoulli design): $\{Z_1, \dots , Z_N\}$ are i.i.d. $Bernoulli(p)$ .

$I_{ij}(d)$ を「地点 $i$ へのトリートメントが $j$ からの半径 $d$ 平均 response に干渉するか否か」を表す指示関数とする：

\begin{align} I_{ij}(d) = \begin{cases} 1 & \text{if} \;\; \mu_j(\mathbf{Y}(\mathbf{z}); d) \neq \mu_j(\mathbf{Y}(\mathbf{z}'); d) \;\; \text{for some} \;\; \mathbf{z}, \mathbf{z}' \;\; \text{such that} \;\; \mathbf{z}_{-i} = \mathbf{z}'_{-i} \\ 1 & \text{if} \;\; i = j \\ 0 & \text{otherwise}\end{cases} \end{align}

さらに， $s_{ij}(d)$ を「地点 $i$ と $j$ からの半径 $d$ 平均 response が同時にあるトリートメントの影響を受けるか否か」を表す指示関数とする：

\begin{align} s_{ij}(d) = \begin{cases} 1 & \text{if} \;\; I_{\ell i}(d) I_{\ell j}(d) = 1 \;\; \text{for some} \;\; \ell \in \mathcal{N} \\ 0 & \text{otherwise}\end{cases} \end{align}

Assumption (Local interference): There exists a constant $h(d)$ such that if $||x(i) - x(j)|| - d \gt h(d)$ , then $s_{ij}(d) = 0$ .

Local interference assumption の implication: $Cov(\mu_i(Y(z, \mathbf{Z}_{-i}); d), \mu_j(Y(z, \mathbf{Z}_{-j}); d)) = 0$ for all $z, z'$ .

Assumption (Intervention point spacing): 各地点は少なくともお互いに $d_0$ よりも離れている．

Estimation and inference

Horvitz-Thompson estimator:

$\displaystyle \hat \tau(d) = \frac{1}{Np} \sum_{i = 1}^N Z_i \mu_i(\mathbf{Y}; d) - \frac{1}{N(1 - p)} \sum_{i = 1}^N(1 - Z_i) \mu_i(\mathbf{Y}; d)$

Proposition 1: $\mathbb{E}_Z[\hat \tau(d) ] = \tau(d; \alpha)$

Proposition 2: $\sqrt{N}(\hat \tau(d) - \tau (d; \alpha)) \overset{d}{\to} N(0, V_{HT})$

Hajek estimator:

$\displaystyle \hat \tau_{HA}(d) = \frac{1}{N_1} \sum_{i = 1}^N Z_i \mu_i(\mathbf{Y}; d) - \frac{1}{N_0} \sum_{i = 1}^N(1 - Z_i) \mu_i(\mathbf{Y}; d)$ , where $N_1 = \sum_{i = 1}^N Z_i$ and $N_0 = N - N_1$ .