General setup

Let us set up the notations first. Suppose a there exists a partition of a region D ∈ ℛ² (e.g., a city). This partition is denoted by A_i, i = 1, …, n. Moreover, there exists another partition of the same city, denoted B_j, where j = 1, …, m. These partitions can be seen as two different administrative divisions within the same city. It is common for different government agencies to release data for different divisions of a same city, country, or state.

Model-based approach

Consequently,

$$\mathrm{E}[Y(A_i)] = \frac{1}{\lvert A_i \rvert} \int_{A_i} \mathrm{E}[Y(\mathbf{s})] \, \mathrm{d} \mathbf{s} = \frac{1}{\lvert A_i \rvert} \int_{A_i} \mu \, \mathrm{d} \mathbf{s} = \mu$$

and

where ∥s − s′∥ is the Euclidean distance between the coordinates s and s′, and C(∥s − s′∥; θ) is an isotropic covariance function depending on the parameter θ.

Assume we observe a random variable Y(⋅) at each region A_i and we are interested in predict/estimate this variable in each of the regions B_j. Now suppose the random variable Y(⋅) varies continuously over D and is defined as follows Y(s) = μ + S(s) + ε(s), s ∈ D ⊂ ℛ².

where $$ S(\cdot) \sim \mathrm{GP}(0, \sigma^2 \rho(\cdot; \, \phi, \kappa)) \; \text{ and } \; \varepsilon(\cdot) \overset{\mathrm{i.i.d.}}{\sim} \mathrm{N}(0, \sigma^2 \rho(\cdot; \, \phi, \kappa)), $$ where S and ε are independent. For now, let’s make the unrealistic assumption that all those parameters are known. Then, our assumption is that the observed data is as follows

where |⋅| returns the area of a polygon. Furthermore, it can be shown that (using Fubini’s Theorem and some algebraic manipulation) $$ \mathrm{Cov}(Y(A_i), Y(A_j)) = \frac{\sigma^2}{\lvert A_i \rvert \lvert A_j \rvert} \int_{A_i \times A_j} \rho( \lVert \mathbf{s} - \mathbf{s}' \rVert; \, \phi, \kappa ) \, \mathrm{d} \mathbf{s} \, \mathrm{d} \mathbf{s}' + \mathbf{I}(i = j) \frac{\tau}{\lvert A_i \rvert}, $$ where ρ(⋅; ϕ, κ) is a positive definite correlation function. Now, let R_κ(ϕ) be a correlation matrix such that $$ \mathrm{R}_{\kappa}(\phi)_{ij} = \frac{1}{\lvert A_i \rvert \lvert A_j \rvert} \int_{A_i \times A_j} \rho( \lVert \mathbf{s} - \mathbf{s}' \rVert; \, \phi, \kappa ) \, \mathrm{d} \mathbf{s} \, \mathrm{d} \mathbf{s}', $$ thus, Y(A₁, ⋯, A_n) ∼ N(μ1_n, σ²R_κ(ϕ) + τdiag(|A₁|⁻¹, …, |A₁|⁻¹)). Then, if we assume (Y^⊤(A₁, ⋯, A_n), Y^⊤(B₁, ⋯, A_m)^⊤) to be jointly normal, we use can the conditional mean of Y^⊤(B₁, ⋯, A_m)^⊤ given Y^⊤(A₁, ⋯, A_n) to estimate the observed random variable in the partition B₁, …, B_m.

Now, suppose the parameters θ = (μ, σ², ϕ, τ) are unknown. The Likelihood of Y(A₁, …, A_n) can still be computed.

In particular, if we use the parametrization α = τ/σ², we have closed form for the Maximum Likelihood estimators both for μ and σ². Thus, we can optimize the profile likelihood for ϕ and α numerically. Then, we resort on conditional Normal properties again to compute the predictions in a new different set of regions.

Areal Interpolation (AI)

Areal interpolation is a nonparametric approach that interpolates Y(A_i)’s to construct Y(B_j)’s. Define an m × n matrix W = {w_ij}, where w_ij is the weight associated with the polygon A_i in constructing Y(B_j). The weights are w_ij = |A_i ∩ B_j|/|B_j| (Goodchild and Lam 1980; Gotway and Young 2002). The interpolation for Ŷ(B₁, …, B_m) is constructed as The expectation and variance of the predictor are, respectively, E[Ŷ(B₁, …, B_m)] = WE[Y(A₁, …, A_n)] and In practice, the covariance matrix Var[Y(A₁, …, A_n)] is unknown and, consequently needs to be estimated.

The variance each predictor Var[Ŷ(B_i)] is needed as an uncertainty measure. It relies on both the variances of Y(A_j)’s and their covariances: The variances are often observed in survey data, but the covariances are not. For practical purpose, we propose an approximation for Cov[Y(A_i), Y(A_l)] based on Moran’s I, a global spatial autocorrelation. Specifically, let ρ_I be the Moran’s I calculated with a weight matrix constructed with first-degree neighbors. That is, ρ_I is the average of the pairwise correlation for all neighboring pairs. For regions A_i and A_l, if they are neighbors of each other, our approximation is The covariance between non-neighboring A_i and A_l is discarded. The final uncertainty approximation for Var[Ŷ(B_i)] will be an underestimate. Alternatively, we can derive, at least, an upper bound for the variance of the estimates by using a simple application from the Cauchy–Schwartz inequality, in which case, ρ_I is replaced with~1.

Reference

Goodchild, Michael F, and Nina Siu-Ngan Lam. 1980. “Areal Interpolation: A Variant of the Traditional Spatial Problem.” Geo-Processing 1: 279–312.

Gotway, Carol A, and Linda J Young. 2002. “Combining Incompatible Spatial Data.” Journal of the American Statistical Association 97 (458): 632–48.

4. Method

General setup

Model-based approach

Areal Interpolation (AI)

Reference