Environmental Pollution 143 (2006) 416e426 www.elsevier.com/locate/envpol

The use of soil survey data to determine the magnitude and extent of historic metal deposition related to atmospheric smelter emissions across Humberside, UK B.G. Rawlins a,*, R.M. Lark b, R. Webster b, K.E. O’Donnell a a

British Geological Survey, Keyworth, Nottingham NG12 5GG, UK b Rothamsted Research, Harpenden, Hertfordshire AL5 2JQ, UK

Received 12 May 2005; received in revised form 12 December 2005; accepted 14 December 2005

Soil survey data are used to estimate the deposition of metals to land surrounding a former smelter. Abstract When a smelter has ceased operation, and in the absence of historical emission data, high-resolution geochemical surveys of the soil can reveal historical loads to the surrounding land. We use measurements of lead and tin in the soil at two depths to estimate the total quantities of these metals deposited on 286 km2 of land around the former Capper Pass smelter (north-east England). We subtracted median background concentrations for three parent material types outside the region of deposition from the data within it. We then constructed a statistical model of metal deposition based on the adjusted data. The data were from irregularly spaced sites and were strongly skewed with a spatial trend. We mapped the concentrations of the metals by lognormal universal kriging with the parameters for the trend and residuals modelled simultaneously by residual maximum likelihood (REML). The maps suggest that metal was deposited up to 24 km to the north-east of the smelter by the prevailing wind. We estimated total excess metal in the soil over the area of deposition to be 2500 t of lead and 830 t of tin. Ó 2006 NERC. Published by Elsevier Ltd. All rights reserved. Keywords: Smelter emission; Tin; Lead; Soil; REML; Universal kriging

1. Introduction Smelters of non-ferrous metals emit particles into the atmosphere. Most of the particles subsequently fall to the ground close to the smelters and result in increased concentrations of metals in both organic (McMartin et al., 1999) and mineral (Sterckeman et al., 2002) fractions of the soil. Accumulations of lead (Pb), cadmium (Cd) and zinc (Zn) in particular have reduced the abundance and diversity of invertebrates (Nahmani et al., 2003; Colgan et al., 2003). There have been few published studies of the effects of emissions on human health, but Roels et al. (1980) found that children close to

* Corresponding author. Tel.: þ44 115 9363140; fax: þ44 115 9363200. E-mail address: [email protected] (B.G. Rawlins).

a lead smelter ingested and inhaled more of the metal than those further away at a control site. Hence, there is serious interest in the nature, amount and extent of environmental pollution from smelters, both those that are currently operating and those that have ceased to function. Investigators also want sound methods of survey for estimating the effects. Where data are available on current or historical emissions from smelting one might be able to validate a model of the mass balance between emission and deposition based on the monitoring of atmospheric deposition. For example, De Caritat et al. (1997) used data from the chemical analysis of rain and snow to estimate atmospheric deposition of metals around the Monchegorsk smelter in Russia, and they compared their estimates with those from a model of deposition based on distance decay functions. Such an approach is not possible where there is little or no documentary evidence on historical

0269-7491/$ - see front matter Ó 2006 NERC. Published by Elsevier Ltd. All rights reserved. doi:10.1016/j.envpol.2005.12.010

B.G. Rawlins et al. / Environmental Pollution 143 (2006) 416e426

emissions and when a smelter has ceased to operate. One might estimate the extent and total quantity of various metals deposited on land from geochemical data from a high-resolution soil survey, provided that certain conditions are satisfied. First, sampling1 must be sufficiently dense in the vicinity of the smelter to capture the spatial dependence and to make accurate estimates of metal concentrations in the soil, given that deposition typically diminishes rapidly with increasing distance from the source (De Caritat et al., 1997). Second, each soil type or parent material represented in the polluted region must be sampled adequately outside the region to provide ‘background’ concentrations against which to judge the amount of pollutant deposited. The geochemical sampling effort for this will clearly depend on the complexity of the local bedrock and any superficial materials. The particular region that concerns us is that around a former tin smelter (Capper Pass) near North Ferriby on Humberside in the north-east of England. The smelter operated for more than 50 years in the last century, and is thought to have polluted more than 100 km2 in its neighbourhood with both tin (Sn) and Pb. The soil of the region was sampled by the British Geological Survey at a density of one sample per 2 km2, and the contents of metals in the soil were determined. We have analysed the data from the survey. We first estimated the typical concentrations of the metals in soil on the same parent materials outside the plume of deposition. We calculate median background concentrations of metal and subtract these from the actual data to estimate deposited metal. We then use these data to construct a statistical model of metal deposition, plot soil geochemical maps, and estimate total metal deposition over an area of 286 km2. 2. Materials and methods 2.1. Study region and soil survey The Capper Pass smelter occupied 28 ha of a 160-ha site on the north bank of the Humber estuary to the west of Hull (Fig. 1). It was the world’s largest producer of tin from secondary materials, including solder, drosses, nonferrous slags, flue dusts and tin-based alloys and residues. At its peak in the early 1980s the plant produced about 90 000 t of metal per year, including about 10% of the world’s output of tin. Impurities in the feed included Pb, antimony (Sb), arsenic (As), and copper (Cu). The smelter operated for 53 years from 1938 to 1991 (Litten and Strachan, 1995). The original 61-m high chimney was replaced in 1971 by a chimney of 183 m. The dominant parent materials of the soil in the region are Upper Cretaceous Chalk and two Quaternary deposits, namely alluvium (around the Humber estuary) and glacial till (see Fig. 1). A map of the parent material was digitized from four map sheets at 1:50 000 of solid and drift geology maps of the British Geological Survey (1983a,b, 1993, 1995). The dominant topographic feature is the northesouth trending outcrop of the Cretaceous Chalk forming the Yorkshire Wolds (up to 200 m above Ordnance Datum). The

417

ground to both the east and west is generally low-lying (<10 m), with very low land along the Humber estuary. Long-term data (from the British Meteorological Office) for a weather station in the region and summarized in the form of a wind rose (Department of the Environment, 1992, page 6) show that the strongest winds are from the south-west, which is also the dominant wind direction. Land use in the region at the time of the survey was predominantly arable agriculture (84%), with a small proportion of pasture (14%) and even less rough grazing (2%). In England, following the second world war, it was common for pasture and arable land to be rotated. This is significant because ploughing will have mixed any aerially deposited particulates for both arable and pasture to the maximum plough depth, typically between 20 and 30 cm. The geochemical data we analysed for this paper were recorded as part of a regional geochemical survey of eastern England. Sample sites were chosen from every second kilometre square of the British National Grid by simple random selection within each square, subject to the avoidance of roads, tracks, railways, domestic and public and gardens, and other seriously disturbed ground. The samples of soil were all collected in summer; those to the north of the Humber estuary in 1994, those few samples to the south in 1995. All sampling sites were in rural and peri-urban land. At each site a sample of topsoil (0e15 cm depth) was taken from five holes augered by hand at the corners and centre of a square of side 20 m, and combined to form a bulked sample weighing approximately 0.5 kg. Note that this local sampling configuration defines what is known in geostatistics as the support of the data. All our statistics are conditional on the support. We treat our data as point observations, however, since the support is very small by contrast to the distance between sample sites. This is standard practice in geostatistics, since all data must have a finite support. In addition, deeper samples were collected predominantly (80%) across the depth range 25e40 cm; the remainder at depths spanning 5 cm above (10%) and below (10%) this range. As for the topsoil, samples from each of the five auger holes were combined to form a bulked sample. These sampling depths are those of the standard survey protocol to meet the various objectives of the Geological Survey, though they might not be optimal for assessing aerially deposited particulates in the soil profile. All samples of soil were dried and disaggregated. The topsoil samples were sieved to pass 2 mm, the deeper ones to pass 150 mm; the two different grain-size fractions were not chosen for this study but are related to the broader objectives of the geochemical survey which also includes the sampling and analysis of sub-150-mm stream sediments. The comparison of analyses of soil samples in the profile based on different size fractions at different depths is problematic as they cannot be compared directly. If a homogenized bulk soil sample was analysed based on sub-samples of these two fractions, larger concentrations of trace elements would typically be reported for the finer fraction as the coarse fraction is more diluted by large amounts of minerals such as quartz that contain little of the trace elements. Data from a pilot study to the north of the study area in which analyses of these two fractions were compared for homogenized soil samples over a range of parent material types showed that calculated average Pb concentrations were 25% greater in the sub-150-mm than in the coarser fraction. We have not attempted to adjust our results to account for this difference as we believe there is no simple and justifiable mechanism for doing so. From each soil sample a 50-g sub-sample was ground in an agate planetary ball mill and pressed into pellets. The total concentrations of up to 33 major and trace elements (including As, Cd, Cu, Pb, Sb, Sn) were determined in each pellet by energy- and wavelength-dispersive XRFS (X-Ray Fluorescence Spectrometry). The detection limit for As, Cu, Pb, Sb was 1 mg kg1, whilst those for Sn and Cd were 0.8 and 0.7 mg kg1, respectively. Reference materials were analysed for calibration, and the British Geological Survey (2000) has published the results for six of them for all of the elements, covering the analytical concentration range compared with their recommended values in Govindaraju (1994).

1

The terms sample(s) and sampling are used in this paper in two senses. In statistics a sample is a set of units chosen from a population, and in a regional geochemical survey the units are sites where measurements are made or from which material is collected. Geochemists refer to the material they collect from any one site as ‘a sample’ and the process of collection as ‘sampling’. Where the context is not clear in the text, we clarify in which sense we are using the term.

2.2. Selection of plume and background soil sample subsets Preliminary maps of the concentrations of As, Cu, Pb, Sb, and Sn were made with proportional symbols for both the topsoil and subsoil for an area with a radius of 50 km centred on North Ferriby, the site of the smelter. There

B.G. Rawlins et al. / Environmental Pollution 143 (2006) 416e426

418

UM

North Sea

Humber estuary

Fig. 1. Parent materials in the study region and the soil sample locations within (circles), and outwith (squares) the deposition plume of the smelter.

appeared to be some enrichment of As, Cu and Sb in the soil within a few kilometres of the smelter, but it did not extend much further, and we therefore chose to limit our investigation to Pb and Sn, for which concentrations appeared to be increased for more than 20 km over land to the north and east of the smelter. The dominance of Pb and Sn accords with the results of atmospheric monitoring based on monthly large-volume air samples taken during the closure of the smelter (Litten and Strachan, 1995). Those results showed that these two metals typically comprised around 90% of the total mass of four airborne metals, the other of which were Cd and As. After examining the maps of Pb and Sn, we digitized a polygon that encompassed all those sampling sites contained within a hypothetical deposition plume extending to the north-east of the smelter (see Fig. 1). This polygon had a long axis of 24 km (trending south-west to north-east) and a short axis perpendicular to it of 13 km. Using digital versions of four 1:50 000 scale geological and superficial deposit map sheets (British Geological Survey, 1983a,b, 1993, 1995) we assigned each soil sample to one of the three parent material types. These samples (both surface and deeper soil) and their parent material identifiers comprise the plume subset (Fig. 1). We then identified sampling sites outside the plume but near to its margin and assigned them to two of the three parent material types (chalk and till). There were few soil samples taken on the alluvium near to the plume polygon to the north of the Humber estuary. We therefore selected samples on alluvium on the south bank because they represented the deposit of most similar composition (Fig. 1). These samples comprise the background subset, and their locations are also shown in Fig. 1.

2.3. Exploratory analysis Summary statistics were computed for concentrations of lead and tin in both topsoil and subsoil of the background data set and also for the subsets

of data identified with the three parent material classes. Table 1 lists the results. Most of the sets of data were strongly positively skewed (skewness coefficients > 1). We therefore express the centres of their distributions by their sample medians rather than their means to avoid giving undue weight to data in the long upper tails of the distributions. We then considered the data in the target region separately for each metal, each depth and for each of the three parent materials. The data for the two depths were treated separately throughout the subsequent geostatistical analysis. For each set we subtracted the median of the background data and recomputed summaries of the residuals, to which we subsequently refer as the adjusted concentrations in the soil. Although we cannot know the original values, we have made a pragmatic assumption that the medians of the background data would be the most reasonable measures of metal concentrations in the soil in the region before the smelter began operation. The results are listed in Table 2, and their quintiles are displayed as postplots in Fig. 2. All the variables still have strongly skewed distributions, and to stabilize their variances we transformed them to logarithms after fitting threeparameter lognormal curves to their frequency distributions using the distribution directive in GenStat (Payne et al., 2003). The probability density function for a variable z with such a distribution is given by

f ðzÞ ¼

  1 1 pffiffiffiffiffiffi exp  2 flnðz  aÞ  mg2 ; 2s sðz  aÞ 2p

ð1Þ

of which the three parameters are the mean, m, the standard deviation, s, and a shift, a. The transformed variable is y ¼ lnðz  aÞ with ywN ðm; sÞ:

ð2Þ

The directive DISTRIBUTION did not converge for the data on tin, so these were transformed by

B.G. Rawlins et al. / Environmental Pollution 143 (2006) 416e426 Table 1 Summary statistics on the background data (units mg kg1)

yðxÞ ¼

K X

bk fk ðxÞ þ 3ðxÞ:

419

ð4Þ

k¼0

Parent material

Metal Lead

Tin

Depth Topsoil

Subsoil

Topsoil

Subsoil

Alluvium Sample size Mean Median Std deviation Skewness

40 63.1 46.5 62.6 4.3

40 49.1 38.0 43.4 3.2

40 5.7 5.0 3.9 2.1

40 4.9 4.0 3.1 3.6

Chalk Sample size Mean Median Std deviation Skewness

83 45.2 43.0 9.5 1.1

82 37.5 38.0 10.5 0.1

83 3.4 3.0 1.9 1.0

82 3.3 3.0 1.1 0.0

Till Sample size Mean Median Std deviation Skewness

49 43.0 35.0 45.4 6.5

49 34.9 34.0 9.8 0.64

49 3.7 3.0 2.7 2.8

49 3.4 3.0 0.9 0.3

y ¼ lnðz  zmin þ 0:1Þ;

ð3Þ

where zmin is the minimum of z in the data. The estimates of a are in Table 2. Fig. 3 displays the log-transformed data as post-plots.

2.4. Spatial modelling by REML We wished to map the spatial distribution of the adjusted metal concentrations as continuous surfaces rather than simply as sets of points and so be able to see the general pattern of pollution. Further, we wished to do so optimally by kriging on a dense grid of points from which to make isarithmic maps. However, Figs. 2 and 3 suggest that these data, both before and after transformation, contain spatial trend, the presence of which complicates geostatistical analyses. Matheron (1969) introduced his ‘universal kriging’ to deal with such situations. Underlying the technique is the following model of variation:

Table 2 Summary statistics for the adjusted concentrations in the plume after subtraction of the corresponding medians for the same parent materials (units mg kg1) Metal Lead

Tin

Depth

The model has two components. The first is the trend term in which the fk are known functions of the spatial coordinates, x, and the bk are unknown coefficients. The second term, 3(x), is a spatially dependent random variable with zero mean and variogram g(h) defined by  1  gðhÞ ¼ E f3ðxÞ  3ðx þ hÞg2 ; 2

in which E denotes expectation and the symbol h is the separation, or lag, in both distance and direction. Note that g(h) is a function of h and only of h; it does not depend on x in the way that the trend term does. If the random component is second-order stationary then it has a covariance function, which is simply CðhÞ ¼ Cð0Þ  gðhÞ;

yðXÞ ¼ Fb þ 3ðXÞ:

ð7Þ

Here y(X) is vector of length N containing the N observations at positions X, 3ðXÞ is the vector of random components, and F is a N  (K þ 1) matrix, known as a design matrix, containing the predictors for the trend surface at all observation points, thus 2 T 3 f ðx1 Þ 6 f T ðx2 Þ 7 7 Fh6 4 « 5: T f ðxN Þ

Subsoil

Topsoil

Subsoil

Sample size Mean Median Std deviation Skewness

134 27.8 12.8 46.0 2.7

133 19.4 10.0 31.5 2.8

134 7.2 3.0 16.6 5.7

133 4.8 2.0 10.4 5.8

then we can compute

a

21.8

18.6

1.3

y ¼ LT y;

The value of a is the constant for the three-parameter log-normal transform subsequently applied to these data. The transform applied was y ¼ lnfz þ 3:1g; see text. a Directive DISTRIBUTION failed to converge.

ð6Þ

where C(0) is the variance of the process. In this paper we consider the variation in the random term to be isotropic, since we have rather fewer data than are usually thought necessary to estimate an anisotropic variance model (Webster and Oliver, 1992). So the lag becomes a scalar in distance, h ¼ jhj, only, and the variogram and covariance function are denoted by g(h) and C(h), respectively. Universal kriging uses a model of this variogram together with the data to predict values at unsampled points or the average values over blocks (though we do not use the block option here). We present the kriging system below. The problem is to obtain the variogram from the data, which contain both trend and random components. Olea (1975) showed how to do it for data on regular grids and transects by a structural analysis, and Webster and Burgess (1980) applied this solution in a case study. If the data are irregularly scattered, as around the Capper Pass smelter, this solution is not feasible. An alternative is to use residual maximum likelihood (REML) to model both the trend and the random residuals from the trend simultaneously, and it is the solution we pursue here. The REML technique was introduced by Patterson and Thompson (1971) for the estimation of variance components. In essence it obtains a new random variable, a function of the data, that is independent of the nuisance parameters and that has a covariance matrix C, the elements of which derive from C(h). We are therefore restricted to conditions where second-order stationarity can be assumed. The technique estimates the parameters of a mathematical model of C(h), or equivalently g(h), by applying maximum likelihood to this new variable; this is the residual likelihood. For compactness we switch to matrix notation. If we have N data then we can express Eq. (4) for those data by

Topsoil

a

ð5Þ

We assume that 3 is multivariate normal with zero mean and covariance matrix C, which is completely determined by C(h), as above. Now, if for some non-singular matrix L LT F ¼ 0;

and   y wN 0; LT CL :

B.G. Rawlins et al. / Environmental Pollution 143 (2006) 416e426

420

444000

444000

a)

442000

442000

440000

440000

438000

438000

436000

436000

434000

434000

432000

432000

b)

430000

430000

428000

428000

426000

426000 492000

496000

500000

504000

492000

508000

496000

27 to 41 41 to 49 49 to 58

504000

508000

24 to 36 36 to 43 43 to 52 52 to 68 68 to 236

58 to 79 79 to 293

444000

500000

444000

c)

442000

442000

440000

440000

438000

438000

436000

436000

434000

434000

432000

432000

d)

430000

430000

428000

428000

426000

426000 492000

496000

500000 2 5 6 9

to to to to

504000

508000

492000

496000

500000 2 4 5 6

5 6 9 14

to to to to

504000

508000

4 5 6 9

9 to 95

14 to 147

Fig. 2. Quintiles of the adjusted metal concentrations (in mg kg1) within the plume (after subtraction of the sample medians for the corresponding parent material type in the background data set: (a) Pb in surface soil, (b) Pb in deeper soil, (c) Sn in surface soil and (d) Sn in deeper soil.

For the general linear model, as used here, the log residual likelihood is (Stuart et al., 1999) 1 

1 1 l f ^ b; b ¼ constant  lnjCj  ln FT C1 F  yT C1 ðI  QÞy; 2 2 2

ð8Þ

where

1  QhF FT C1 F FT C1 :

ð9Þ

In practice we have to enter values into the covariance matrix, C; these are obtained from a mathematical model of the covariance function, C(h). The parameters of this covariance function are also parameters of the variogram under second-order stationarity, as expressed in Eq. (6), but it must be remembered that the covariance function does not exist in all circumstances when the variogram does. In this paper we use the variogram in our discussion of spatial variation and estimation, with the implicit assumption of second-order stationarity. In order that we may determine terms of C we must model C(h), or equivalently the variogram, g(h), with some continuous function of the lag such that the covariance matrix is necessarily positive definite. There

B.G. Rawlins et al. / Environmental Pollution 143 (2006) 416e426

444000

444000

a)

442000

442000

440000

440000

438000

438000

436000

436000

434000

434000

432000

432000

430000

430000

b)

428000

428000

426000

426000 492000

496000

500000

504000

508000

492000

496000

500000

504000

508000

504000

508000

1.5 to 2.3 2.3 to 3.1 3.1 to 3.8 3.8 to 4.6 4.6 to 5.4

1.2 to 2.1 2.1 to 3.0 3.0 to 3.9 3.9 to 4.7 4.7 to 5.6

444000

421

444000

c)

442000

442000

440000

440000

438000

438000

436000

436000

434000

434000

432000

432000

430000

430000

d)

428000

428000

426000

426000 492000

496000

500000

504000

508000

492000

496000

500000

-1.1 to 0.0 0.0 to 1.1 1.1 to 2.3 2.3 to 3.4 3.4 to 4.5

-2.3 to -0.8 -0.8 to 0.6 0.6 to 2.1 2.1 to 3.5 3.5 to 5.0

Fig. 3. Log-transformed values (mg kg1) of the adjusted metal concentrations to the north-east of the smelter ( ): (a) Pb in surface soil, (b) Pb in deeper soil, (c) Sn in surface soil and (d) Sn in deeper soil. The hachuring is on the side of the smaller values for each contour.

are rather few simple functions that guarantee this condition (see Webster and Oliver, 2001). The two we have used in this case study are the popular spherical and exponential; their definitions are as follows.

2.4.2. Exponential

2.4.1. Spherical  3h 1 h 3  for 0  h < a gðhÞ ¼ c 2a 2 a ¼c for h  a:

random variables 3(x) and 3(x þ h) are statistically uncorrelated with one another if h  a.

 h : gðhÞ ¼ c 1  exp  r

ð11Þ

ð10Þ

Here c is the a priori variance of the process and is the upper bound of the function, its ‘sill’; and a is a distance parameter, the range, which is finite. The

The exponential function also has a sill, c, which it approaches asymptotically; it does not have a finite range, but an effective range is often taken as a0 ¼ 3r where it reaches approximately 95% of its sill value.

B.G. Rawlins et al. / Environmental Pollution 143 (2006) 416e426

422

Almost always we must add to such a model a spatially uncorrelated ‘nugget’ variance, which we denote c0. So, the complete formula for the spherical variogram, for example, is 3  3h 1 h  for 0 < h < a 2a 2 a ¼ c0 þ c1 for h  a

gðhÞ ¼ c0 þ c1 ¼0

for h ¼ 0:

trend model (Stein, 1999), and denoted ‘empirical’ because it is also conditional on our model for the variogram derived from the data. For each target position x0 the prediction is a linear combination of the N values of y: e 0Þ ¼ Yðx

N X

li yðxi Þ:

ð14Þ

i¼1

ð12Þ

There are other functions that describe the variogram. Note that only functions for second-order stationary random variables (i.e. bounded variogram functions) are compatible with the existence of the covariance function. These simple functions describing the spatial dependence in 3(x) are thus completely defined by their form, spherical or exponential, and the three parameters,

Its expectation is K X N i X h e 0Þ ¼ bk li fk ðxi Þ; E Yðx

and the prediction is unbiased if N X

f ¼ ½c0 ; c1 ; a:

ð15Þ

k¼0 i¼1

li fk ðxÞ ¼ fk ðx0 Þ for all k ¼ 1; 2; .; K:

ð16Þ

i¼1

To proceed to the kriging we require estimates of these parameters, and we obtain them for a given type of model by REML. The REML estimates of the variance parameters are those that maximize the residual likelihood conditional on the data. These are found numerically. The average information (AI) algorithm of Gilmour et al. (1995) is efficient. It is not suitable for estimating the parameters of spherical model, however, because these do not have a smooth likelihood function, and so the gradient method used in the AI algorithm can stick at local optima. Lark and Cullis (2004) used simulated annealing to find the REML estimates of spherical model parameters, and they discovered that these could be better than those from the AI algorithm; so this is the method that we have used here. For each log-transformed variable we estimated the variance parameters for linear and quadratic trend surfaces after first rescaling the coordinates from metres to kilometres and adjusting them to a local origin for numerical stability. We then obtained the REML estimates of the variance parameters by simulated annealing. We did this for both spherical and exponential models of the variogram, and finally chose the model for which the maximized residual likelihood was largest. Universal kriging does not require the actual trend model to be computed separately because the trend is implicit in the kriging system. We were not obliged to estimate the trend parameters b. Nevertheless, we did so; we computed the estimates and their standard errors by generalized least-squares so that we could see whether particular components of the two trend models were likely to be useful, and so select a model to use in the kriging. The generalized least-squares estimate of b is 

^ ¼ FT C1 F 1 FT C1 y: b

ð13Þ

Two comments on the assumptions of the REML analysis are worth making. First, we noted above that 3 is assumed to be a realization of a multivariate normal process. Since we can only have one observation of 3 (one observation at each sample site) this assumption is unverifiable. It is supported (although not ensured) if the histogram of the data appears approximately normal, perhaps after transformation. However, Kitanidis (1985) showed that likelihood-based estimates of spatial variance parameters were robust to departures from normality in simulations; and Pardo-Igu´zquiza (1998) showed that, given our ignorance of the actual underlying multivariate distribution, the assumption of normality may be justified by an entropy criterion. Second, we assume the existence of a covariance matrix for 3. This requires that the random process be second-order stationarity, a stronger requirement than the intrinsic hypothesis, which is all that is necessary for the existence of the variogram. We are therefore limited to bounded variograms, such as the spherical (in which the sill value is the maximum variance) and the exponential (which is asymptotically bounded by the sill variance).

2.5. Lognormal universal kriging

Subject to this condition the weights li are chosen to minimize the expected mean squared error of the prediction, the UK variance s2UK, by solution of the following system of equations: N X

K X 





li g xi  xj þ j0 þ jk fk xj ¼ g x0  xj for all j ¼ 1; 2; .; N;

i¼1

k¼1

N X

li ¼ 1;

i¼1 N X

li fk ðxÞ ¼ fk ðx0 Þ for all k ¼ 1; 2; .; K:

This is the universal kriging system in which the g(xi  xj) are the semivariances of 3(x) between the data points xi and xj, and g(x0  xj) are the semivariances between the target point, x0 and the data points. The quantities jk, k ¼ 0,1,2,.,K, are Lagrange multipliers introduced for the minimization of the variance subject to the unbiasedness constraints. It is a set of linear equations, which can be succinctly written in matrix notation as Al ¼ u: Matrix A is 2 gðx1  x1 Þ gðx1  x2 Þ 6 gðx2  x1 Þ gðx2  x2 Þ 6 6 « « 6 6 gðxN  x1 Þ gðxN  x2 Þ 6 1 1 A¼6 6 6 f1 ðxÞ f1 ðx2 Þ 1 6 6 f2 ðxÞ f2 ðx2 Þ 1 6 4 « « fK ðxÞ1 fK ðx2 Þ

ð18Þ

. gðx1  xN Þ . gðx2  xN Þ . « . gðxN  xN Þ . 1 . f1 ðxN Þ . f2 ðxN Þ . « . fK ðxN Þ

3 1 f1 ðx1 Þ f2 ðx1 Þ . fK ðx1 Þ 1 f1 ðx2 Þ f2 ðx2 Þ . fK ðx2 Þ 7 7 « « « . « 7 7 1 f1 ðxN Þ f2 ðxN Þ . fK ðxN Þ 7 7 0 0 0 . 0 7 7 0 0 0 . 0 7 7 0 0 0 . 0 7 7 « « « . « 5 0 0 0 . 0

and l and u are 3 2 3 2 l1 gðx1  x0 Þ 6 l2 7 6 gðx2  x0 Þ 7 7 6 7 6 7 6 « 7 6 « 7 6 7 6 6 lN 7 6 gðxN  x0 Þ 7 7 6 7 6 7: 7 6 1 l¼6 7 6 j0 7 and u ¼ 6 6 j1 7 6 f1 ðx0 Þ 7 7 6 7 6 6 j2 7 6 f2 ðx0 Þ 7 7 6 7 6 5 4 « 5 4 « fK ðx0 Þ jK We solve the kriging equation by l ¼ A1 u;

We transformed the original concentrations to approximately normally distributed variables, y, as described above. We then predicted values of the y at the nodes of a fine grid by punctual universal kriging (UK) based on all the topsoil and deeper soil data separately. The UK estimate of a variable is the empirical best linear unbiased predictor (E-BLUP) conditional on the selected

ð17Þ

i¼1

ð19Þ

to obtain the kriging weights, li, l2,., which we then insert into Eq. (14) to give our predictions. The kriging variance is given by s2UK ¼ uT l:

ð20Þ

B.G. Rawlins et al. / Environmental Pollution 143 (2006) 416e426 Universal kriging returns the E-BLUP for the normal variable Y(x); but we require estimates on the scale of the original data z(x). As with any estimate derived from log-transformed data, we cannot simply back transform the estimates on the logarithmic scale; we must also correct for bias. Cressie (2004) e0 ðx0 Þ, based on the has shown that the UK estimate of a lognormal variable Z e 0 Þof the corresponding Y, is UK estimate Yðx ( ) K X 1 2 0 e e ji fi ðx0 Þ : Z ðx0 Þ ¼ exp Yðx0 Þ þ sUK  j0  2 i¼1

ð21Þ

We therefore back-transformed our kriged estimates in this way. We kriged the log-transformed variables at the nodes of a regular grid with interval 500 m over the region. We specified the predictor variables selected after the trend analysis of the data using REML, and used the variogram model estimated by REML. We used all observations for every kriging system because we wanted the trend model at all target sites to be the same as the overall trend model to which our variogram refers. We then used Eq. (21) to backtransform the estimates to the original scale, and corrected for the shift constant, a or zmin, in the log-transformation. The final predictions of adjusted metal were then ‘contoured’ to produce the isarithmic maps displayed in Fig. 4.

3. Results and their interpretation 3.1. Trend and variogram models based on REML ^ for the quadratic We examined the parameter estimates b and linear trend, for both metals and both depths, with whichever of the spherical and exponential variogram models maximized the residual likelihood. We noted that in all cases the parameters for the quadratic terms, and the linear term in the eastings were small relative to their standard errors (a t ratio smaller than 1.96). The linear coefficient for the northing was always large (except for topsoil tin for which it was larger than any other coefficient, but with a t ratio still smaller than 1.96). For this reason we fitted a simple trend surface linear in the northing for all variables. Since the trend appears to be limited to one direction, an experimental variogram for the error variable 3(x) could have been obtained by the usual method of moments estimator

a)

c)

444000

444000

442000

442000

440000

440000

438000

438000

436000

436000

434000

434000

432000

432000

430000

430000

428000

428000

426000

426000 492000

496000

500000

504000

508000

b)

d)

444000

444000

442000

442000

440000

440000

438000

438000

436000

436000

434000

434000

432000

432000

430000

430000

428000

428000

426000

423

492000

496000

500000

504000

508000

492000

496000

500000

504000

508000

426000 492000

496000

500000

504000

508000

Fig. 4. Contour maps of adjusted metal concentrations (in mg kg1) around the smelter ( ) for (a) Pb in surface soil, (b) Pb in deeper soil, (c) Sn in surface soil and (d) Sn in deeper soil. The hachuring is on the side of the smaller values for each contour.

B.G. Rawlins et al. / Environmental Pollution 143 (2006) 416e426

424

applied only to comparisons between pairs of points separated by a lag vector perpendicular to this trend. A model, such as the exponential or spherical could then have been fitted to this and used for kriging. This approach, which is often advocated for geostatistical analysis of data with trend, might be preferred to REML because it is simpler. However, we concluded that this particular trend model is appropriate from inferences about the parameters of different models that are minimum variance estimates only because we have estimated them with REML. Further, the fact that we have used REML means that all paired comparisons between observations contribute to our estimate of the variance parameters, not just a subset which might be small when the sampling sites are irregularly scattered. Table 3 lists the parameters of the fitted models. Note that, apart from subsoil tin, the range of spatial dependence is large, and that for topsoil tin the variogram is still far from its bound at lag distances within the confines of the region. However, the effective range is not smaller for more complex trend models, and the fact that the coefficients for higher-order trend components are small suggests that the long-range variation, after removal of the linear trend on northings, can be treated as random.

3.2. Geochemical maps of adjusted metal concentration There is a striking difference between the map of adjusted Pb in the topsoil (Fig. 4a) and that in the subsoil (Fig. 4b): the former has steep gradients around the site of the smelter, whereas the latter does not. In addition, there are two exceptionally large Pb concentrations (293 and 194 mg kg1) surrounded by a series of closely spaced contours to the east on the topsoil map with nothing comparable on the map of the subsoil. The two locations are on the urban fringe of Hull, and the large concentrations there might be from a source other than the smelter. Otherwise, the two maps are similar in showing towards the north-east end of the plume (504 km easting and 438 km northing) concentrations of 60 to 80 mg kg1 greater than those of the backgrounds of the parent materials. There are also unusually large concentrations of Sn in the topsoil in the same part of the region (Fig. 4c). If these larger concentrations of Pb and Sn are the result of deposition of particles emitted from the smelter then this pattern is contrary to observations that metal deposition diminishes Table 3 Models fitted to log-transformed, adjusted metal concentrations Metal Depth

b0

b1

t Ratioa Model

Lead Topsoil 4.53 0.098 2.24 Subsoil 4.01 0.066 1.95 Tin

Topsoil 3.22 0.151 2.55 Subsoil 2.07 0.084 2.23

Spherical Spherical

c0

c1

a/km r/km

0.22 0.46 18.0 0.2 0.36 18.9

Exponential 0.33 1.30 Exponential 0.32 0.41

21.3 5.6

The parameters b0 and b1 are for the trend from south to north (and coordinates in km), and c0, c1 and a and r (in km) are those of the variogram models of the random components. a For null hypothesis that b1 ¼ 0.

rapidly with increasing distance (De Caritat et al., 1997). On days with strong south-westerly winds, particulate deposition might have been enhanced in this part of the region which forms the leeward slope of the northesouth trending Yorkshire Wolds (diminishing rapidly from a maximum elevation of 60 m). Alternatively, the larger concentrations might simply be due to natural variation in the geochemistry of the parent materials. One might resolve this uncertainty by further investigations based on differences in the Pb isotope composition of the smelter emissions and native Pb in the soil. 3.3. Estimates of excess metal in the soil We wanted to estimate the excess quantities of Pb and Sn in the soil across the 286 km2 of the plume, based on our kriged estimates of adjusted metal concentration. We assumed that the topsoil and deeper soil samples would provide reasonable estimates of the concentration across their depth ranges (0e 15 cm and 25e40 cm, respectively). We also assumed that the average of the concentrations at these two depths would be a reasonable estimate for the soil in the depth interval (15e25 cm), subsequently referred to as the intermediate depth. We selected the final estimates of adjusted metal concentrations from the nodes of our 500-m grid at each of the three depths. We then averaged these to provide an estimate of the average adjusted metal content of the soil across the plume. We converted the adjusted metal concentrations into total quantities of metal in the soil (at each of the three depths). Here we made assumptions concerning the proportions of stones in the soil and its bulk density. The dominant soil types in the region, as judged from the scheme of soil classification adopted in England and Wales (Avery, 1990), are ‘fine loamy soils’ and ‘well drained calcareous fine silty soils’. We have assumed a stone content of 10% e the centre of the class termed slightly stony (Avery, 1990) e and a typical soil bulk density of 1.35 g cm3 (Soil Survey of England and Wales, 1977) uniform down to 45 cm. We then aggregated the total amount of adjusted metal for each node within the plume at each of the depths for both metals. The aggregated adjusted metal will overestimate the excess metal, as defined above, since it is based on the difference between the observed metal concentration and the median background concentration for the parent material. For this reason we took the difference between the mean and median background concentrations (Table 1) and from these computed a correction to the aggregated metal concentration over the plume. We calculated the mass of soil over the area of each parent material in the plume, till (108 km2), chalk (106 km2) and alluvium (74 km2), making the same assumptions about bulk density and stoniness that we describe above. We multiplied this mass by the difference between the mean and median concentration in the corresponding background, and summed these results to obtain an overall correction. The corrections were then subtracted from the aggregated adjusted metal to provide an estimate of excess metal. For the intermediate depth we used the average of the corrections for the

B.G. Rawlins et al. / Environmental Pollution 143 (2006) 416e426

topsoil and subsoil. Over the area of the plume, we estimated excess amounts of Pb to be 1174, 633 and 723 t for the top, intermediate and deeper soil, respectively. The corresponding amounts of excess Sn are 424, 208 and 199 t. Given that our estimates are based on several assumptions, we express the total excess metal estimates to two significant figures. Hence, we estimate that the total excess Pb is approximately 2500 t, and total excess Sn is 830 t. The downward transfer of aerially deposited particles is likely to have been enhanced by ploughing (plough depths typically 20e30 cm) as arable agriculture is the dominant land use in the region. Our estimates cannot account for any excess metal transported below 40 cm. In addition to Pb derived from the smelter, there are diffuse sources in the locality (particularly Pb from vehicle exhausts) which could in part contribute to the estimate of excess metal. However, the background sample subset would also have been subject to diffuse, aerial deposition of Pb, and therefore part of its impact is likely to have been cancelled out. Notwithstanding this, we did compare the range of diffuse inputs based on data for aerial deposition of Pb to agricultural soils across England from major sources between 1995 and 1998 (Nicholson et al., 2003) with our estimates of excess Pb. The minimum and maximum rates are equivalent to 0.54 and 4.0 t Pb per year across the plume. Even at the maximum rate of Pb deposition reported by Nicholson et al., the total over 20 years is unlikely to exceed 80 t over the area of the plume.

4. Discussion We believe that, in the absence of other significant sources of Sn in the local environment, the vast majority of excess Pb and Sn in the soil across the region came from the Capper Pass smelter as fallout of airborne particulates. Unpublished data based on scanning electron microscope analyses of bark samples from present-day trees and attic dusts provide further evidence to support this conclusion. A previous analysis of malignancies among both children and adults in north Humberside showed increased risks close to the smelter (Alexander et al., 1984). Data on the magnitude and distribution of historical metal deposition related to the former operation of the smelter could aid subsequent epidemiological studies.

Acknowledgements We thank all the staff of the British Geological Survey and volunteers involved in the collection and analysis of soil samples in the G-BASE project, and an anonymous reviewer for helpful comments on an earlier draft of the script. This paper is published with the permission of the Director of the British Geological Survey (Natural Environment Research Council). R.M. Lark’s contribution was supported by Rothamsted Research’s core grant from the Biotechnology and Biological Sciences Research Council.

425

References Alexander, F., McKinney, P.A., Cartwright, R.A., 1984. The pattern of childhood and related adult malignancies near Kingston-upon-Hull. Journal of Public Health Medicine 13, 96e100. Avery, B.W., 1990. Soils of the British Isles. CAB International, Wallingford. British Geological Survey, 1983a. York Sheet 63: Solid and Drift. Ordnance Survey for the Institute of Geological Sciences, Southampton. British Geological Survey, 1983b. Kingston Upon Hull Sheet 80: Solid and Drift. Ordnance Survey for the Institute of Geological Sciences, Southampton. British Geological Survey, 1993. Great Driffield Sheet 64: Solid and Drift. British Geological Survey, Keyworth. British Geological Survey, 1995. Beverley Sheet 72: Solid and Drift. British Geological Survey, Keyworth. British Geological Survey, 2000. Regional Geochemistry of Wales and Part of West-central England e Stream Sediment and Soil. British Geological Survey, Keyworth. Colgan, A., Hankard, P.K., Spurgeon, D.J., Svendsen, C., Wadsworth, R.A., Weeks, J.M., 2003. Closing the loop: a spatial analysis to link observed environmental damage to predicted heavy metal emissions. Environmental Toxicology and Chemistry 22, 970e976. Cressie, N., 2004. Block Kriging for Lognormal Spatial Processes. Technical Report No 739. Department of Statistics, Ohio State University, Columbus, OH. De Caritat, P., Reimann, C., Chekushin, V., Bogatyrev, I., Niskavarra, H., Braun, J., 1997. Mass balance between emission and deposition of airborne contaminants. Environmental Science and Technology 31, 2966e2972. Department of the Environment, 1992. The UK Environment. Her Majesty’s Stationery Office, London. Gilmour, A.R., Thompson, R., Cullis, B.R., 1995. Average information REML: an efficient algorithm for variance parameter estimation in linear mixed models. Biometrics 51, 1440e1450. Govindaraju, K., 1994. Compilation of working values and sample description for 383 geostandards. Geostandards Newsletter 18, 1e158. Kitanidis, P.K., 1985. Minimum-variance unbiased quadratic estimation of covariances of regionalized variables. Journal of the International Association of Mathematical Geology 17, 195e208. Lark, R.M., Cullis, B.R., 2004. Model-based analysis using REML for inference from systematically sampled data on soil. European Journal of Soil Science 55, 799e813. Litten, J.A., Strachan, A.M., 1995. Aspects of the closure of Capper Pass and Son. Minerals Industry International (May), 28e34. Matheron, G., 1969. Le krigeage universel. Cahiers du Centre de Morphologie Mathe´matique. Ecole des Mines de Paris, Fontainebleau. McMartin, I., Henderson, P.J., Nielsen, E., 1999. Impact of a base metal smelter on the geochemistry of soils of the Flin Flon region, Manitoba and Saskatchewan. Canadian Journal of Earth Sciences 36, 141e160. Nahmani, J., Lavelle, P., Lapied, E., van Oort, F., 2003. Effects of heavy metal soil pollution on earthworm communities in the north of France. Pedobiologia 47, 663e669. Nicholson, F.A., Smith, S.R., Alloway, B.J., Carlton-Smith, C., Chambers, B., 2003. An inventory of heavy metals inputs to agricultural soils in England and Wales. Science of the Total Environment 311, 205e219. Olea, R.A., 1975. Optimum mapping techniques using regionalized variable theory. In: Series on Spatial Analysis No 2. Kansas Geological Survey, Lawrence, Kansas. Pardo-Igu´zquiza, E., 1998. Maximum likelihood estimation of spatial covariance parameters. Mathematical Geology 30, 95e108. Patterson, D.D., Thompson, R., 1971. Recovery of inter-block information when block sizes are unequal. Biometrika 58, 545e554. Payne, R., Murray, D., Harding, S., Baird, D., Soutar, D., Lane, P., 2003. GenStat for Windows. VSN International, Hemel Hempstead. Roels, H.A., Buchet, J.P., Lauwerys, R.R., Bruaux, P., Claeys-Thoreau, F., Lafontaine, A., Verduyn, G., 1980. Exposure to lead by the oral and the pulmonary routes of children living in the vicinity of a primary lead smelter. Environmental Research 22, 81e94.

426

B.G. Rawlins et al. / Environmental Pollution 143 (2006) 416e426

Soil Survey of England and Wales, 1977. Water Retention, Porosity and Density of Field Soils. Technical Monograph No 9. Lawes Agricultural Trust, Harpenden. Stein, M.L., 1999. Interpolation of Spatial Data: Some Theory for Kriging. Springer, New York. Sterckeman, T., Douay, F., Proix, N., Fourrier, H., Perdrix, E., 2002. Assessment of the contamination of cultivated soils by eighteen trace elements around smelters in the North of France. Water, Air and Soil Pollution 2002, 173e194.

Stuart, A., Ord, J.K., Arnold, S., 1999. Kendall’s advanced theory of statistics. In: Classical Inference and the Linear Model, sixth ed., vol. 2A. Arnold, London. Webster, R., Burgess, T.M., 1980. Optimal interpolation and isarithmic mapping of soil properties. III. Changing drift and universal kriging. Journal of Soil Science 31, 505e524. Webster, R., Oliver, M.A., 1992. Sample adequately to estimate variograms of soil properties. Journal of Soil Science 43, 177e192. Webster, R., Oliver, M.A., 2001. Geostatistics for Environmental Scientists. John Wiley and Sons, Chichester.

The use of soil survey data to determine the magnitude ...

form of a wind rose (Department of the Environment, 1992, page 6) show that the strongest winds are from the ... each pellet by energy- and wavelength-dispersive XRFS (X-Ray Fluorescence. Spectrometry). .... An alternative is to use residual ...

1MB Sizes 0 Downloads 129 Views

Recommend Documents

The use of soil survey data to determine the magnitude ...
Soil survey data are used to estimate the deposition of metals to land surrounding ..... 2 and 3 suggest that these data, both before and after transformation, con-.

ICAR-National Bureau Of Soil Survey And Land Use Planning.pdf ...
Page 1 of 2. Page 1 of 2. भा.कृ. अनु. प. - मृदा एवं भूिम उपयोग िनयोजन. , , - ५६० ०२४. ICAR- NATIONAL BUREAU OF SOIL SURVEY AND LAND USE ...

quality control of soil survey - Blog UGM
soil map and the purpose of the production of the soil map must be taken into consideration when defining the standard of quality. The case of seruyan, Central ...

quality control of soil survey - Faperta UGM
PREDICTIVE ACCURACY OF SOIL MAPS. The predictive accuracy of a soil map is a measure of its quality. Quality as is not related so much to the amount of information contained n the map, the realibility of the information ..... usually belongs to a com

quality control of soil survey - Faperta UGM
nobody actually maps soil by units which are spesified by surface and subsurface properties. It is not feasible to follow on the ground the actual boundary of the properties that are only present to the subsoil. Soil mappers have to rely upon outside

neural computing methods to determine the relevance of memory ...
current networks ofthe Elman type, have been applied to the Joint European Torus (JET) database to extract these potential memory effectsfrom the time series ofthe avail able signals. Both architectures can detect the depen dence on the previous evol

The Magnitude of the Task Ahead: Macro Implications ...
May 18, 2016 - and inference in macro panel data which are at the heart of this paper. ..... For our empirical analysis we employ aggregate sectoral data for ...

Using the Quality control to determine the Factors of Failure Operation ...
Using the Quality control to determine the Factors of Failure Operation in cement Sector.pdf. Using the Quality control to determine the Factors of Failure ...

Using aircraft measurements to estimate the magnitude ...
cloud-free conditions of the biomass burning aerosol characterized by measurements made ...... and therefore offered an explanation of the discrepancy in.

PDF DOWNLOAD The Use of Data in School ...
Data can make the difference for today's embattled school counselling programs, and this insightful book shows how to collect and manage it. School counseling ...

Evaluating the use of whalewatch data in determining ...
To study the Pager Network data, a land-based survey was designed in order ... number of adult and adolescent males, number of calves and ..... Summer social.

Best Use of Data - CaaNZ
Best Use of Data category in the Beacon Awards. ... Campaign Objective ... MBM's strategy was to close the purchase loop by using data to create display.

Using the Quality control to determine the Factors of Failure Operation ...
Using the Quality control to determine the Factors of Failure Operation in cement Sector.pdf. Using the Quality control to determine the Factors of Failure Operation in cement Sector.pdf. Open. Extract. Open with. Sign In. Main menu. Displaying Using

The measure of an ecosystem's capacity to increase soil carbon
May 7, 2004 - Department of Geology and Geophysics, Boston College, Devlin Hall, 213, Chestnut Hill, ...... 173–176, Academic, San Diego, Calif.

The measure of an ecosystem's capacity to increase soil carbon
May 7, 2004 - adding an additional term to the steady state flux: SCI. П .... Processes that add carbon to soil include input ..... This theme is echoed by Harte.

The use of remotely sensed data and innovative ...
aNOAA Atlantic Oceanographic and Meteorological Laboratory, Miami, FL b Goddard Earth Science and Technology Center, University of Maryland, Baltimore ...

The use of remotely sensed data and innovative ...
The development of advanced next-generation models in combination new .... As an illustration of the potential of AIRS to improve tropical cyclone prediction, figure 4 .... and better models and data assimilation techniques is being performed.

Trajectories of symbolic and nonsymbolic magnitude processing in the ...
Trajectories of symbolic and nonsymbolic magnitude processing in the first year of formal schooling.pdf. Trajectories of symbolic and nonsymbolic magnitude ...

The Size of the LGBT Population and the Magnitude of ...
The veiled method increased self-reports of antigay sentiment, particularly in the workplace: respondents were 67% more ... cational investment, the demand for children, and the gender-based divisions of labor.5 Data on the LGBT ...... J. Forecasting