Final Part 1—Solution

lakes <- read.table('ecol 561/lakes.txt', header=T, sep='\t')

Question 1

The data are currently in "lake-level" format in which each lake has a single record. We need to put this into "lake-period" format with multiple records for each lake, in this case one record for each measurement occasion. This requires unlisting the records X1976, X1977, X1978, and X1981 stacking them into a single column, replicating the Lake, Latitude, and Longitude variables four times, and adding a new column that lists the year the measurement was taken.

newlakes <- data.frame(rep(lakes$Lake,4), rep(lakes$Latitude,4), rep(lakes$Longitude,4), unlist(lakes[,4:7]), rep(c(1976:1978,1981), rep(dim(lakes)[1],4)))

colnames(newlakes) <- c('lake', 'latitude', 'longitude', 'SO4', 'year')

Choosing a probability model for the response

There are a number of guidelines for helping us choose a probability model for the response variable SO₄. I repeat some of these here.

Clearly SO₄ is a continuous variable and, being a concentration, is bounded below by zero (but in principle unbounded above). Three continuous distributions mentioned at various times in the course are the normal, lognormal, and gamma distributions. Both the lognormal and gamma are positive data distributions and are bounded below by zero, while the normal distribution is unbounded. Of course if the data are displaced far enough from zero, then the fact that the normal distribution is unbounded below may not be a problem. The easiest way to obtain a lognormal distribution is to log-transform a response variable and assume the log-transformed variable has a normal distribution.

Because the goal is to model SO₄ concentration against year the probability model we're seeking must hold at each year separately rather than in the aggregate. Because both the lognormal and gamma distributions tend to be skewed while the normal is symmetric, a histogram of SO₄ concentration at each year is a useful display.

par(mar=c(5.1,3.1,1.1,1.1))

sapply(c(1976:1978,1981), function(x) hist(newlakes$SO4[newlakes$year==x], xlab=expression("SO"[4]), main=x, col='lightblue'))

From Fig. 1 it's pretty clear that a normal distribution is likely to be inappropriate. The actual distributions are skewed and have many values close to zero. Thus a model based on a normal distribution is likely to predict negative concentrations. Both the lognormal and gamma distributions have the same mean-variance relation (quadratic), so the best way to evaluate them is to actually use them in a model and compare the results with AIC. Even though we know a normal model is probably inappropriate I include it among the candidate models.

I start by fitting each model with year as a linear predictor. The normal and lognormal models can be fit with lm, the lognormal by first log-transforming the response SO₄. The gamma distribution is fit with glm and the family=Gamma argument.

out.norm <- lm(SO4~year, data=newlakes)

out.gamma <- glm(SO4~year, data=newlakes, family=Gamma)

out.lnorm <- lm(log(SO4)~year, data=newlakes)

The log-likelihood of the log-transformed model is not directly comparable to models with an untransformed response without additional work. We wrote a function in lecture 11 called norm.loglike5 that carries out the necessary steps and I reproduce it below. We did two versions of the function. Here we need the version that is based on a probability density function (rather than a probability) because we've comparing density functions.

norm.loglike5 <- function(model, k, y) {

sigma2 <- (sum(residuals(model)^2))/length(y)

prob <- dnorm(log(y+k), mean=predict(model), sd=sqrt(sigma2)) * 1/(y+k)

loglike <- sum(log(prob))

To get this function to work we need to give it a version of the data set in which the missing values of the response are removed. (These observations are automatically removed by the lm function when a model involving them is fit.) We can do this by subsetting the data set using the !is.na( ) construction. The first argument to the norm.loglike5 function is the model, the second is 0 because we didn't add a constant to the response, and the third is the response variable with missing observations removed.

out.loglike <- norm.loglike5(out.lnorm, 0, newlakes$SO4[!is.na(newlakes$SO4)])

Each of the models has three estimated parameters: (β₀, β₁, and σ²) for the normal and lognormal and (β₀, β₁, and scale) for the gamma, so we could just as well compare log-likelihoods rather than AIC.

-2*out.loglike[[1]] + 2*3

The lognormal model has the lowest AIC by a sizeable amount, so we'll proceed with a lognormal probability model for the response. Fig. 2 shows the three fitted distributions separately by year superimposed on histograms of the data.

histogram(~SO4|factor(year), data=newlakes, col='grey85', type='density', ylim=c(0,.35), xlab=expression("SO"[4]), panel=function(x, subscripts, ...) {
panel.histogram(x,...)

myscale <- summary(out.gamma)$dispersion

cur.mu <- fitted(out.gamma)[subscripts][1]

myshape <- cur.mu/myscale

panel.mathdensity(dmath = dgamma, args = list(shape=myshape, scale=myscale), col=4, n=100)

cur.mulog <- predict(out.lnorm)[subscripts][1]

lsigma <- summary(out.lnorm)$sigma

panel.mathdensity(dmath = dlnorm, args = list(meanlog=cur.mulog, sdlog=lsigma), col=2, n=100)

mu <- predict(out.norm)[subscripts][1]

sigma <- summary(out.norm)$sigma

panel.mathdensity(dmath = dnorm, args = list(mean=mu, sd=sigma), col='seagreen', n=100)

} , key=list(x=.7, y=.8, corner=c(0,0), text=list(c('normal', 'lognormal', 'gamma'), cex=.85), lines=list(col=c('seagreen', 2, 4), lty=1)) )

Choosing the form of the linear predictor

The variable year has four unique values. Hence there are a limited number of choices for the linear predictor. Without any theory to guide us we can

Each model was fit assuming a lognormally distributed response. AIC can be used to compare the models without any adjustment because all three models use the same response, log SO₄.

out.lnorm2 <- lm(log(SO4)~year+I(year^2), data=newlakes)

out.lnorm3 <- lm(log(SO4)~factor(year), data=newlakes)

sapply(list(out.lnorm, out.lnorm2, out.lnorm3), AIC)

Clearly the linear model is to be preferred. It has both the lowest AIC and is simpler. To see what's going on I graph the data (jittered) and superimpose the three models. For the separate means model I just plot the estimated mean at each year as predicted by the model. The graphs of the three models are indistinguishable.

plot(log(SO4)~jitter(year), data=newlakes, xlim=c(1975.5,1981.5), xlab='year', ylab=expression(log('SO'[4])))

abline(out.lnorm, col='grey80', lwd=5)

curve(coef(out.lnorm2)[1]+ coef(out.lnorm2)[2]*x+ coef(out.lnorm2)[3]*x^2, col=2, lty=2, add=TRUE)

points(c(1976:1978,1981), c(coef(out.lnorm3)[1], coef(out.lnorm3)[1]+ coef(out.lnorm3)[2], coef(out.lnorm3)[1]+ coef(out.lnorm3)[3], coef(out.lnorm3)[1]+coef(out.lnorm3)[4]), col=3, pch=16, cex=2)

legend(1979,2.5, c('linear','quadratic','means'), lty=c(1,2,NA), lwd=c(3,1,1), pch=c(NA,NA,16), col=c('grey80',2,3), bty='n', cex=.9)

Fig. 3 Three lognormal models using different versions of the linear predictor for year

Our conclusions remaining the same even if we use a normal or a gamma probability model for the response. A model linear in year is best.

out.gamma1 <- glm(SO4~year+I(year^2), data=newlakes, family=Gamma)

out.gamma2 <- glm(SO4~factor(year) ,data=newlakes, family=Gamma)

sapply(list(out.gamma, out.gamma1, out.gamma2), AIC)

out.norm1 <- lm(SO4~year+I(year^2), data=newlakes)

out.norm2 <- lm(SO4~factor(year), data=newlakes)

sapply(list(out.norm, out.norm1, out.norm2), AIC)

For comparison I obtain the AIC of the lognormal models on the scale of the response variable.

LL.llnorm <- sapply(list(out.lnorm, out.lnorm2, out.lnorm3), function(x) norm.loglike5(x, 0, newlakes$SO4[!is.na(newlakes$SO4)]))

lnorm.df <- sapply(list(out.lnorm, out.lnorm2, out.lnorm3), function(x) length(coef(x)))+1

LL.AIC <- -2*LL.llnorm + 2*lnorm.df

Of course it is possible that some weird transformation of year is preferable to what we've done. To assess that possibility I plot the residuals from the linear model against year and superimpose a lowess curve.

plot(residuals(out.norm)~ jitter(newlakes$year[!is.na(newlakes$SO4)]), ylab='residuals', xlab='year')

lines(lowess(residuals(out.norm)~ jitter(newlakes$year[!is.na(newlakes$SO4)])), col=2)

Fig. 4 Plot of residuals from model linear in year against the predictor "year"

The plot provides no evidence that form of the linear predictor is inadequate. It's worth noting at this point that the modeled relationship between log(SO₄) and year is not statistically significant.

Coefficients:
            Estimate Std. Error t value Pr(>|t|)
(Intercept) 41.66739   49.32240   0.845    0.399
year        -0.02051    0.02493 -0.822    0.412

Residual standard error: 0.6167 on 166 degrees of freedom
Multiple R-Squared: 0.004057, Adjusted R-squared: -0.001942
F-statistic: 0.6763 on 1 and 166 DF, p-value: 0.4120

Question 2

The basic structure possessed by these these data is that they consist of repeated measures on individual lakes. Due to local geology and edaphic factors, we should expect measurements coming from the same lake, even separated by a time interval of a year or more, to be more similar to each other than to observations coming from other lakes. This provides the potential for what we've called observational heterogeneity—differing degrees of variability in subsets of observations. Observe that the repeated measures aspect of these data was imposed at the design level. Structure that arises from the sampling design must be accounted for in the analysis.

In addition to temporal correlation there is the possibility of spatial structure in these data as a function of their geographic proximities. We might expect lakes that are close to each other to share a similar chemistry. I consider structure of this sort to be inadvertent structure in these data. It arises because lakes are fixed objects that occupy space and the problem of their varying proximities will be an issue in even the most well-designed random selection scheme. The spatial structure may turn out to be important, but it is not intrinsic to the experimental design. As a result we'll account for the designed structure first and then deal with the spatial structuring if necessary.

The ideal way to display the temporal structure of these data is in a lattice graph in which we plot the SO₄ concentration versus year separately for each lake. Because we've already seen that a log transformation of the SO₄ concentration is justified, the most useful lattice graph plots log(SO₄) versus year.

xyplot(log(SO4)~year|lake, data=newlakes, ylab=expression(paste("SO"[4], ' Concentration', sep=' ')), layout=c(8,6), panel=function(x,y) {

panel.abline(lm(y~x), col=2)},

par.settings=list(axis.text=list(cex=.8)))

Fig. 5 Lattice graph displaying the repeated measures structure of the data set

Question 3

As Fig. 5 indicates there is a dramatic difference in baseline SO₄ concentrations across lakes. This is reflected in the widely different values of the intercepts in the different regression lines. We also see some evidence for heterogeneity in slopes across the lakes. The natural way to handle structured data such as these is with a multilevel model. Fig. 5 coupled with our early work using the complete pooling model suggests that a linear model relating log(SO₄) concentration to continuous time should be an adequate starting point.

Unconditional means model

The correct place to begin when fitting a multilevel model is with the common pooling (unconditional means) model. The common pooling model includes only an intercept but does allow that intercept to vary among the different lakes. It serves to partition the variance between and among lakes and acts as a benchmark that allows us to assess where most of the variability lies—between lakes or between years for the same lake. The common pooling model is formulated as follows.

unc.mean <- lme(log(SO4)~1, random=~1|lake, data=newlakes, method='ML', na.action=na.omit)

The unconditional means model is of interest only for its variance components. From these we can calculate the intraclass correlation coefficient.

as.numeric(VarCorr(unc.mean)[1,1])/ (as.numeric(VarCorr(unc.mean)[1,1])+ as.numeric(VarCorr(unc.mean)[2,1]))

So we see that 91% of the variability in log(SO₄) concentrations occurs between lakes rather than within lakes across years. Put another way measurements from the same lake exhibit a correlation of 0.91. This is dramatic evidence that a multilevel model is needed here and argues strongly against using a common pooling model.

Random intercepts model

The next step is to add the predictor year. As we've seen the individual trend lines in Fig. 4 exhibit quite a bit of variability in their intercepts. Thus a natural second model to consider is a random intercepts model, a linear population model in which the intercepts of the regression lines for individual lakes are allowed to vary but they still share a common slope.

random.ints <- lme(log(SO4)~year, random=~1|lake, data=newlakes, method='ML', na.action=na.omit)

AIC(out.lnorm, unc.mean, random.ints)

Observe the dramatic decrease in AIC relative to the common pooling model, strong evidence that we need to account for the repeated measures structure of the data set. (The AIC values are directly comparable here because all models use log(SO₄) as the response.) The random intercepts model exhibits a more modest decrease in AIC relative to the unconditional means model indicating that the inclusion of year has improved the fit of the model somewhat. Next we examine the variance components.

VarCorr(random.ints)

As measured by the small relative reduction in level-1 variance, year accounts for only a modest amount of within-lake variability.

(as.numeric(VarCorr(unc.mean)[2,1]) - as.numeric(VarCorr(random.ints)[2,1])) / as.numeric(VarCorr(unc.mean)[2,1])

So we've explained only 8% of the within-lake variability by including year as a predictor. Still the predictor year is statistically significant.

summary(random.ints)

Random effects:
Formula: ~1 | lake
(Intercept) Residual
StdDev: 0.5774515 0.1689954

Fixed effects: log(SO4) ~ year
Value Std.Error DF t-value p-value
(Intercept) 45.76822 13.897196 119 3.293342 0.0013
year -0.02259 0.007026 119 -3.215169 0.0017
Correlation:
(Intr)
year -1

Standardized Within-Group Residuals:
Min Q1 Med Q3 Max
-4.02214991 -0.47329909 -0.05670254 0.55159957 3.33206296

Random slopes and intercepts model

Another model we might consider is a random slopes and intercepts model. The lattice graph of Fig. 5 suggests there is some variability in individual slopes. Unfortunately this model is very difficult to fit. One of the pernicious features of fitting mixed effects models is that it's not always clear that there things have gone wrong. Recall that the random slopes and intercepts model takes the following form.

random.slopes <- lme(log(SO4)~year,random=~year|lake, data=newlakes, method='ML', na.action=na.omit)

VarCorr(random.slopes)

Based on the displayed correlation of 0 between the random slopes and intercepts and the estimated variance component of the slopes that is approximately 0 we see that R has converged to a solution in which the slopes don't vary. If we compare the log-likelihoods of the random slopes and intercepts model with that of the random intercepts model we see that they are identical!

sapply(list(random.ints, random.slopes), logLik)

It is extremely unlikely that introducing two additional parameters to the model has had no effect on the log-likelihood. Given that we've already seen from our lattice plot that the slopes do vary between lakes we should surmise that lme has converged to a local solution (the solution of the random intercepts model) rather than a global solution.

To remedy the situation we can try reparameterizing the predictor, perhaps by centering, and if that doesn't work to modify some of the control settings of the maximization algorithm. In the next model run I shift the origin to 1976, increase the default iteration settings, and switch the optimization method to optim.

random.intslopes <- lme(log(SO4)~I(year-1976),random=~I(year-1976)|lake, data=newlakes, method='ML', na.action=na.omit, control=lmeControl(maxIter=5000, msMaxIter=5000, niterEM=1000, msMaxEval=1000, opt='optim'))

logLik(random.slopes)

This time the log-likelihood has increases. However this model does not beat the random intercepts model.

AIC(out.lnorm, unc.mean, random.ints, random.intslopes)

Random slopes model

Another model we might consider is a random slopes model although the lattice graph of Fig. 5 suggests there isn't much variability in the slopes. A random slopes model is one in which the slopes of the regression lines for individual lakes are allowed to vary but they still share a common intercept.

To fit this model we need to explicitly remove the intercept from the random argument of lme.

random.slope <- lme(log(SO4)~year, random=~year-1|lake, data=newlakes, method='ML', na.action=na.omit)

AIC(out.lnorm, unc.mean, random.ints, random.intslopes, random.slope)

The table below summarizes the results. Because all the models being compared have log(SO4) as the response we can use AIC applied directly to the model object to compare them. The AIC values reported here cannot be used to compare these models against models with other probability models for the response. Note: the norm.loglike5 function we used above does not return the correct AIC when random effects are involved. A different function is necessary.

Other models

A model I don't include in the above list is the separate intercepts model, a model that estimates a separate intercept for each lake. This model would be fit as follows.

lm(log(SO4)~ year + factor(lake), data=newlakes)

In this model we would end up estimating 45 additional parameters, one intercept for each lake. Not surprisingly this model does turn to be far and away the best model in terms of log-likelihood and AIC, but given that we have at most four observations per lake using such a model is clearly an example of overfitting.

Another model considered by some people was a factor year model with either random intercepts or a random factor.

random.fact1 <- lme(log(SO4)~factor(year), random=~1|lake, data=newlakes, method='ML', na.action=na.omit)

random.fact2 <- lme(log(SO4)~factor(year), random=~factor(year)|lake, data=newlakes, method='ML', na.action=na.omit)

AIC(random.fact1, random.fact2)

This random factor model is the best-ranked model of all the ones we've considered but it is a little difficult to interpret. We now have four random effects per lake, one for each year.

Question 4

A multilevel model returns a population level model from which individual lakes are allowed to deviate. The manner in which the lakes are allowed to deviate from the population model depends on how the random effects portion was specified. The population model we obtained is the following.

The population model states that for every one year increment the average log(SO₄) concentrations are predicted to fall by 0.023. In a random intercepts model each of the lakes show this same trend but each lake is allowed to have its own intercept, a random deviation about the population intercept 45.768.

To put this into more understandable terms we can express this in terms of concentration units. If we exponentiate the population model equation we obtain an expression for the geometric mean (not the arithmetic mean) sulfate concentration.

The geometric mean also corresponds to the median of the lognormal distribution so we can interpret things in terms of either one. So in each subsequent year the model predicts the geometric mean sulfate concentration will be 98% of what is was the year before. Because of the way the model was formulated the intercept is currently not interpretable. It actually represents the geometric mean sulfate concentration in year 0 AD!! To fix this we can refit the model centering year at the first measured year 1976.

random.ints2 <- lme(log(SO4)~I(year-1976), random=~1|lake, data=newlakes, method='ML', na.action=na.omit)

So the average lake in 1976 had a geometric mean sulfate concentration of 3.1 that will decrease by a factor of 0.98 with each subsequent year over the range of the data.

The random factor model has a slightly more complicated interpretation. The factor model allows there to be a different mean log in each year.

fixef(random.fact2)[1] + c(0,fixef(random.fact2)[2:4])

Exponentiating these values yields four separate geometric mean sulfate concentrations (or median sulfate concentrations) in each year.

exp(fixef(random.fact2)[1]+c(0,fixef(random.fact2)[2:4]))

Table 3 Predictions from the random factor model

Questions 5–6

Because we have the geographic locations of the lakes we can use the semivariogram to assess spatial correlation. For question 5 we use the residuals from the complete pooling model and for question 6 the residuals from the random intercepts model. We need to remove the missing values from the original data set so that the residual vector and the latitude and longitude vectors are the same length. This also needs to be done to the vector of years. For the random intercepts model I use the Pearson residuals.

lakes.reduced <- newlakes[!is.na(newlakes$SO4),]

#assemble data in the form preferred by the geoR package

newdat1 <- data.frame(x=lakes.reduced$latitude, y=lakes.reduced$longitude, r=residuals(out.lnorm))

newdat2 <- data.frame(x=lakes.reduced$latitude, y=lakes.reduced$longitude, r=residuals(random.ints, type='pearson'))

years.reduced <- newlakes$year[!is.na(newlakes$SO4)]

Technically we should convert latitude and longitude measured in degrees to distance units. Since we're only operating on a limited geographic scale, this should not have a big effect so I stick with degrees. I do some initial experimentation to determine how wide to make the bins and what the maximum distance between the lakes is.

curdata <- newdat1[years.reduced==1976,]

geodat1 <- as.geodata(curdata)

geodata.v1 <- variog(geodat1, uvec=seq(0,8,.25), max.dist=8, option='bin')

From the output we see that by the time we reach a distance of 5 the number of observations per bin has dropped below 30, a number that's close to a minimum value for obtaining a reasonable estimate of the semivariogram. Thus we should not plot or calculate the semivariogram for distances beyond 5.

I fit and plot a semivariogram separately for each year (because duplicate coordinates are not allowed and it probably makes sense to treat years separately). To facilitate things I write a generic function that calculates and plots the semivariogram separately by year that then I sapply to a list of year values.

par(mar=c(5.1,4.1,2.1,1.1))

sapply(c(1976:1978, 1981), function(x) {

curdata <- newdat1[years.reduced==x,]

geodat1 <- as.geodata(curdata)

geodata.v1 <- variog(geodat1,uvec=seq(0,5,.25), max.dist=5, option='bin')

mtext(side=3, line=.5, paste(x,': Complete Pooling Model Residuals', sep=''), cex=.8)

Observe that each panel shows an increasing semivariance with distance that doesn't appear to level off. This is indicative of a non-stationary process, one in which the mean and variance are changing with distance. Nest I repeat these steps for the random intercepts model.

sapply(c(1976:1978, 1981), function(x) {

curdata <- newdat2[years.reduced==x,]

geodat1 <- as.geodata(curdata)

geodata.v1 <- variog(geodat1, uvec=seq(0,5,.25), max.dist=5, option='bin')

mtext(side=3, line=.5, paste(x,': Random Intercepts Model Residuals', sep=''), cex=.8)

It's pretty clear that while there was serious spatial correlation among the residuals in the complete pooling model of Problem 1 (as evidenced by the clear increasing trend in the semivariogram plot), this has more or less disappeared in the residuals of the random intercepts model. In Fig. 7 there appears to be only random scatter with no clear trend except perhaps for a slight trend in the years 1976 and 1978. Thus by accounting for the hierarchical nature of the data using a mixed effects model we may have also managed to account for much of the spatial correlation.

Question 7

A formal test of spatial correlation is the Mantel test. The Mantel correlation for this problem is the Pearson correlation of the distance between pairs of residuals and the geographic distance between pairs of lakes. The null distribution is obtained by randomly permuting the residuals among the lakes. We can't do the permutation to all of the residuals simultaneously because that would disrupt both their temporal and their spatial relationships. To do this correctly we would need to permute each four year block of residuals at each lake as a unit to keep their temporal patterns intact. Alternatively we can carry out the Mantel test separately by year. I write a function that selects the appropriate observations by year and carries out the Mantel test for that year.

my.mantel <- function(year) {

temp <- newdat2[years.reduced==year,]

dist.spat <- dist(temp[,1:2])

dist.r <- dist(temp[,3])

mantel(dist.spat, dist.r)

m.1976 <- my.mantel(1976)

m.1977 <- my.mantel(1977)

m.1978 <- my.mantel(1978)

m.1981 <- my.mantel(1981)

out <- t(sapply(list(m.1976, m.1977, m.1978, m.1981), function(x) cbind(x$statistic, x$signif)))

colnames(out)<-c('Mantel', 'p-value')

The Mantel test indicates that there is a significant residual spatial correlation in 1978.

Question 8

Given that the Mantel test is only just significant and the fact that we've had some success in reducing spatial correlation by including random intercepts in the model (which are necessarily spatially referenced since we have one per lake), an obvious choice is to include latitude and longitude in the model. We can just add them as predictors, add them as a response surface, or add them as smooths. I try both an additive smooth of latitude and longitude as well as a symmetrical two-dimensional smooth.

random.ints.add <- lme(log(SO4)~year + latitude + longitude, random=~1|lake, data=newlakes, method='ML', na.action=na.omit)

random.ints.resp <- lme(log(SO4)~year + latitude + longitude + I(latitude^2) + I(longitude^2) + I(latitude*longitude), random=~1|lake, data=newlakes, method='ML', na.action=na.omit)

random.ints.gam <- gamm(log(SO4)~year + s(latitude) + s(longitude), random=list(lake=~1), data=newlakes, method='ML', na.action=na.omit)

random.ints.gam2 <- gamm(log(SO4)~year + s(latitude,longitude), random=list(lake=~1), data=newlakes, method='ML', na.action=na.omit)

AIC(random.ints,random.ints.add, random.ints.resp, random.ints.gam$lme, random.ints.gam2$lme)

All of the models that include latitude and longitude improve on the random intercepts model. I extract the Pearson residuals from each model and carry out the Mantel test. To facilitate doing this repeatedly I organize everything as a function.

my.mantel2 <- function(year,r) {

temp <- newdat2[years.reduced==year,]

res <- r[years.reduced==year]

dist.spat <- dist(temp[,1:2])

mantel(dist.spat, dist.r)

#carry out test separately by year for each model

mantel.add <- t(sapply(list(1976, 1977, 1978, 1981), function(x) my.mantel2(x, residuals(random.ints.add, type='pearson'))))

mantel.resp <- t(sapply(list(1976, 1977, 1978, 1981), function(x) my.mantel2(x, residuals(random.ints.resp, type='pearson'))))

mantel.gam1 <- t(sapply(list(1976, 1977, 1978, 1981), function(x) my.mantel2(x, residuals(random.ints.gam$lme, type='pearson'))))

mantel.gam2 <- t(sapply(list(1976, 1977, 1978, 1981), function(x) my.mantel2(x, residuals(random.ints.gam2$lme, type='pearson'))))

out.mant <- cbind(mantel.add, mantel.resp, mantel.gam1, mantel.gam2)

rownames(out.mant) <- c(1976:1978,1981)

colnames(out.mant) <- c('add', 'p', 'resp', 'p', 'gam1', 'p', 'gam2', 'p')

Now none of the correlations are statistically significant. Any of the models that include latitude and longitude will work here.

Model	Linear predictor	k	log-likelihood	AIC
lognormal	linear year	3	–342.018	690.035
	quadratic year	4	–342.015	692.030
	categorical year	5	–342.014	694.028
gamma	linear year	3	–349.840	705.681
	quadratic year	4	–349.708	707.416
	categorical year	5	–349.605	709.210
normal	linear year	3	–390.424	786.849
	quadratic year	4	–390.333	788.667
	categorical year	5	–390.239	790.478

Lognormal Model	Predictor	# parameters	log-likelihood (for log y)	AIC (for log y)
common pooling	year	3	–159.159	318.318
unconditional means	—	3	–33.882	73.763
random intercepts	year	4	–28.866	65.732
random slopes and intercepts	year	6	–27.192	66.383
random slopes	year	4	–28.945	65.890

Year	mean log(SO₄)	median SO₄ (geometric mean)
1976	1.116679	3.055
1977	1.162403	3.198
1978	1.081715	2.950
1981	1.015036	2.759