poissonisfish

2018-10-30T18:14:41+00:00

I went over this site and I believe you have a lot of great info; saved to bookmarks (:

2018-10-30T18:27:15+00:00

Thanks!

2019-12-05T20:34:57+00:00

Dear Francisco,

I am analyzing agricultural trial data pooled from different locations (the same treatment list across different sites). I was advised to build my data model as below when pooling the same treatment response (e.g., yield) from different sites:

| SOV | Df |
|---|---|
| Location | l-1 |
| Rep(Loc) | l(r-1) |
| Treatment | t-1 |
| Location*Treatment | (l-1)(t-1) |
| Error | l(r-1)(t-1) |
| Total | lrt-1 |

To do the mean separation test I was told to calculate the LSD using df and MSE from the interaction term (Location*Treatment).

Does anyone here know the R code to fit the mixed model above and run the LSD test using the df and MSE of the interaction term? If so, could you send it to my personal email, wangtao.home@gmail.com?

Many thanks!!

2019-12-05T21:32:38+00:00

Hi Tao,
Apologies, but I did not understand your query. If you clarify it, I may be able to provide some comments! Regards,
Francisco

2022-02-05T23:48:59+00:00

I’ve gone through this and am not seeing the negative correlation (-0.61) between random intercept and slope. Am I missing something?

2022-02-07T08:46:38+00:00

Hi Zach, do you get a value close to -0.61? This could be a precision problem, potentially owing to new package releases. Do the plots look the same or nearly identical?
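
For readers who want to check this number on their own fit: in nlme, the intercept-slope correlation is printed in the Corr column of VarCorr() and of summary(). The sketch below uses the Orthodont data that ships with nlme as a stand-in, not the Arabidopsis model from the post. The model formula and the object names fit, V and r are illustrative assumptions.

```r
library(nlme)

# Stand-in model (not the post's): random intercept and random slope for age,
# grouped by Subject, on the Orthodont data shipped with nlme
fit <- lme(distance ~ age, data = Orthodont,
           random = ~ age | Subject, method = "ML")

# Printed table of random-effect variances; the Corr column holds the
# intercept-slope correlation
VarCorr(fit)

# Same quantity computed from the random-effects covariance matrix
V <- unclass(getVarCov(fit))
r <- V[1, 2] / sqrt(V[1, 1] * V[2, 2])
r
```

For a nested model such as lmm6.2, the simplest route is probably to read the value directly from VarCorr() or summary().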

2022-03-08T08:36:38+00:00

Hi Francisco,

Fantastic article; it has really helped me develop my LMM. I'm wondering, however, if you could clarify something for me:

You noted that the data should be in long format for this. Can you confirm at what stage of the outlined procedure this is required? (i.e., can I fit regular LMs using long data too, or should the data only be transformed from wide to long when I am ready to look at the LMMs?)

Thanks in advance,
Tom

2022-03-08T09:08:58+00:00

Hi Tom, thanks for the kind words! Whether you need to melt or reshape from wide to long format varies case by case; here it was not necessary. For the sake of argument, think of a time series that lists timepoints as separate columns. In that case you would need to stack the values (e.g. seed size) in a single column and create a new column designating the timepoints, so that their order can be passed to the LMM. There is no right or wrong way to reshape a dataset; it all depends on your hypothesis and on how you want to encode the information. In my experience, long format works better for most LMMs, but it is not strictly necessary. By the way, if this topic interests you, I suggest you have a look at my Bayesian modelling tutorial, a fantastic alternative to LMMs which I find more satisfying and intuitive. Hope this helps!
Francisco
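
The reshaping described in the reply above can be sketched in base R as follows. This is a minimal illustration on made-up data, not part of the original tutorial: the data frame wide, its columns (plant, t1, t2, t3) and the new column names seed_size and timepoint are all hypothetical.

```r
# Hypothetical wide data: one row per plant, one column per timepoint
wide <- data.frame(
  plant = c("p1", "p2", "p3"),
  t1 = c(1.2, 0.9, 1.5),
  t2 = c(1.8, 1.1, 2.0),
  t3 = c(2.5, 1.6, 2.7)
)

# Stack the timepoint columns into a single 'seed_size' column and
# record which timepoint each value came from
long <- reshape(
  wide,
  direction = "long",
  varying   = c("t1", "t2", "t3"),
  v.names   = "seed_size",
  timevar   = "timepoint",
  times     = 1:3,
  idvar     = "plant"
)

# Sort by plant, then timepoint, and drop the generated row names
long <- long[order(long$plant, long$timepoint), ]
rownames(long) <- NULL
long
```

Each plant now contributes one row per timepoint, so a model formula can refer to seed_size, timepoint and plant directly.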

	# Install (if necessary) and load nlme and lme4
	library(nlme)
	library(lme4)
	# Load dataset, inspect size and additional info
	data(Arabidopsis)
	dim(Arabidopsis) # 625 observations, 8 variables
	?Arabidopsis
	attach(Arabidopsis)

	# Overview of the variables
	par(mfrow = c(2,4))
	barplot(table(reg), ylab = "Frequency", main = "Region")
	barplot(table(popu), ylab = "Frequency", main = "Population")
	barplot(table(gen), ylab = "Frequency", las = 2, main = "Genotype")
	barplot(table(rack), ylab = "Frequency", main = "Rack")
	barplot(table(nutrient), ylab = "Frequency", main = "Nutrient")
	barplot(table(amd), ylab = "Frequency", main = "AMD")
	barplot(table(status), ylab = "Frequency", main = "Status")
	hist(total.fruits, col = "grey", main = "Total fruits", xlab = NULL)

	# Transform the three factor variables gen, rack and nutrient
	Arabidopsis[,c("gen","rack","nutrient")] <- lapply(Arabidopsis[,c("gen","rack","nutrient")], factor)
	str(Arabidopsis)
	# Re-attach after correction, ignore warnings
	attach(Arabidopsis)
	# Add 1 to total fruits before taking logs, otherwise log(0) yields -Inf
	total.fruits <- log(1 + total.fruits)

	# gen x popu table
	table(gen, popu)
	# Any NAs?
	any(is.na(Arabidopsis)) # FALSE

	LM <- lm(total.fruits ~ rack + nutrient + amd + status)
	summary(LM)
	par(mfrow = c(2,2))
	plot(LM)

Linear mixed-effect models in R

Mixed-effect linear models

Let’s get started with R

Formula syntax basics

Classic linear model

Generalized linear model

Optimal random structure

Optimal fixed structure

Fit optimal model with REML

Conclusions

Wrap-up

Citation

10 thoughts on “Linear mixed-effect models in R”

	GLM <- gls(total.fruits ~ rack + nutrient + amd + status,
	           method = "ML")
	summary(GLM)

	lmm1 <- lme(total.fruits ~ rack + nutrient + amd + status,
	            random = ~1|reg, method = "ML")
	lmm2 <- lme(total.fruits ~ rack + nutrient + amd + status,
	            random = ~1|popu, method = "ML")
	lmm3 <- lme(total.fruits ~ rack + nutrient + amd + status,
	            random = ~1|gen, method = "ML")
	lmm4 <- lme(total.fruits ~ rack + nutrient + amd + status,
	            random = ~1|reg/popu, method = "ML")
	lmm5 <- lme(total.fruits ~ rack + nutrient + amd + status,
	            random = ~1|reg/gen, method = "ML")
	lmm6 <- lme(total.fruits ~ rack + nutrient + amd + status,
	            random = ~1|popu/gen, method = "ML")
	lmm7 <- lme(total.fruits ~ rack + nutrient + amd + status,
	            random = ~1|reg/popu/gen, method = "ML")
	anova(GLM, lmm1, lmm2, lmm3, lmm4, lmm5, lmm6, lmm7)

	# Set optimization pars
	ctrl <- lmeControl(opt="optim")
	lmm6.2 <- update(lmm6, .~., random = ~nutrient|popu/gen, control = ctrl)
	lmm7.2 <- update(lmm7, .~., random = ~nutrient|reg/popu/gen, control = ctrl)
	anova(lmm6, lmm6.2, lmm7, lmm7.2) # both models improved
	anova(lmm6.2, lmm7.2) # similar fit; lmm6.2 more parsimonious
	summary(lmm6.2)

	# QQ plots (drawn to the same scale!)
	par(mfrow = c(1,2))
	lims <- c(-3.5,3.5)
	qqnorm(resid(GLM, type = "normalized"),
	       xlim = lims, ylim = lims, main = "GLM")
	abline(0, 1, col = "red", lty = 2)
	qqnorm(resid(lmm6.2, type = "normalized"),
	       xlim = lims, ylim = lims, main = "lmm6.2")
	abline(0, 1, col = "red", lty = 2)

	lmm8 <- update(lmm6.2, .~. + nutrient:amd)
	summary(lmm8)
	anova(lmm8, lmm6.2)

	finalModel <- update(lmm6.2, .~., method = "REML")
	summary(finalModel)

	dev.off() # Reset previous graphical pars
	# New GLM, updated from the first by estimating with REML
	GLM2 <- update(GLM, .~., method = "REML")
	# Plot side by side, beta with respective SEs
	plot(coef(GLM2), xlab = "Fixed Effects", ylab = expression(beta), axes = FALSE,
	     pch = 16, col = "black", ylim = c(-.9,2.2))
	stdErrors <- coef(summary(GLM2))[,2]
	segments(x0 = 1:6, x1 = 1:6, y0 = coef(GLM2) - stdErrors, y1 = coef(GLM2) + stdErrors,
	col = "black")
	axis(2)
	abline(h = 0, col = "grey", lty = 2)
	axis(1, at = 1:6,
	labels = c("Intercept", "Rack", "Nutrient (Treated)","AMD (Unclipped)","Status (PP)",
	"Status (Transplant)"), cex.axis = .7)
	# LMM
	points(1:6 + .1, fixef(finalModel), pch = 16, col = "red")
	stdErrorsLMM <- coef(summary(finalModel))[,2]
	segments(x0 = 1:6 + .1, x1 = 1:6 + .1, y0 = fixef(finalModel) - stdErrorsLMM,
	y1 = fixef(finalModel) + stdErrorsLMM, col = "red")
	# Legend
	legend("topright", legend = c("GLM","LMM"), text.col = c("black","red"), bty = "n")
