The inlautils from timcdlucas

Make Stepinla work and be tested

inlaSDM has lots of hefty dependencies

Should probably "suggests" dismo for example. Or replace it.

INLAsdm with step = TRUE doesn't return a formula.

INLAstep test forward/backward zero, >0 retained variables

Print methods for most objects

The printing for most of these objects is a mess

inlaSDM with stepwise regression working

Requires #32

plot and autoplot method for projector objects.

inla.mesh.projector(mesh, dims = c(100, 100))

Should plot the raster and probably the mesh (at least raw data) over the top with options to turn stuff off.

Deal with SpatialPolygons vs SpatialPolygonsDataFrame properly in parallel raster extract

stepinla breaks if it gets to 0 predictors.

Some errors

library(INLA)
library(INLAutils)

data <- data.frame(y = rpois(100, 10), x1 = rnorm(100))

data$x2 <- sin(data$y / 2) + rnorm(100, sd = 0.1)

ggplot(data, aes(y, x2)) + geom_point()

model <- inla(y ~ x1 + x2, data = data, family = 'poisson')

autoplot(model)

Plot the distribution that random effects get their slopes/intercepts from with a rug showing location of groups

INLAsdm cross_validation working

data stack in INLA step

stk <- inla.stack(data = list(y = dataframe$y), 
                  A = list(A, 1),
                  effects = list(s.index,
                                 list(y.intercept = rep(1, length(dataframe$y)),
                                      covariate = dataframe[-1])), 
                  tag='est')
                  
stk <- inla.stack(data = list(y = dataframe$y),
                  A = list(A, 1), 
                  effects = list(c(s.index, list(y.intercept = 1)),
                                 list(dataframe[-1])),
                  tag='est')

I think both of these formats work in inla but only the second seems to work in INLAstep. No idea why.

Run plot with other examples

What do other model types get from plot. Need to check stuff works ok.

tests for which need to remove which values

data(Tokyo)
summary(Tokyo)

## Define the model
formula = y ~ f(time, model="rw2", cyclic=TRUE, param=c(1,0.0001)) - 1

## The call to inla
result = inla(formula, family="binomial", Ntrials=n, data=Tokyo)

autoplot(result)

fails because which = 1 doesn't work for whatever reason. It gives the error but then doesn't remove 1 from which.

Plot priors

Add option to plot priors.

I guess this should be for all plots where that makes sense.

It should default to off as the default prior is just mega flat

Error on priors = T with binomial data.

  inla.model <- inla(formula, 
                     family = 'binomial',
                     data = inla.stack.data(inla_point_stack, spde_full = spde_full),
                     Ntrials = examined,
                      control.family = list(link = "probit"),
                     control.predictor = list(A = inla.stack.A(inla_point_stack), compute=TRUE, link = 1)
                     )

autoplot(inla.model, which = 1, priors = T)

errors. Not sure why. This is the model with crazy posteriors.

plot.inla counts the number of random effect levels

It counts and does lines if lots and boxplots if few. Do we want to do this?

Quite possibly not.

autoplot to do

Axis titles
Tick labels i.e. 1:250 is not useful. Cut down to 10 max.
Understand what the fitted values and linear predictor plots actually are. Then make funciton names and docs fit.
reconsider CI. Darker grey inside, etc.

autoplot.mesh is a pain to add data too

Currently have to set all other aes to NULL.

Autoplot with smoother

formula3 = Y ~ f(region.struct, model="besag", graph.file = g) +
           f(region, model = "iid") + f(x, model = "rw2")

result3 = inla(formula3, family = "poisson", data = Germany, E = E)
autoplot(result3)

The smoother has x axis label "ID" which doesn't make sense.

Plot mixed effects regression slopes from inla object

inlasloo fails if lat or long is called 'y'


library(sp)
data(meuse)




coords <- meuse[, c('x', 'y')] %>% scale
dataf1 <- sp::SpatialPointsDataFrame(coords = coords, data = meuse[, -c(1:2)])

mesh <- inla.mesh.2d(loc = sp::coordinates(dataf1), max.edge = c(0.2, 0.5), cutoff = 0.1)
spde <- inla.spde2.matern(mesh, alpha=2) # SPDE model is defined
A <- inla.spde.make.A(mesh, loc = sp::coordinates(dataf1)) # projector matrix
dataframe <- data.frame(dataf1) # generate dataframe with response and covariate
modform <- cadmium ~ -1 + y.intercept + ffreq + om + soil + lime + f(spatial.field, model = spde)
modform2 <- cadmium ~ -1 + y.intercept + ffreq + om + soil + lime

# make index for spatial field
s.index <- inla.spde.make.index(name="spatial.field",n.spde=spde$n.spde)

## Prepare the data
stk <- inla.stack(data=list(cadmium=dataframe$cadmium),
                      A=list(A,1), 
                      effects=list(c(s.index,list(y.intercept=1)),
                                   list(dataframe[, 7:10])),
                      tag='est')

out <- inla(modform, family = 'normal', Ntrials = 1,
            data = inla.stack.data(stk, spde = spde),
            control.predictor = list(A = inla.stack.A(stk), link = 1),
            control.compute = list(config = TRUE), 
            control.inla = list(int.strategy = 'eb'))
out.field <- inla.spde2.result(out,'spatial.field', spde, do.transf = TRUE)
range.out <- inla.emarginal(function(x) x, out.field$marginals.range.nominal[[1]])

# parameters for the SLOO process
ss <- 20 # sample size to process (number of SLOO runs)
# define the radius of the spatial buffer surrounding the removed point. 
rad <- min(range.out, max(dist(coords)) / 4) 
# Make sure it isn't bigger than 25% of the study area (see Le Rest et al.(2014))
alpha <- 0.05 # rmse and mae confidence intervals (1-alpha)

# run the function to compare both models
cv <- inlasloo(dataframe = dataframe, 
               long = 'x', lat = 'y',
               y = 'cadmium', ss = ss, 
               rad = rad, 
               modform = list(modform, modform2),
               mesh = mesh, family = 'normal',
               mae = TRUE)

because of the funky ordering of these lines.

    colnames(dataframe)[colnames(dataframe) == y] <- "y"
    colnames(dataframe)[colnames(dataframe) == long] <- "long"
    colnames(dataframe)[colnames(dataframe) == lat] <- "lat"

Add tests for plots

Mega basic plots to check that functions don't error.

Will bump up test coverage which is nice.

Worth doing to set out what needs to be in model autoplot. Combine with going through INLA plot function.

Interface to bayesplot plot

Copy pit plot from here.

https://arxiv.org/pdf/1806.02748.pdf

autoplot with multiple likelihood models


## An example with three independent AR(1)'s with separate means, but
## with the same hyperparameters. These are observed with three
## different likelihoods.

n = 100
x1 = arima.sim(n=n, model=list(ar=c(0.9))) + 0
x2 = arima.sim(n=n, model=list(ar=c(0.9))) + 1
x3 = arima.sim(n=n, model=list(ar=c(0.9))) + 2

## Binomial observations
Nt = 10 + rpois(n,lambda=1)
y1 = rbinom(n, size=Nt, prob = exp(x1)/(1+exp(x1)))

## Poisson observations
Ep = runif(n, min=1, max=10)
y2 = rpois(n, lambda = Ep*exp(x2))

## Gaussian observations
y3 = rnorm(n, mean=x3, sd=0.1)

## stack these in a 3-column matrix with NA's where not observed
y = matrix(NA, 3*n, 3)
y[1:n, 1] = y1
y[n + 1:n, 2] = y2
y[2*n + 1:n, 3] = y3

## define the model
r = c(rep(1,n), rep(2,n), rep(3,n))
rf = as.factor(r)
i = rep(1:n, 3)
formula = y ~ f(i, model="ar1", replicate=r, constr=TRUE) + rf -1
data = data.frame(y, i, r, rf)

## parameters for the binomial and the poisson
Ntrial = rep(NA, 3*n)
Ntrial[1:n] = Nt
E = rep(NA, 3*n)
E[1:n + n] = Ep

result = inla(formula, family = c("binomial", "poisson", "normal"),
              data = data, Ntrial = Ntrial, E = E,
              control.family = list(
                      list(),
                      list(),
                      list(initial=0)))

gives

Error in eval(expr, envir, enclos) : object 'X0.975quant' not found

Ribbons rather than dashes for 95% confidence?

Probably.

Vignettes

Going to want fairly extensive vignettes:

General INLA usage
Spatial INLA usage
inlaSDM analysis

Function to makes GAMs easy.

Again might just be a function to build the formula.

Forward selection is not working

  # Try and make a dataset where a variable WILL get added.
  Epil2 <- Epil
  Epil2$Base <- Epil$y + rnorm(nrow(Epil), sd = 0.01)
  Epil2$Age <- Epil$y + rnorm(nrow(Epil), sd = 0.01)
  
  stack2 <- inla.stack(data = list(y = Epil2$y),
                      A = list(1),
                      effects = list(data.frame(Intercept = 1, Epil2[3:5])))
  
  result1 <- INLAstep(fam1 = "poisson", 
                      Epil2,
                      in_stack = stack2,
                      invariant = "0 + Intercept",
                      direction = 'forwads',
                      include = 3:5,
                      y = 'y',
                      y2 = 'y',
                      powerl = 1,
                      inter = 1,
                      thresh = 0.001)

Does the sloo readme make sense?

Hi @pyt215 I'm just tidying up the readme and reknitting it.

https://github.com/timcdlucas/INLAutils/tree/dev

I think the code that was in there wasn't working. So I took code from the examples. But now I'm a bit confused because it looks like it has binomial response data, fitted with a normal likelihood in the first case and with a gamma in the sloo line?

I'll try and work it out, or maybe you could let me know what it should be.

Surg example causes autoplot error

data(Surg)
formula = r ~ f(hospital,model="iid",param=c(0.001,0.001))
mod.surg = inla(formula,data=Surg,family="binomial",Ntrials=n)

autoplot(mod.surg)

breaks

Nice way to plot mean and stdv of spatial projection

http://people.bath.ac.uk/fl353/isba/isbaspde.R

levelplot(row.values=proj$x,
          column.values=proj$y,
          x=inla.mesh.project(proj,
          inla.result$summary.linear.predictor$mean[index[mesh.index$field.repl==1]]),
          xlim=c(0,1),
          ylim=c(0,1),
          col.regions=cp, at=at+5,
          aspect="iso",
          contour=FALSE, labels=FALSE, pretty=TRUE,
          xlab=NULL,ylab=NULL,scales=list(draw=FALSE))

Projects that depend on INLA

http://scisoft-net-map.isri.cmu.edu/application/INLA/gitprojects

Worth scouring for what they use/need regularly.

INLA dwplot.

dot whisker plot for effects. Probably useful?

Credible intervals in autoplot

Do em. Potentially highest density credible intervals too.

Create a better "make graph" function.

Random effects sorted

Might be useful to sort the random effects in autoplot. This will easily show the range of the random effects.

Switch ggfortify for cowplot

autoplot should

print with grid_arrange or something.
return the plain plot list.

Documentation should show

p <- autoplot()
p[1] <- p[1] + theme_grey()
plot_grid(p)

Does Ntrials break autoplot?

  formula <- positive ~ x1 + x2
  
  
  xx = data.frame(x1 = rnorm(length(pointcases$examined)), x2 = rnorm(length(pointcases$examined)))
  inla_point_stack <- inla.stack(tag = 'est', 
                                 data = list(positive = pointcases$examined,
                                             examined = pointcases$examined),
                                 A = list(1), 
                                 effects = list(xx))
                                 
  
  inla.model <- inla(formula, 
                     family = 'binomial',
                     data = inla.stack.data(inla_point_stack),
                     Ntrials = examined
                     )

timcdlucas / inlautils Goto Github PK

inlautils's Introduction

INLAutils

Installation

Overview

Plotting

Analysis

Spatial leave-one-out cross-validation (sloo-cv)

inlautils's People

Contributors

Stargazers

Watchers

Forkers

inlautils's Issues

Recommend Projects

Recommend Topics

Recommend Org