Modelling Cognitive Processes

ggdmc is an R package for modelling cognitive processes. Its primary focus is on complex hierarchical Bayesian models fitted with Markov chain Monte Carlo (MCMC), but it also supports fitting cognitive models with traditional methods such as maximum likelihood estimation and least squares. The package uses population-based Markov chain Monte Carlo (pMCMC) for sampling.

Getting Started

This example showcases the Wiener diffusion model. For exploring other models, visit my tutorials site. The naming convention for functions in ggdmc aims to be clear and informative. For instance, BuildModel creates a model object.

As is common practice with Bayesian tools, always verify the results after fitting a model. In particular, the order of parameters in your parameter vector (p.vector) must match the order of the p.vector reported by BuildModel. Some built-in checks attempt to catch mismatches, but they are not foolproof, so double-check the order yourself.
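One way to guard against an ordering mistake is to reorder the vector by name before using it. The following is a minimal base-R sketch, not part of ggdmc. The vector expected_names is a hypothetical stand-in for the order the model reports; with a real model object it could, for example, come from GetPNames(model).

```r
## Hypothetical stand-in for the parameter order reported by the model;
## with a real model object this could come from GetPNames(model).
expected_names <- c("a", "v", "z", "t0")

## A parameter vector typed in a different order
p.vector <- c(v = 1.5, a = 1, t0 = 0.15, z = 0.5)

## Fail early if any parameter is missing or unexpected
stopifnot(setequal(names(p.vector), expected_names))

## Reorder by name so that positions match the expected order
p.vector <- p.vector[expected_names]
print(p.vector)
```

Indexing a named vector by a character vector returns the elements in the requested order, so the check and the reordering both rely only on names, never on positions.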

Fit a fixed-effect model to a participant

## Set up model ----
## Fixing sv & sz to 0 to set up a Wiener diffusion model
require(ggdmc)
model <- BuildModel(
  p.map     = list(a = "1", v="1", z="1", d="1", sz="1", sv="1", t0="1", 
                   st0="1"),
  match.map = list(M = list(s1 = "r1", s2 = "r2")),
  factors   = list(S = c("s1", "s2")),
  responses = c("r1","r2"),
  constants = c(st0 = 0, d = 0, sv = 0, sz = 0),  
  type      = "rd")   

npar <- model@npar   ## Note this works for version > 0.2.7.5; 
## npar <- length(GetPNames(model))   ## Use GetPNames instead in 0.2.6.0

p.vector <- c(a=1, v=1.5, z=0.5, t0=.15)
dat <- simulate(model, nsim = 50, ps = p.vector)
dmi <- BuildDMI(dat, model)

p.prior <- BuildPrior(
  dists = rep("tnorm", npar),
  p1=c(a=1, v=0, z=1, t0=1),
  p2=c(a=1, v=2, z=1, t0=1),
  lower = c(0, -5, rep(0, 2)),
  upper = rep(NA, npar))

## Fit model -------------
fit0 <- StartNewsamples(dmi, p.prior)
fit  <- run(fit0)

## Check model -----------
plot(fit)
plot(fit, den = TRUE)
plot(fit, pll = FALSE)
plot(fit, pll = FALSE, den = TRUE)

isconv <- gelman(fit)
est    <- summary(fit, recovery = TRUE, ps = p.vector, verbose = TRUE)
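The priors above use the "tnorm" distribution. As a rough illustration of what a truncated normal prior looks like, here is a base-R sketch. It is not ggdmc's implementation: the function name dtnorm_sketch is hypothetical, and it assumes that p1 is the mean and p2 the standard deviation of the untruncated normal, and that an unspecified bound means no truncation on that side.

```r
## Hypothetical helper: density of a normal truncated to [lower, upper]
dtnorm_sketch <- function(x, mean, sd, lower = -Inf, upper = Inf) {
  ## Probability mass of the untruncated normal inside the bounds
  mass <- pnorm(upper, mean, sd) - pnorm(lower, mean, sd)
  ## Rescale the density inside the bounds; zero outside
  ifelse(x >= lower & x <= upper, dnorm(x, mean, sd) / mass, 0)
}

## Prior on v from the example above: mean 0, sd 2, lower bound -5
dtnorm_sketch(c(-6, 0, 1.5), mean = 0, sd = 2, lower = -5)

## The density should integrate to about 1 over its support
area <- integrate(function(x) dtnorm_sketch(x, 0, 2, lower = -5), -5, Inf)
area$value
```

Plotting such a density against the true parameter values is one way to confirm that the values used to simulate data lie well inside the prior's support.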

Multilevel Modeling: Fixed and Random Effects for Multiple Participants

require(ggdmc)
model <- BuildModel(
  p.map     = list(a = "1", v ="1", z ="1", d ="1", sz ="1", sv ="1", t0 ="1", 
                   st0 ="1"),
  match.map = list(M = list(s1 = "r1", s2 = "r2")),
  factors   = list(S = c("s1", "s2")),
  responses = c("r1","r2"),
  constants = c(st0 = 0, d = 0, sv = 0, sz = 0),
  type      = "rd")

npar <- model@npar
pop.mean  <- c(a = 2,   v = 4,  z = 0.5, t0 = 0.3)
pop.scale <- c(a = 0.5, v = .5, z = 0.1, t0 = 0.05)
pop.prior <- BuildPrior(
    dists = rep("tnorm", npar),
    p1    = pop.mean,
    p2    = pop.scale,
    lower = c(0,-5,  0, 0),
    upper = c(5, 7,  1, 1))

## Simulate some data
dat <- simulate(model, nsub = 50, nsim = 30, prior = pop.prior)
dmi <- BuildDMI(dat, model)
ps <- attr(dat, "parameters")

p.prior <- BuildPrior(
    dists = rep("tnorm", npar),
    p1    = pop.mean,
    p2    = pop.scale*5,
    lower = c(0,-5, 0, 0),
    upper = c(5, 7, 1, 1))

plot(p.prior, ps = ps)  ## Check if all true values are in the range 

## Sampling separately
fit0 <- StartNewsamples(dmi, p.prior, ncore = 4)
fit  <- run(fit0, 5e2, ncore = 4)
fit  <- run(fit, 1e2, add = TRUE, ncore = 4)  ## add additional 100 samples

## Check model -----
isconv <- gelman(fit, verbose = TRUE)
plot(fit)
est0 <- summary(fit, recovery = TRUE, ps = ps, verbose = TRUE)

## Sampling hierarchically
mu.prior <- BuildPrior(
    dists = rep("tnorm", npar),
    p1    = pop.mean,
    p2    = pop.scale*5,
    lower = c(0,-5,  0, 0),
    upper = c(5, 7,  1, 1))

sigma.prior <- BuildPrior(
    dists = rep("beta", npar),
    p1    = c(a=1, v=1, z=1, t0=1),
    p2    = rep(1, npar),
    upper = rep(1, npar))

## !!!The names are important!!!
priors <- list(pprior = p.prior, location = mu.prior, scale = sigma.prior)
names(priors)
## [1] "pprior"   "location" "scale"

## Fit hierarchical model ----
fit0 <- StartNewsamples(dmi, priors)
fit  <- run(fit0, 5e2)

p0 <- plot(fit, hyper = TRUE)
p0 <- plot(fit, hyper = TRUE, den = TRUE, pll=FALSE)

## Check model -----------
## hgelman function is deprecated 
res  <- gelman(fit, verbose = TRUE)
est0 <- summary(fit, recovery = TRUE, ps = ps, verbose = TRUE)
est1 <- summary(fit, hyper = TRUE, recovery = TRUE, ps = pop.mean,  type = 1, verbose = TRUE)
est2 <- summary(fit, hyper = TRUE, recovery = TRUE, ps = pop.scale, type = 2, verbose = TRUE)

Response time models

1. The LBA model, type = "norm",
2. The DDM, type = "rd",
3. The Wiener diffusion, type = "rd" and set sv=0 and sz=0

PDA-based models

4. The Piecewise LBA model 0; CPU-based PDA likelihoods; type = "plba0",
5. The Piecewise LBA model 1; CPU-based PDA likelihoods; type = "plba1",
6. The Piecewise LBA model 0; GPU-based PDA likelihoods; type = "plba0_gpu",
7. The Piecewise LBA model 1; GPU-based PDA likelihoods; type = "plba1_gpu",
8. The LBA model; GPU-based PDA likelihoods; type = "norm_pda_gpu",
9. The leaky, competing accumulator model (Experimental!).

The PDA-based models (4 to 8) have been separated from the latest version of the package. For these models, see my BRM paper and the associated packages (OSF project). The leaky, competing accumulator model (9) is in a separate module, which has yet to be incorporated. See the LCA tutorial for its test results using MLE.

For details on the PLBA types, see Holmes, Trueblood, and Heathcote (2016).

Experimental (untested) models

2-D/circular drift-diffusion model, type = "cddm"
Prospective memory model, type = "norm" (see tutorial for more details)
Time-varying changes in other free parameters

Further information

A primary goal of ggdmc is compatibility with DMC, and objects from the two packages are similar in many ways. However, some structural differences exist. For instance, in the latest version of ggdmc, the theta and phi arrays have dimensions 'npar x nchain x nmc', whereas DMC uses 'nchain x npar x nmc'. We adopted this layout to improve computational efficiency and to follow Armadillo conventions. Likewise, the 'log_likelihoods' and 'summed_log_prior' matrices have dimensions 'nchain x nmc' in ggdmc, compared with 'nmc x nchain' in DMC. When transferring objects between the two packages, transpose these arrays and matrices with R's aperm or t functions. To streamline this process, we've included two dedicated functions.

DMC2ggdmc <- function(x) {
  ## x is an object of posterior samples from individual subject fit
  x$theta <- aperm(x$theta, c(2, 1, 3))
  x$summed_log_prior <- t(x$summed_log_prior)
  x$log_likelihoods <- t(x$log_likelihoods)
  class(x) <- c('list', 'model')
  return(x)
}
ggdmc2DMC <- function(x) {
  ## Should change $ to @ when an object is generated from ggdmc's run function,
  ## because ggdmc uses S4 class
  x$theta <- aperm(x$theta, c(2, 1, 3))
  x$summed_log_prior <- t(x$summed_log_prior)
  x$log_likelihoods <- t(x$log_likelihoods)
  return(x)
}
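To see what these conversions do to the dimensions, here is a small base-R sketch on toy data. The object names (theta_dmc, theta_gg, ll_dmc, ll_gg) and sizes are made up for illustration; only aperm and t are taken from the text above.

```r
nchain <- 3; npar <- 2; nmc <- 4

## Toy DMC-style array: nchain x npar x nmc
theta_dmc <- array(seq_len(nchain * npar * nmc), dim = c(nchain, npar, nmc))

## Swap the first two dimensions to get npar x nchain x nmc
theta_gg <- aperm(theta_dmc, c(2, 1, 3))
dim(theta_gg)   ## 2 3 4

## Each element keeps its value; only its index order changes
stopifnot(theta_gg[2, 3, 4] == theta_dmc[3, 2, 4])

## Toy DMC-style matrix: nmc x nchain, transposed to nchain x nmc
ll_dmc <- matrix(rnorm(nmc * nchain), nrow = nmc, ncol = nchain)
ll_gg  <- t(ll_dmc)
dim(ll_gg)      ## 3 4

## Applying the same permutation twice restores the original layout
stopifnot(identical(aperm(theta_gg, c(2, 1, 3)), theta_dmc))
```

Because the permutation c(2, 1, 3) is its own inverse, the same aperm call works in both directions, which is why the two functions above share the same body for theta.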

Note that Dstats.dmc in DMC is also affected by the different array and matrix dimensions, because it calculates the means of the theta/phi array over the second dimension (the columns):

apply(samples$theta,2,mean)

The tutorial on the 3-accumulator LBA model illustrates an example of the above-mentioned operation.
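The effect of the layout on this kind of averaging can be checked on toy data. The sketch below uses only base R; the arrays, parameter names, and values are invented for illustration and do not come from a real fit.

```r
set.seed(1)
nchain <- 3; npar <- 2; nmc <- 100
pnames <- c("a", "v")

## Toy DMC-style samples (nchain x npar x nmc): "a" near 1, "v" near 4
theta_dmc <- array(NA_real_, dim = c(nchain, npar, nmc),
                   dimnames = list(NULL, pnames, NULL))
theta_dmc[, "a", ] <- rnorm(nchain * nmc, mean = 1, sd = 0.1)
theta_dmc[, "v", ] <- rnorm(nchain * nmc, mean = 4, sd = 0.1)

## DMC layout: margin 2 indexes parameters, giving one mean per parameter
apply(theta_dmc, 2, mean)

## ggdmc layout (npar x nchain x nmc): margin 2 now indexes chains,
## so the same call mixes the parameters together within each chain
theta_gg <- aperm(theta_dmc, c(2, 1, 3))
apply(theta_gg, 2, mean)

## For the ggdmc layout, margin 1 gives the per-parameter means
apply(theta_gg, 1, mean)
```

In this toy case the second call returns three values near 2.5 (the average of the two parameters) rather than one value per parameter, which is the kind of silent error the dimension difference can cause.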

While ggdmc provides a DIC function that relies on the internal deviance_model function, it's important to note that DIC is generally not recommended for model comparison as of 2024. Instead, consider using the statistics stored in a model fit to calculate the Bayes factor for the same purpose.

Starting from version 0.2.7.5, ggdmc utilizes S4 classes. To extract object components (slots) after this version, use the "@" operator instead of "$".
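For readers unfamiliar with S4, the following base-R sketch shows slot access on a toy class. The class toy_model and its slots are hypothetical and are not the classes that ggdmc defines; the sketch only illustrates the "@" operator and related helpers from the methods package.

```r
library(methods)

## Hypothetical toy class; not a class defined by ggdmc
setClass("toy_model", representation(npar = "numeric", pnames = "character"))

m <- new("toy_model", npar = 4, pnames = c("a", "v", "z", "t0"))

m@npar              ## slot access with the "@" operator
slot(m, "pnames")   ## equivalent functional form
slotNames(m)        ## list the slots an object has

## "$" is not defined for this S4 object, so list-style access fails
dollar_fails <- inherits(try(m$npar, silent = TRUE), "try-error")
dollar_fails
```

When adapting older scripts, slotNames is a quick way to discover which components an object exposes before replacing "$" with "@".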

Prerequisites

R (>= 3.3.0)
R packages: Rcpp (>= 0.12.10), RcppArmadillo (>= 0.7.700.3.0), ggplot2 (>= 2.1.0), coda (>= 0.16-1), matrixStats, data.table
Windows users need Rtools (>= 3.3.0.1959)
~~Mac OS users need to make Clang understand the OpenMP flag.~~
~~Linux/Unix users may need to install the Open MPI library if it has not been installed.~~
~~Armadillo may need a recent g++ compiler > 4.6~~

Installation

ggdmc uses S4 classes after version 0.2.7.5. The new design enables a more user-friendly interface.

From CRAN (0.2.6.0):

install.packages("ggdmc")

From source:

install.packages("ggdmc_0.2.8.1.tar.gz", repos = NULL, type="source")

From GitHub (need devtools) (0.2.8.0):

devtools::install_github("yxlin/ggdmc")

For Microsoft R users:

As of June 1, 2020, deploying ggdmc on Microsoft R, which uses R version 3.5.3, presents two primary challenges. First, the RcppArmadillo package on MRAN lags behind the CRAN version, lacking essential Armadillo functions like randperm. To address this, users should install RcppArmadillo directly from its source code on CRAN. Second, the default Windows installation process seeks package binaries matching the local R version, potentially causing issues with ggdmc compatibility. Installing ggdmc from its source tarball circumvents this problem.

For Mac Users:

~~1. Install gfortran. As of 27, Aug, 2018, the gfortran version has to be 6.1, even if you are using a macOS High Sierra Version 10.13.4. gfortran 6.3 may not work.~~

2. Install clang4-r. James Balamuta has created a convenient tool, clang4-r. Once clang4-r is installed, your clang will understand the OpenMP flag in ggdmc. The aim is to make macOS understand the OpenMP flag, so you may use other methods if you do not want to install clang4-r. clang4-r is the most straightforward method we have found so far. However, we have not yet looked into its source code, so use it at your own risk.

The configure script now disables OpenMP, so macOS users should be able to install without encountering the OpenMP problem.

FAQ

How may I resolve the error, "/usr/bin/ld: cannot find -lgsl" and/or "/usr/bin/ld: cannot find -lgslcblas"?

On Ubuntu, installing libgsl-dev may resolve this problem.

Citation

Lin, Y.-S. and Strickland, L. (2020). Evidence accumulation models with R: A practical guide to hierarchical Bayesian methods. The Quantitative Methods for Psychology.

Contributors

The R documentation, tutorials, C++ code, parallel computations, new genetic algorithm, R helper functions, and R packaging are developed by Yi-Shin Lin. A substantial part of the R code for handling experimental designs is adapted from DMC, developed by Andrew Heathcote (Heathcote et al., 2018). You can find different and more interesting cognitive models in DMC.

Please report bugs to me or start an issue here.

Correction

The help page for the likelihood function in Density.cpp states that it returns the log-likelihood (v2.8.0). An inspection of the source code found that it returns the likelihood, not the log-likelihood (13-06-2022; v2.8.1). Thanks to Nachshon Meiran for pointing this out.

License

GPL-2

Acknowledgments

The PDF, CDF, and random number generation of DDM were derived from Voss & Voss's fast-dm 30.2 and rtdists 0.9-0.
Truncated normal functions were originally based on Jonathan Olmsted's RcppTN 0.1-8 at https://github.com/olmjo/RcppTN, Christopher Jackson's R code in the msm package, and Robert's paper (1995, Statistics & Computing).
Thanks to Matthew Gretton for consultation regarding the rtdists package.
Thanks to Andrew Heathcote for lending me his MacBook Air. ggdmc works on OS X (macOS High Sierra Version 10.13.4).
The PDF and random number generation of the 2-D diffusion/circular diffusion model are based on Smith (2016).

Reference

Heathcote, A., Lin, Y.-S., Reynolds, A., Strickland, L., Gretton, M., & Matzke, D. (2018). Dynamic models of choice. Behavior Research Methods. https://doi.org/10.3758/s13428-018-1067-y
Lin, Y.-S., & Strickland, L. (2020). Evidence accumulation models with R: A practical guide to hierarchical Bayesian methods. The Quantitative Methods for Psychology.
Smith, P. (2016). Diffusion theory of decision making in continuous report. Psychological Review, 123(4), 425-451. https://dx.doi.org/10.1037/rev0000023
