WIP: Ratcliff Diffusion Model pdf and simulators
I have added a PDF file containing the Ratcliff Diffusion Model and two methods for simulating the diffusion process. However, the PDF code still needs testing as there are some remaining bugs. I wanted to share the progress so that we can identify the tests that need to be conducted and address any issues.
Codecov Report
Patch coverage has no change and project coverage change: -30.14% :warning:
Comparison is base (4414161) 82.93% compared to head (7fd93cd) 52.80%.
Additional details and impacted files
```diff
@@            Coverage Diff             @@
##           master      #29       +/-   ##
===========================================
- Coverage   82.93%   52.80%   -30.14%
===========================================
  Files          11       12        +1
  Lines         545      856      +311
===========================================
  Hits          452      452
- Misses         93      404      +311
```
| Impacted Files | Coverage Δ | |
|---|---|---|
| src/DDM.jl | 82.68% <ø> (ø) | |
| src/RatcliffDDM.jl | 0.00% <0.00%> (ø) | |
| src/SequentialSamplingModels.jl | 100.00% <ø> (ø) | |
@kiante-fernandez, thanks for the progress update. I realize this is a WIP, but I left a few comments that may still be applicable.
My approach to tests is to start by plotting the pdf over the histogram of simulated data. That is part of the reason I include those plots in the documentation. I also like to compare the pdf to a kernel density estimate of the simulated data. It's not perfect, but it gives you some idea of whether the generative model and the pdf are consistent with each other. I also test against established libraries when available. Another thing I do is develop a test whenever I find a bug to ensure there are no regressions in the future.
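For concreteness, here is a minimal sketch of that kind of eyeball test, assuming the package's usual `rand`/`pdf` API; the parameter values and plotting choices are just illustrative:

```julia
using SequentialSamplingModels
using Plots, KernelDensity, Statistics

# illustrative parameters; any valid settings work for this check
dist = DDM(; ν = 1.0, α = 0.80, τ = 0.30, z = 0.50)
choice, rt = rand(dist, 10_000)
rt1 = rt[choice .== 1]
p1 = mean(choice .== 1)

# overlay the analytic pdf (conditioned on choice 1) on the simulated histogram
histogram(rt1, normalize = true, alpha = 0.3, label = "simulated")
ts = range(minimum(rt1), maximum(rt1); length = 200)
plot!(ts, [pdf(dist, 1, t) for t in ts] ./ p1, label = "analytic pdf")

# and compare against a kernel density estimate of the simulated rts
kd = kde(rt1)
plot!(kd.x, kd.density, label = "kernel density")
```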
Could we also add (in the docstrings?) a brief mention of the difference between DDM and RatcliffDDM, and maybe a cross-link ("See also DDM()")?
If we can get this model working, another option would be to integrate everything into one model, given that DDM is currently a special case in which the cross-trial variability is zero. The integration procedures can be skipped in the special case. We could avoid breaking changes by using the type DDM for the general model.
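A rough sketch of what I have in mind, with hypothetical field names and zero defaults so existing DDM code keeps working:

```julia
# Hypothetical unified type (field names assumed): zero defaults for the
# across-trial parameters make the current DDM behavior the special case.
Base.@kwdef struct DDM{T <: Real}   # would subtype the package's model supertype
    ν::T = 1.0     # drift rate
    α::T = 0.80    # boundary separation
    τ::T = 0.30    # non-decision time
    z::T = 0.50    # relative starting point
    η::T = 0.0     # across-trial variability in ν
    sz::T = 0.0    # across-trial variability in z
    st::T = 0.0    # across-trial variability in τ
    σ::T = 1.0     # within-trial diffusion coefficient
end
```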
@itsdfish I have added some updated PDF code along with the CDF code, but I am getting a precompile error that might be related to some bugs in Bijectors.
Thanks @kiante-fernandez for your hard work. It is difficult for me to look at a fork locally. Based on the unit tests, it seems like there is a variable called x that is not defined somewhere. It's not clear to me where it is based on the error trace. I recommend using Revise.jl and commenting out files or parts of files until you can locate the issue. If you continue to have issues, I can make a new branch and you can merge into that. I don't want to merge code with errors into main.
I'll leave a few comments for now and look through the code in more detail probably tomorrow.
How is it going, @kiante-fernandez? I hope this didn't give you a nervous breakdown 😅 Let us know if you need help or support :)
> How is it going, @kiante-fernandez? I hope this didn't give you a nervous breakdown 😅 Let us know if you need help or support :)

No worries, thanks for checking in. I just need to write some checks for the PDF and CDF. Otherwise, the methods are all written. I am at a conference this weekend. Will take a look Sunday.
@kiante-fernandez, I just saw a banner at the top of this page indicating you are requesting a review. I didn't receive a notification. Sorry if you have been waiting for a while. I should have some time on Friday to review. In the meantime, I noticed that some unit tests do not pass.
How's it going with this, @kiante-fernandez? If you don't have the bandwidth currently, could we lay out the to-do list required to complete this more explicitly, and maybe I can try to find some help (perhaps GPT-4 can be useful ^^)?
I did not see that @itsdfish got back to me. I would like to take some time to review the updates to the code base first so I can make sure the implementation is in line with what we have.
@kiante-fernandez, that sounds good. If I recall correctly, there were three outstanding issues: (1) fix the NaNs in the pdf and logpdf, (2) fix the unit tests, and (3) make sure the docs render correctly. Let me know if you have any questions.
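For (1), in case it is useful, one common guard is to return -Inf rather than letting a zero or negative density turn into NaN under log. A sketch, with a hypothetical helper name rather than the package's actual code:

```julia
# Sketch of a NaN guard for logpdf: outside the support (rt ≤ τ), or wherever
# numerical noise makes the density non-positive, return -Inf instead of NaN.
function logpdf_guarded(d, choice, rt)
    rt ≤ d.τ && return -Inf    # no mass before the non-decision time
    dens = pdf(d, choice, rt)
    return dens > 0 ? log(dens) : -Inf
end
```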
@kiante-fernandez, I hope you are doing well. I wanted to follow up with you about the status of the PR and a related question about rand. I was hoping to use rand, but I am having trouble reproducing results with rtdists. I thought maybe I am doing something incorrectly, or I am misunderstanding the parameterization of the model. Here is what I have:
```julia
using SequentialSamplingModels
using SequentialSamplingModels: RatcliffDDM
using Statistics

dist = RatcliffDDM(ν = 1.0, α = 0.80, τ = 0.30, z = 0.25, η = 0.16, sz = 0.05, st = 0.10, σ = 1.0)
choice, rt = rand(dist, 10_000)
mean1 = mean(rt[choice .== 1])
prob1 = mean(choice .== 1)
mean2 = mean(rt[choice .== 2])
prob2 = mean(choice .== 2)
[mean1, mean2, prob1, prob2]
```

```
4-element Vector{Float64}:
 0.49065225546420305
 0.3867084522660908
 0.4116
 0.5884
```
```r
library("rtdists")

# z = a * .25
sim_data = rdiffusion(100000, a = .8, v = 1, t0 = .30, z = .20, sz = .05, sv = .16,
                      st0 = .10, s = 1)
data_lower = sim_data$rt[sim_data$response == "lower"]
mean_lower = mean(data_lower)
prob_lower = mean(sim_data$response == "lower")
data_upper = sim_data$rt[sim_data$response == "upper"]
mean_upper = mean(data_upper)
prob_upper = mean(sim_data$response == "upper")
c(mean_upper, mean_lower, prob_upper, prob_lower)
```

```
0.5408594 0.4371890 0.4108300 0.5891700
```
The response probabilities are similar, but the rts are different. Any ideas?
Thanks for following up. With the holiday here, I should have some time to take a look. Also, from the example you gave, you have set z to two different values (Julia: z = 0.25 and R: z = 0.20).
Thanks for taking a look.
You are right that the values for z are different. The documentation for rtdists was a bit unclear. However, the examples led me to believe it is absolute, so the appropriate value is z = .80 * .25 = .20. I did try the absolute value, but the results diverged even more. I created a random walk version because it is easier to understand. Interestingly, the results between that and your code are similar. I will share that code in case it is helpful for determining a ground truth.
Anyways, have a good break!
```julia
using Distributions
import Base: rand   # extend rand so calls like rand(Uniform(...)) still work

function rand(dist::DDM; Δt = .001)
    (; ν, α, z, σ, τ, η, st, sz) = dist
    t = 0.0
    z′ = α * z                     # convert relative starting point to absolute
    Δx = σ * √(Δt)                 # step size of the random walk
    # across-trial variability in starting point
    x = sz == 0 ? z′ : rand(Uniform(z′ - sz / 2, z′ + sz / 2))
    # across-trial variability in drift rate
    ν′ = rand(Normal(ν, η))
    # probability of an upward step
    p = .50 * (1 + (ν′ * √Δt) / σ)
    while (x < α) && (x > 0)
        x += rand() ≤ p ? Δx : -Δx
        t += Δt
    end
    choice = (x < α) + 1           # 1 = upper boundary, 2 = lower boundary
    # across-trial variability in non-decision time (symmetric around τ)
    t += st == 0 ? τ : rand(Uniform(τ - st / 2, τ + st / 2))
    return choice, t
end

function rand(dist::DDM, n::Int; Δt = .001)
    choices = fill(0, n)
    rts = fill(0.0, n)
    for i ∈ 1:n
        choices[i], rts[i] = rand(dist; Δt)
    end
    return (; choices, rts)
end
```
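For reference, it can be called just like the example above (assuming a DDM type carrying these fields):

```julia
using Statistics

dist = DDM(ν = 1.0, α = 0.80, τ = 0.30, z = 0.25, η = 0.16, sz = 0.05, st = 0.10, σ = 1.0)
choices, rts = rand(dist, 10_000)
mean(rts[choices .== 1]), mean(rts[choices .== 2])
```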
Update
After further investigation, I found that the culprit is st0 and st. When I set the variability parameters to zero, the results agreed. However, they diverged only when st0 and st > 0:
```r
sim_data = rdiffusion(100000, a = .8, v = 1, t0 = .30, z = .20, sz = .0, sv = .00,
                      st0 = .2, s = 1)
```

```julia
dist = RatcliffDDM(; ν = 1.0, α = 0.80, τ = 0.3, z = 0.25, η = 0.00, sz = 0.0, st = .20, σ = 1.0)
```
The mean reaction time increased for rtdists, but stayed the same for SSM.jl. If I understand correctly, st0 should not change the mean reaction time. Is that your understanding too?
@itsdfish That is my understanding as well. I tried to hunt down the sampling function rtdists uses, but I also confirmed this using a few different simulation functions across both R and Python.
```r
# Load necessary libraries
library(ggplot2)
library(rtdists)
library(Rhddmjags)

# Function to simulate and get mean RT for a given st0 value
simulate_mean_rt <- function(st0, sample_method = 1) {
  if (sample_method == 1) {
    sim_data <- rdiffusion(100000, a = 0.8, v = 1, t0 = 0.30, z = 0.20,
                           sz = 0.0, sv = 0.0, st0 = st0, s = 1)
    mean_rt <- mean(sim_data$rt)
  } else if (sample_method == 2) {
    sim_data <- Rhddmjags::simulratcliff(100000, Alpha = 0.8, Nu = 1, Tau = 0.30, Beta = 0.20,
                                         rangeBeta = 0.0, Eta = 0.0, rangeTau = st0, Varsigma = 1)
    mean_rt <- mean(abs(sim_data))
  } else {
    sim_data <- Rhddmjags::simul_ratcliff_slow(100000, Alpha = 0.8, Nu = 1, Tau = 0.30, Beta = 0.20,
                                               rangeBeta = 0.0, Eta = 0.0, rangeTau = st0,
                                               Varsigma = 1, nsteps = 1000)
    mean_rt <- mean(abs(sim_data))
  }
  return(mean_rt)
}

# Define a range of st0 values and store results for each simulator
st0_values <- seq(0, 0.7, by = 0.05)
results1 <- data.frame(st0 = st0_values, mean_rt = sapply(st0_values, simulate_mean_rt))
results2 <- data.frame(st0 = st0_values, mean_rt = sapply(st0_values, simulate_mean_rt, sample_method = 2))
results3 <- data.frame(st0 = st0_values, mean_rt = sapply(st0_values, simulate_mean_rt, sample_method = 3))
results1$method <- "fast_dm"
results2$method <- "rejection_rhddmjags"
results3$method <- "euler_rhddmjags"
results <- rbind(results1, results2, results3)

# Plot using ggplot2
ggplot(results, aes(x = st0, y = mean_rt, group = method, color = method)) +
  geom_line() +
  geom_point() +
  labs(title = "Mean Response Time by st0",
       x = "st0 Value",
       y = "Mean RT") +
  theme_minimal()
```
Here, fast_dm is the rdiffusion function.
```python
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
from ssms.basic_simulators.simulator import simulator

v_true, a_true, z_true, t_true, sz_true, sv_true, st_true, s = [1, 0.8, 0.2, 0.3, 0, 0, 0, 1]

# Define the range of st_true values
st_true_values = np.linspace(0, 1, 10)

# Initialize lists to store the mean response times and st_true values
mean_rts = []
st_values = []

# Loop over the range of st_true values
for st_true in st_true_values:
    # Simulate data
    sim_out = simulator([v_true, a_true, z_true, t_true, sz_true, sv_true, st_true, s],
                        model="full_ddm", n_samples=10000)
    # Turn data into a pandas dataframe
    dataset_plots = pd.DataFrame(
        np.column_stack([sim_out["rts"][:, 0], sim_out["choices"][:, 0]]),
        columns=["rt", "response"],
    )
    # Round the response times to 4 decimal places
    dataset_plots['rt'] = dataset_plots['rt'].round(4)
    # Calculate the mean response time and store it
    mean_rt = dataset_plots['rt'].mean()
    mean_rts.append(mean_rt)
    # Store the st_true value
    st_values.append(st_true)

# Plot the mean response times by st_true values
plt.plot(st_values, mean_rts)
plt.xlabel('st_true')
plt.ylabel('Mean response time')
plt.ylim(0, max(mean_rts) * 1.1)  # widen the y-axis
plt.show()
```
So at first glance, we might have found something wrong with rdiffusion?
Also, just so I am clear on the PR: I am still writing DDM.jl to replace RatcliffDDM.jl. So, when we have non-zero values of the across-trial parameters, it just calls those functions (the RatcliffDDM ones) instead?
@kiante-fernandez, thanks for looking through the issue above. It does appear to be a bug. I will submit an issue later today or tomorrow. I think the code above can be repurposed for unit tests at some point. It would be good to include a few tests to make sure changes in parameters have the expected effect (or lack of effect). It goes to show that it is easy to make a simple error. I would be happy to add those tests if it would help.
Yeah. I think it would be good to remove the DDM code and use the RatcliffDDM code because it subsumes the DDM code. There are probably different ways to approach it. Perhaps the simplest way is to use conditional checks to skip calculations needed for st, eta, and sz when the value is equal to zero. For example, if you perform an expensive operation (e.g., a convolution) in the RatcliffDDM code, we can potentially skip it if eta = 0. Of course, you will probably have a better idea of which approach is best in terms of readability, efficiency, and maintainability. But I think this will be simpler overall because DDM is a special case of RatcliffDDM in which st, eta, and sz are zero. Once all of that is done, I recommend switching the name from RatcliffDDM to DDM, which will prevent a breaking change (i.e., we are just making DDM more general). Let me know if you have any questions or concerns.
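To make the conditional-skip idea concrete, here is a rough sketch for the eta case (`pdf_no_variability` is a hypothetical stand-in for the fixed-drift closed-form density, not the package's actual function):

```julia
using Distributions, QuadGK

# Sketch: skip the η integration entirely in the special case.
function ratcliff_pdf(d, choice, rt)
    # special case: no across-trial drift variability, so no integration needed
    d.η == 0 && return pdf_no_variability(d, d.ν, choice, rt)
    # general case: average the conditional density over ν ~ Normal(d.ν, d.η)
    f(ν) = pdf_no_variability(d, ν, choice, rt) * pdf(Normal(d.ν, d.η), ν)
    return quadgk(f, d.ν - 6 * d.η, d.ν + 6 * d.η)[1]
end
```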
Update
I looked through the documentation for rtdists again and noticed I may have misunderstood the parameterization. Here is what it says:

> inter-trial-variability of non-decisional components. Range of a uniform distribution with mean t0 + st0/2 describing the distribution of actual t0 values across trials. Accounts for response times below t0.

In contrast to sz, st0/2 is added to the mean non-decision time. That seems a bit unusual to me, but it is consistent with the observed behavior. The only thing that does not make sense to me is "Accounts for response times below t0". Maybe that is not important. Anyways, sorry for the oversight. I thought I read something else.
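For anyone following along, the difference between the two conventions is easy to see directly (illustrative values):

```julia
using Distributions, Statistics

τ, st = 0.30, 0.20
# symmetric convention (what my random walk above assumes): mean stays τ
mean(rand(Uniform(τ - st / 2, τ + st / 2), 10^6))  # ≈ 0.30
# rtdists convention: Uniform(t0, t0 + st0), so the mean shifts up by st0/2
mean(rand(Uniform(τ, τ + st), 10^6))               # ≈ 0.40
```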
@itsdfish I have finally had a chance to take a look through the new codebase; great work on organizing things! Following the conventions, I added the AbstractDDM and moved everything we needed to DDM.jl. I am still failing some of the tests I set up, but I thought I would ping for feedback while I have time to have eyes on this for a few days.
@kiante-fernandez, thanks! I think we are converging on a good system and API.
I made a few comments throughout. Everything looks good for the most part. I think the primary item is getting the unit tests to pass and adding more if needed.
@itsdfish Thank you for the comments. I have been working to resolve some issues, but I have not yet identified what is wrong with the implementation. I thought it might be due to the numerical integration implementation (inspired by https://github.com/hddm-devs/hddm/blob/master/src/integrate.pxi), though I have not pinned it down yet. That is just my sense from some of the test results so far.
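One way I am thinking of isolating the integration step is to compare a hand-rolled grid average against adaptive quadrature on the same integrand (a hypothetical check; `g` just stands in for the conditional density as a function of t0):

```julia
using QuadGK, Statistics

g(t0) = exp(-3t0)                             # stand-in integrand
τ, st = 0.3, 0.2
lo, hi = τ - st / 2, τ + st / 2
quad = quadgk(g, lo, hi)[1] / st              # uniform average via quadrature
grid = mean(g, range(lo, hi; length = 1001))  # simple grid average
abs(quad - grid)                              # a large gap points at the integrator
```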
Benchmark Results
| | master | 85156123beed5c... | t[master]/t[85156123beed5c...] |
|---|---|---|---|
| logpdf/("SequentialSamplingModels.DDM", 10) | 1.67 ± 0.11 μs | 14.5 ± 3.5 μs | 0.115 |
| logpdf/("SequentialSamplingModels.DDM", 100) | 17.2 ± 1.2 μs | 0.136 ± 0.0071 ms | 0.126 |
| logpdf/("SequentialSamplingModels.LBA", 10) | 2.45 ± 0.18 μs | 2.45 ± 0.18 μs | 1 |
| logpdf/("SequentialSamplingModels.LBA", 100) | 23.5 ± 0.63 μs | 23.5 ± 0.66 μs | 0.999 |
| logpdf/("SequentialSamplingModels.LNR", 10) | 1.01 ± 0.17 μs | 1 ± 0.15 μs | 1.01 |
| logpdf/("SequentialSamplingModels.LNR", 100) | 8.54 ± 0.29 μs | 8.56 ± 0.28 μs | 0.998 |
| logpdf/("SequentialSamplingModels.RDM", 10) | 2.61 ± 0.25 μs | 2.54 ± 0.19 μs | 1.03 |
| logpdf/("SequentialSamplingModels.RDM", 100) | 24.9 ± 0.71 μs | 24.9 ± 0.73 μs | 1 |
| logpdf/("SequentialSamplingModels.Wald", 10) | 0.227 ± 0.16 μs | 0.227 ± 0.17 μs | 1 |
| logpdf/("SequentialSamplingModels.Wald", 100) | 2 ± 0.17 μs | 2 ± 0.18 μs | 1 |
| logpdf/("SequentialSamplingModels.WaldMixture", 10) | 1.09 ± 0.16 μs | 1.09 ± 0.16 μs | 0.999 |
| logpdf/("SequentialSamplingModels.WaldMixture", 100) | 10.6 ± 0.16 μs | 10.6 ± 0.17 μs | 1 |
| rand/("SequentialSamplingModels.DDM", 10) | 3.92 ± 0.54 μs | 9.39 ± 1.3 μs | 0.418 |
| rand/("SequentialSamplingModels.DDM", 100) | 0.0383 ± 0.0019 ms | 0.088 ± 0.0058 ms | 0.435 |
| rand/("SequentialSamplingModels.LBA", 10) | 3.21 ± 1.3 μs | 3.23 ± 0.39 μs | 0.993 |
| rand/("SequentialSamplingModels.LBA", 100) | 16.5 ± 0.63 μs | 16.5 ± 1 μs | 1 |
| rand/("SequentialSamplingModels.LCA", 10) | 0.769 ± 0.24 ms | 0.77 ± 0.24 ms | 0.999 |
| rand/("SequentialSamplingModels.LCA", 100) | 8.24 ± 0.24 ms | 8.3 ± 0.26 ms | 0.992 |
| rand/("SequentialSamplingModels.LNR", 10) | 1.06 ± 0.18 μs | 1.09 ± 0.27 μs | 0.972 |
| rand/("SequentialSamplingModels.LNR", 100) | 7.34 ± 3.8 μs | 7.59 ± 3.8 μs | 0.967 |
| rand/("SequentialSamplingModels.RDM", 10) | 1.29 ± 0.18 μs | 1.33 ± 0.2 μs | 0.971 |
| rand/("SequentialSamplingModels.RDM", 100) | 10.7 ± 3.8 μs | 10.8 ± 3.9 μs | 0.998 |
| rand/("SequentialSamplingModels.Wald", 10) | 0.46 ± 0.16 μs | 0.464 ± 0.16 μs | 0.991 |
| rand/("SequentialSamplingModels.Wald", 100) | 2.89 ± 0.21 μs | 2.91 ± 0.2 μs | 0.991 |
| rand/("SequentialSamplingModels.WaldMixture", 10) | 1.2 ± 0.15 μs | 1.19 ± 0.15 μs | 1 |
| rand/("SequentialSamplingModels.WaldMixture", 100) | 11.4 ± 0.17 μs | 11.4 ± 0.19 μs | 0.997 |
| simulate/SequentialSamplingModels.DDM | 3.7 ± 1.5 μs | 3.32 ± 1.4 μs | 1.12 |
| simulate/SequentialSamplingModels.LBA | 3.8 ± 0.5 μs | 3.88 ± 1.9 μs | 0.979 |
| simulate/SequentialSamplingModels.LCA | 0.102 ± 0.021 ms | 0.103 ± 0.023 ms | 0.989 |
| simulate/SequentialSamplingModels.RDM | 0.0693 ± 0.03 ms | 0.0674 ± 0.025 ms | 1.03 |
| simulate/SequentialSamplingModels.Wald | 9.4 ± 4.7 μs | 9.23 ± 4.8 μs | 1.02 |
| simulate/SequentialSamplingModels.WaldMixture | 4.05 ± 1.6 μs | 4.02 ± 1.6 μs | 1.01 |
| time_to_load | 3.45 ± 0.0097 s | 3.45 ± 0.013 s | 0.999 |
Benchmark Plots
A plot of the benchmark results has been uploaded as an artifact to the workflow run for this PR. Go to "Actions" -> "Benchmark a pull request" -> [the most recent run] -> "Artifacts" (at the bottom).
Should we aim to merge this before @kiante-fernandez's talk ^^?
I'll close this now that it has been moved to #90.