Observed correlations between person-means depend on within-person correlations

Q: What is the difference between within-person and between-person effects?

Between-person effects describe relationships across different people, typically looking at their average or stable traits. For example, “Are people who are on average more anxious also on average more depressed?” Within-person effects describe relationships within a single individual over time. For example, “When I feel more anxious than my personal average, do I also tend to feel more depressed at that same time?” This study investigates how these two different levels of analysis can get confused.

Q: What is a person-wise mean in longitudinal data?

A person-wise mean (or person-mean) is an individual’s average score on a specific variable, calculated across all the time points in an intensive longitudinal study. For example, if a participant’s anxiety is measured 100 times over a month, their person-wise mean for anxiety would be the single average score of those 100 measurements.

Q: How do within-person effects bias correlations between person-means?

This study by Haslbeck & Epskamp (2024) in advances.in/psychology demonstrates that the correlation observed between person-means (the between-person analysis) can be artificially inflated or deflated by the underlying within-person relationships. The bias becomes stronger (i.e., the observed correlation is more misleading) when: The number of time points is small. There is high temporal dependency (strong autocorrelation, or “inertia,” where a feeling like anxiety persists over time). There is a strong within-person contemporaneous correlation (e.g., feeling anxious and depressed at the exact same time). This bias decreases as the number of time points for each person gets larger.

Q: What is the risk of correlating person-means from time-series data?

The primary risk is drawing false scientific conclusions. This study’s simulations show that the bias can be substantial. For example, a true medium between-person correlation could be incorrectly observed as a very large correlation simply because of the influence from the within-person dynamics. This bias can affect psychological theories and the results of common analysis algorithms that rely on person-means, leading to a flawed understanding of how variables are related.

Haslbeck, Jonas; Epskamp, Sacha

doi:10.56296/aip00020

Jonas Haslbeck

, & Sacha Epskamp

1,386 viewsCited by 2Transparent Review

Published under the Creative Commons BY 4.0 license.

PDF

Cite

Author Details

Jonas Haslbeck: Department of Clinical Psychological Science, Maastricht University; Department of Psychological Methods, University of Amsterdam

Sacha Epskamp: Department of Psychology, National University of Singapore

*Please address correspondence to Sacha Epskamp, sepskamp@nus.edu.sg, Faculty of Arts and Social Sciences, Block AS4, Level 2, 9 Arts Link, Singapore 117570.

Transparent Peer Review

The current article passed one round of double-blind peer review. The anonymous review report can be found here.

OpenAlex Database Metrics

Citations

This article has been cited 2 times in the OpenAlex database.

Articles Citing This Paper

Pamela Baess, Shabnamalsadat Ayatollahi, Christina Bermeitinger, Luisa Bogenschütz, Pia Fenske, Ryan P. M. Hackländer, Mohammad Hamzeloo, Gustavo Adolfo León Montoya (2026). Shifting and zooming through space … or not: The role of attention in spatial compatibility tasks - A replication of Stoffer (1991). Psychological Research. https://doi.org/10.1007/s00426-026-02299-1
Jorge Fábrega, Aníbal Olivera (2026). Same Score, Different Network: Configuration-Level Heterogeneity in Composite Risk Measures. Research Square. https://doi.org/10.21203/rs.3.rs-8880036/v1

Citation

Haslbeck, J. & Epskamp, S. (2024). Observed correlations between person-means depend on within-person correlations. advances.in/psychology, 2, e853425. https://doi.org/10.56296/aip00020

Haslbeck, Jonas, and Sacha Epskamp. "Observed correlations between person-means depend on within-person correlations." advances.in/psychology, vol. 2, no. 1, 2024, e853425. https://doi.org/10.56296/aip00020.

Haslbeck, Jonas, and Sacha Epskamp. 2024. "Observed correlations between person-means depend on within-person correlations." advances.in/psychology 2 (1): e853425. https://doi.org/10.56296/aip00020.

Haslbeck J, Epskamp S. Observed correlations between person-means depend on within-person correlations. advances.in/psychology. 2024;2(1):e853425. doi:10.56296/aip00020.

Haslbeck, J. and Epskamp, S. (2024) 'Observed correlations between person-means depend on within-person correlations', advances.in/psychology, 2(1), e853425. Available at: https://doi.org/10.56296/aip00020.

Copy Reference

Download .RIS

Download .bib

Abstract

Intensive longitudinal data have become a common data type across psychological disciplines. A key issue in the analysis of such data is the separation of within-person and between-person effects. This problem is well studied for the effect of between-person effects (e.g., varying intercepts) on within-person parameters (e.g., cross-lagged effects). In this paper, we discuss a less appreciated effect of within-person correlations on correlations between person-wise means. Using simulations and an analytical derivation, we show how observed correlations between person-wise means are a function of both population between-person correlations and within-person correlations. This has implications for the interpretation of statistical relationships between person-wise means, for example when estimated directly from the data or within stepwise approaches to estimating multilevel vector autoregressive models, such as in the popular R package mlVAR. We discuss implications for applied research and possible strategies to avoid this problem.

Editor Curated

Key Takeaways

In intensive longitudinal data, using person-wise sample means to estimate between-person effects (e.g., correlations between stable traits) is a common but potentially flawed practice, as these estimates can be biased by within-person dynamics (e.g., day-to-day fluctuations).
The research demonstrates through simulations and an analytical derivation that observed correlations between person-wise means are a function of both the true between-person correlation and the within-person correlations, meaning a correlation can appear between two traits even if none truly exists at the between-person level.
This bias is most severe when the number of time points per person is low, between-person variance is small, and within-person effects are strong; to avoid this, the authors recommend using methods that jointly estimate within- and between-person effects in a single step, such as Dynamic Structural Equation Modeling (DSEM).

Keywords:

Introduction

Cross-sectional data does not allow one to adequately distinguish between within-person effects and between-person effects (Molenaar, 2004). This limitation is one of the driving forces behind the trend towards collecting intensive longitudinal data of several subjects measured multiple times (Conner & Barrett, 2012; Hamaker & Wichers, 2017; Kuppens et al., 2022; Miller, 2012; Trull & Ebner-Priemer, 2014). Such data does, in principle, allow for the separation of within- and between-person variance (Epskamp, 2020), which is crucial if the targets of inference are statistical relationships (e.g., correlations, regressions) at the within-person or between-person level. For example, consider a dataset of several subjects measured repeatedly on a set of mood states. A clinical psychologist might be interested in using such data to study within-person relationships to find, for example, relationships between day-to-day fluctuations (states) from the person-wise means. A personality researcher, on the other hand, might be more interested to study between-person relationships to find how the average mood states (traits) relate to one-another.

Part of a Special IssueNetwork PsychometricsThis article belongs to a curated special issue. Discover more research on this topic from the full collection.Explore the special issue →

While multivariate psychometric models exist that adequately separate within- and between-person variance (e.g., Epskamp, 2020; Hamaker et al., 2015; Molenaar, 1985, 2017), they often become intractable to use when the number of variables and/or measurements per person is large. To this end, methods have been proposed to simplify those models. A popular trick is to use the personwise sample means — the sample mean of all measures of a single person on a particular variable — to stand in for the corresponding true person-wise means. Using the person-wise sample means as a proxy for the true means, the data can then be within-person centered to study within-person effects and relationships between the sample means can be used to study between-person effects (Epskamp et al., 2018; Hamaker & Grasman, 2015). While this method, implemented in the popular Rpackage mlVAR (Epskamp et al., 2018), generally works well for large datasets, it has been recognized that using the person-wise means as such can influence the within-person parameters obtained in an analysis, especially when the number of time points per person is low. For example, a temporal auto-regression can be estimated to be negative due to Nickell’s bias (Nickell, 1981) when data with too few repeated measures are within-person centered (Burger et al., 2022; Jordan et al., 2020)^[1].

While the possibility that between-person effects can bias within-person effects is well known in the literature, here we study the reverse effect. Namely, the possibility that within-person effects are biasing between-person effects. We will show that using the person-wise sample means as proxies for the true person-wise means can lead to bias in estimated between person parameters, such as (partial) correlation coefficients between the person-wise means (Epskamp et al., 2022; Epskamp, 2020). We first illustrate in a simulation study using multilevel vector auto-regressive (VAR) models how correlations between person-wise means can be induced by within-person correlations. That is, we observe correlations between person-wise means, even if the data generating mechanism does not include any such correlation, but only correlations at the within-person level. We then report a second simulation to show that this bias also carries over to the estimation of between-person networks in the context of estimating multilevel VAR models. After illustrating the effect, we provide an analytic derivation of the correlation between person-wise means as a function of population correlations between person-wise means and within-person correlations. We end by discussing implications for interpreting correlation between person-wise means and suggest possible avenues for future research to avoid this bias.

Within- and Between-Person Effects

We begin by introducing notation and terminology. Let $\pmb{y}_{t,p}$ be a set of $n_v$ variables measured in person $p \in \left(1,2,\ldots,n_p\right)$ at time point $t \in \left\{1,2,\ldots,n_t\right\}$. We will assume an underlying statistical model that separates $\pmb{y}_{t,p}$ into a part that is stable for person $p$, $\pmb{\mu}_{p}$, and deviation from this stable mean in time point $t$, $\pmb{\xi}_{t,p}$:

$
\begin{equation}\label{withinbetween}
\pmb{y}_{t,p} = \pmb{\mu}_{p} + \pmb{\xi}_{t,p}. \end{equation}
\tag{1}
$

Capitalized subscripts indicate the dimension that is taken to be random. That is, $\pmb{y}_{T,p}$ indicates responses on a random time point $T$ from fixed person $p$, and $\pmb{y}_{t,P}$ indicates responses from a random person $P$ on fixed time-point $t$. This allows us to establish the terminology about within- and between-person effects which we use in the remainder of this paper; for a more elaborate discussion of this notation, see Epskamp et al. (2022).

Within-person Effects

Within-person effects in observational data typically refer to effects that are generated by (some function of) the variance–covariance structure of a fixed person $p$ within the same window of measurement or across different windows of measurement

$
\text{within-person (co-)variance: } \mathrm{cov}\left(\pmb{y}_{T,p}, \pmb{y}_{T+k,p} \right) = \mathrm{cov}\left(\pmb{\xi}_{T,p}, \pmb{\xi}_{T+k,p} \right),
$

in which $k \in \mathbb{Z}$ (which could be zero) is called the lag, indicating a difference across time. This variance is called within-person because it is fully derived from within the dataset of person $p$. When analyzing multiple people, researchers often estimate fixed-effects, which describe the within-person variances of the average person, here denoted with the symbol $∗$:

$
\text{fixed-effect within-person (co-)variance: } \mathrm{cov}\left(\pmb{y}_{T,*}, \pmb{y}_{T+k,*} \right) = \mathrm{cov}\left(\pmb{\xi}_{T,*}, \pmb{\xi}_{T+k,*} \right).
$

These effects are often simply called within-person effects, but it could be argued that this is a misleading label, because the effects are not based on the data of any single subject in the dataset. In this paper, we will circumvent this possible confusion by assuming full homogeneity $\mathrm{cov}\left(\pmb{y}_{T,1}, \pmb{y}_{T+k,1}\right) = \mathrm{cov}\left(\pmb{y}_{T,2}, \pmb{y}_{T+k,2}\right) = \ldots = \mathrm{cov}\left(\pmb{y}_{T,n_p}, \pmb{y}_{T+k,n_p}\right)$ across persons for simplicity, while noting that any findings do generalize to heterogeneous samples because the fixed-effects adequately describe the covariance structure across people (Epskamp, 2020).

Between-person Effects

Between-person effects and cross-sectional effects are often used interchangeably in the literature, but they refer to different variances. Cross-sectional effects refer to the variance–covariance structure of cross-sectional data:

$
\text{Cross-sectional variance: } \mathrm{var}\left(\pmb{y}_{t,P}\right)
$

indicating the variance–covariance structure across people on a fixed time-point $t$. This variance–covariance structure, however, is in part also a function of the (fixed-effect) within-person variance covariance structure (see below for more detail). As such, “between-person” is not an adequate term to use for effects established in cross-sectional data, unless it can reasonably be assumed that withinperson variance in the data is nil (e.g., when including aggregates over time or stable traits in the data; Epskamp et al., 2018). An adequate definition of between-person effects is that these effects rest solely on the variance–covariance structure of the part that is stable within individuals

$
\text{Between-person variance: } \mathrm{var}\left(\pmb{\mu}_{P}\right),
$

that is, the covariances between stable means encoded in $\pmb{\mu}$.

Epskamp et al. (2018) give two examples of situations in which the within-person and between-person correlations between two variables can diverge. The first example, based on Hamaker (2012), discusses a dataset in which authors are measured on the number of spelling errors and typing speed for individual texts; a fixed author might make more spelling errors if they type faster (within-person effect), whereas an author who types fast on average might also make few spelling errors on average (between-person effect). The second example, based on Hoffman (2015), discusses a dataset in which subjects are measured on their physical activity and heart rate; whenever a person is more physically active than average (exercising), their heart rate tends to be higher (within-person effect), and people who often exercise tend to have a lower heart rate on average (between-person effect). In both examples, investigating only one level will not tell the full picture, and performing a cross-sectional analysis will allow only limited conclusions because the results will be a blend of the two sources of variation. This shows that separating within- and between-person effects is often crucial.

Using person-wise Sample Means as Proxies for True person-wise Means

While modeling techniques exist that adequately aim to separate the within-person and betweenperson sources of variation (e.g., Asparouhov et al., 2018; Epskamp, 2020), doing so is not always trivial and often leads to computationally heavy estimation procedures that quickly grow intractable for larger numbers of time points and/or variables. To this end, an often-used trick to greatly simplify modeling of such data is to use the person-wise sample means as proxies for the true person-wise means (Hamaker & Grasman, 2015):

$
\begin{equation}\label{eq:samplemean}
\hat{\pmb{\mu}}_p = \bar{\pmb{y}}_p = \frac{1}{n_t} \sum_{t=1}^{n_t} \pmb{y}_{t,p} = \frac{1}{n_t} \left( n_t \pmb{\mu}_p + \sum_{t=1}^{n_t} \pmb{\xi}_{t,p}\right)
\end{equation}
\tag{2}
$

Here, the data can be within-person centered using person-wise sample means $\bar{\pmb{y}}_p$to obtain withinperson effects, and relationships between (or the variance–covariance structure of) elements of $\bar{\pmb{y}}_p$can be used to obtain estimates of between-person effects. This trick relies on the fact that the person-wise sample means are unbiased maximum likelihood estimators of the population personwise means. An example of an application of this method is the popular mlVAR R package for estimating multilevel graphical vector-autoregressive models (Epskamp et al., 2018), which uses the person-wise sample means to (a) within-person center data for obtaining within-person temporal and contemporaneous effects, and (b) obtain between-person relationships between stable means. We describe the mlVAR algorithm in more detail in Appendix A.1.

While the person-wise sample means are indeed unbiased maximum likelihood estimators for the true person-wise means, Equation (2) also shows that $\pmb{\mu}_p \not= \bar{\pmb{y}}_p$ and that the person-wise sample means in $\bar{\pmb{y}}_p$ are in part caused by within-person deviations in $\pmb{\xi}_{t,p}$. This means that the variance-covariance structure of $\bar{\pmb{y}}_P$ will be in part determined by within-person variance – a property that we show formally in Section 4.1. As such, we can expect that analyzing relationships between elements of $\bar{\pmb{y}}_p$ may not only reflect relationships in $\pmb{\mu}_p$, and thus that the estimated between-person relationships based on person-wise sample means may not reflect population between-person relationships. In what follows, we investigate this in more detail. First by illustrating the issue in a simulation study and then by providing mathematical derivation of the variance–covariance structure of $\bar{\pmb{y}}_P$.

Illustrating Dependency of Between-Person Effects on Within-Person Correlations

We first simulate from a two-variable multilevel VAR model to show how within-person correlations lead to bias in between-person effects as a function of how large the between-person variance is relative to the within-person variance, and sample size. This bias also affects the between-person network estimated with the mlVAR method, which we explicitly demonstrate in a second simulation.

Effects on Between-Person Correlations

We can illustrate the effect of within-person correlations on correlations between person-wise means using any longitudinal model that separates within-person and between-person variances. Here, we generate data under a multilevel lag-1 vector autoregressive (VAR) model with $n_v=2$ variables, since we suspect most readers will be familiar with it:

$
\begin{equation}
\pmb{y}_{t,p} = \pmb{\mu}_p + \pmb{B}_p \left(\pmb{y}_{t-1,p} – \pmb{\mu}_p \right) + \pmb{\zeta}_{t,p}
,
\label{eq:VAR}
\end{equation}
\tag{3}
$

where $\pmb{\mu}_p$ are person-specific means, $\pmb{B}_p$ are person-specific lagged effects of order $1$ and $\pmb{\zeta}_{t,p}$ is the person-specific innovation variance. We choose the entries of $\pmb{B}_p$ such that the VAR model is stable/stationary (for an accessible introduction to non-stationarity see Ryan et al., 2023).

To keep the illustration simple, we only allowed the means to differ between subjects, and generate homogeneous temporal coefficients in which every element is the same value $\beta$:

$
\pmb{B}_p = \beta \begin{bmatrix} 1 & 1 \\ 1 & 1 \end{bmatrix} \quad \forall_p,
$

and set the innovation variance–covariance matrix to an identity matrix to generate standard normal orthogonal innovations:^[2]

$
\mathrm{var}\left( \pmb{\zeta}_{T,p}\right) = \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix} \quad \forall_p.
$

Finally, we generate the true person-wise means for both variables from two independent normal distributions with standard deviation $\sigma_\mu$:

$
\mathrm{var}\left( \pmb{\mu}_{P}\right) = \sigma^2_\mu \begin{bmatrix} 1 & 0 \\ 0 & 1 \end{bmatrix}.
$

As such, the true between-person covariance/correlation between the two variables equals $0$, the parameter $\beta$ controls the strength of within-person (temporal) relationships between the two variables, and the parameter $\sigma_\mu$ controls the variance in true means across persons. Note that in this example, we manipulate the within-person correlations only with dependencies between variables in lagged effects. However, we could also manipulate within-person correlations by introducing correlations between innovations in $\mathrm{var}\left( \pmb{\zeta}_{T,p}\right)$. In the simulated example in the following subsection, we will introduce within-person correlations in both ways.

We denote the true between-person correlation with $\rho = \text{cor}(\mu_{1,P},\mu_{2,P}) = 0$ and the estimated between-person correlation based on the person wise sample means with $\hat \rho = \text{cor}(\bar{y}{1,P},\bar{y}{2,P})$. In what follows we simulate data under this multilevel VAR model for different values of $\beta$, $\sigma_\mu$ and $n_t$ and evaluate the resulting observed correlation between person-wise means. We varied the within-person strength of relationship $\beta$ ranging from $0$ to $0.45$, the between-person strength of relationship $\sigma_\mu$ ranging from $0$ to $1$, and the number of time points per person across four levels $n_t \in {20, 100, 250, 1000}$. We simulate data with a relatively large sample size of $n_p=500$ subjects in each condition to ensure that the influence of sampling variation on our results is minimal. Given that the only population differences between persons are in the true person-wise means, which are uncorrelated ($\rho= 0$), we might expect that also the observed correlations between sample means ($\hat{\rho}$) will be zero. As such, we are interested in seeing if $\hat{\rho}$ is a good approximation of $\rho$. Figure 1 shows the estimated correlation between sample means $\hat \rho$ rounded to the second decimal mapped to a color gradient from white (0) to blue (1).

Figure 1

The correlation between within-person sample means $\hat \rho$, as a function of the size of lagged effects $\beta$ (x-axis), the true between-person standard deviation $\sigma_\mu$ (y-axis), and the length of the time series (panels). Note that if there is no bias, the correlation should be about zero in all cells.

Since the true between-person correlation is $\rho = 0$ in the data generating model, any nonzero value for $\hat \rho$ in Figure 1 represents bias. If $\beta = 0$ the within-person correlation is equal to zero and therefore also $\hat \rho \approx 0$ for all $\sigma_{\mu}$ and $n_t$ in all simulated condition ($\hat \rho$ would be exactly $0$ if we let $n_p \rightarrow \infty$). This makes sense, because there is no mechanism in the model that could create a nonzero correlation. However, for $\beta > 0$ we see nonzero correlations between means ($\hat \rho > 0$). Specifically, we see that $\hat \rho$ increases with $\beta$ and decreases with higher between-person variation in $\sigma_{\mu}$ and longer time series $n_t$. While we will derive these results analytically below, we can already offer the following intuitive explanation: when increasing $\sigma_{\mu}$, the proportion of between-person variance in the overall variance term is increasing and as a result the proportion of variance coming from within-person correlations has to decrease. Therefore, the correlation $\hat{\rho}$ has to decrease when increasing $\sigma_{\mu}$. For large samples $n_t \rightarrow \infty$, the sample estimates will converge to their true values, which are uncorrelated, which means the covariance has to go to zero. However, this is not necessarily true for normalized measures such as correlations. In this case, $\hat \rho$ only converges to zero if $\sigma_{\mu} = 0$. This can be seen in Figure 1: if $\sigma_{\mu} >0$ the correlation $\hat \rho$ decreases with increasing $n_t$; however, for $\sigma_{\mu} = 0$ it is independent of $n_t$.

When is this bias a problem? If we take $\hat \rho$ simply as the observed correlation between sample means and draw no conclusion about how this correlation is generated, or acknowledge that it is generated by an unknown combination of within- and between-level effects, then there is no problem. However, if we take the observed correlation between person-wise means $\hat \rho$ as an estimate of the population correlation between person-wise means $\rho$, then we would interpret $\hat \rho$ incorrectly, since it is a function of both $\rho$ and within-person correlations. In that sense, $\hat \rho$ is a biased estimate of $\rho$, and Figure 1 shows that this bias can be considerable in conditions in which $\beta$ is large, $\sigma_\mu$ is small, and $n_t$ is small. This problem extends to other methods which use sample means as proxies of population means, such as the method of R-package mlVAR discussed in Section 2.3. We illustrate this next.

Effects on Between-Person Network in multilevel VAR Model

The mlVAR package uses person-wise sample means as proxies for person-wise population means. Here we illustrate how this approximation leads to biases in the “between-network” consisting of partial correlations between random intercepts in a multilevel VAR model.

To this end, we again use a small simulation. We generated data for $n_p = 250$ subjects on $n_t = 250$ time points on $n_v = 8$ variables. The data generating structures were inspired by previous simulation studies (Du et al., 2024; Epskamp et al., 2018): the temporal network was a skip-1 chain graph (each variable predictive of itself and a second neighbor) with auto-regressive effects set to $0.5$ and cross-lagged coefficients set to $0.25$, the contemporaneous network was a chain graph with edge weights (partial correlations) of $0.35$, the innovation variances were set to $1$, and the between-person network was empty (no between-person covariances). The person-wise means were allowed to vary across people, but no other heterogeneity across persons was included. We generated two datasets: a within-person dataset in which scores for each person were centered on $0$, and a set of means for each person with standard deviation of $1$. Next, we combined these datasets in four different ways by adding the within-person data to differently scaled simulated means (to control the between-person standard deviation in the generated data). This way, we constructed four datasets: a dataset in which the means were multiplied by $0$ to generate a data with no between-person variance ($\sigma_\mu=0$), and three datasets in which the means were multiplied with different values $\sigma_\mu \in { 0.1, 0.25, 0.5 }$ to generate both within- and between-person variance. We estimated the model parameters using mlVAR under default settings with orthogonal random effects.

Figure 2 shows the estimated between-person networks. Since the true between-person network is empty, we would hope that the estimates are are close to zero and set to zero with probability $1-\alpha$, where $\alpha$ is the specified significance threshold for hypothesis testing. However, the results show that the we estimated strong between-person effects. Most of these effects are not set to zero (solid vs. dashed lines) by a significance test with $\alpha = 0.05$. In fact, we even see negative relationships in the between-person network, despite the fact that we neither have negative relationships in the lagged effects, nor the innovation partial correlation matrix.^[3]These biases decrease with increasing variance of the true between-person variance, since this implies that the within-person variance component has to become relatively smaller. This is the same effect as illustrated by varying the x-axis in Figure 1 above.

Both illustrations show that when correlating person-wise means, the observed correlations are a function of both within-person and between-person correlations. We can separate the influence of both variance components using hierarchical models. However, if we estimate these models by using person-wise sample means as proxies for their population values, a bias remains. Now that we have illustrated how within-person correlations can bias between-person correlations, we formally derive how the latter are a function of the former.

Figure 2

Between-person networks estimated with the mlVAR method for between-person standard deviations in means of $0$, $0.1$, $0.25$, and $0.5$ on 250 subjects with a time series length of 250 each. Blue edges indicate positive parameters, red edges indicate negative parameters. Solid edges are included at a significance threshold of $\alpha=0.05$. The data generating (true) between-person network is empty, which means that all shown effects are bias induced by the lagged and contemporaneous within-effects.

Deriving how Between-Person Effects Depend on Within-Person Effects

In this section, we will analytically derive the variance–covariance structure of the person-wise sample means in order to investigate how between-person (co-)variances are a function of within-person relationships.

General Framework

Based on the model in Equation (1), we will assume that the person-wise deviations neither have bias over time nor has bias between people:

$
\mathcal{E}\left(\pmb{\xi}_{T,p}\right) = \mathcal{E}\left(\pmb{\xi}_{t,P}\right) = \pmb{0} \quad \forall_{t,p},
$

in which $\mathcal{E}$ denotes the expected value taken over the capitalized subscript (which indicates the dimension that is random). The expected value of $\pmb{\mu}$ will be denoted with a grand mean $\pmb{\mu}_{*}$:

$
\pmb{\mu}_{*} = \mathcal{E}\left(\pmb{\mu}_{P} \right).
$

We will assume further that $\pmb{\xi}$ and $\pmb{\mu}$ are normally distributed. The variance-covariance matrix of $\pmb{\mu}$ can then be interpreted as a between-person variance-covariance matrix:

$
\mathrm{Var}\left(\pmb{\mu}_{P}\right) = \underset{\mathrm{bet}}{\pmb{\Sigma}}.
$

The covariance of $\pmb{\xi}$ over time can be interpreted as a within-person variance–covariance matrix:^[4]

$
\mathrm{Cov}\left(\pmb{\xi}_{i,P}, \pmb{\xi}_{j,P}\right) = \underset{\mathrm{within}}{\pmb{\Sigma}^{(i,j)}}.
$

in which $i$ and $j$ are two different time-points. The assumption that $\mathcal{E}\left(\pmb{\xi}{T,p}\right) = \pmb{0} \, \forall{p}$ naturally implies that $\mathrm{Cov}\left(\pmb{\mu}{P}, \pmb{\xi}_{t,P}\right) = \pmb{O} \, \forall{t}$. As such, the covariance matrix between any two time points $i$ and $j$ becomes (Epskamp, 2020):

$
\mathrm{Cov}\left(\pmb{y}_{i,P}, \pmb{y}_{j,P}\right) = \pmb{\Sigma}^{(i,j)} = \underset{\mathrm{bet}}{\pmb{\Sigma}} + \underset{\mathrm{within}}{\pmb{\Sigma}^{(i,j)}}.
$

Assuming that $\underset{\mathrm{within}}{\pmb{\Sigma}^{(i,j)}} \not= \pmb{O} \, \forall_{i,j}$, the joint likelihood of all observations can not be simplified beyond modeling the covariance between every item at every time point. To do this, we can form a normally distributed joint vector $\pmb{u}$ that contains the vectorized time series of person $p$:

$
\pmb{u}_p = \begin{bmatrix} \pmb{y}_{1,p} \\ \pmb{y}_{2,p} \\ \vdots \\ \pmb{y}_{n_t,p} \end{bmatrix}, \quad \pmb{u} \sim N\left(\pmb{\mu}, \pmb{\Sigma}\right),
$

with the following distributional parameters:

$
\pmb{\mu} = \pmb{1}_{n_t} \otimes \pmb{\mu}_{*} = \begin{bmatrix} \pmb{\mu}_{*} \\ \pmb{\mu}_{*} \\ \vdots \\ \pmb{\mu}_{*} \end{bmatrix}, \quad \pmb{\Sigma} = \begin{bmatrix}
\pmb{\Sigma}^{(1,1)} & \pmb{\Sigma}^{(1,2)} & \cdots & \pmb{\Sigma}^{(1,n_t)} \\
\pmb{\Sigma}^{(2,1)} & \pmb{\Sigma}^{(2,2)} & \cdots & \pmb{\Sigma}^{(2,n_t-1)} \\
\vdots & \vdots & \ddots & \vdots \\
\pmb{\Sigma}^{(n_t,1)} & \pmb{\Sigma}^{(n_t,2)} & \cdots & \pmb{\Sigma}^{(n_t,n_t)} \\
\end{bmatrix},
$

in which $\pmb{1}_{n_t}$ represents a length $n_t$ vector of ones, and $\otimes$ the Kronecker product.

Of note, while the variance-covariance structure above can be used to understand the behavior of these models, the model parameters are not always estimated through maximum likelihood estimation using the full joint likelihood expressed above. This is especially the case for time-series data where $n_t$ and $n_v$ are large, leading to $\pmb{\Sigma}$ to become an intolerably large matrix that cannot be handled for optimisation purposes by standard commercial computers. To this end, several methods have been proposed that take shortcuts in estimating model parameters from such models, such as analyzing the Topelitz covariance matrix (Hamaker et al., 2002) or by estimating only parts of the model using univariate multilevel regression models (Bringmann et al., 2013).

Person-wise Sample Means

The person-wise sample means can be obtained as follows. First, we define matrix $\pmb{E}$ as a block matrix that repeats a $n_v$ size identity matrix $n_t$ times:

$
\pmb{E} = \pmb{1}^{\top}_{n_t} \otimes \pmb{I} = \begin{bmatrix}
\pmb{I} & \pmb{I} & \cdots & \pmb{I}
\end{bmatrix}.
$

Next, we can define the person-wise sample means as follows:

$
\bar{\pmb{y}}_p = \frac{1}{n_t} \pmb{E} \pmb{u}_p.
$

The person-wise sample mean $\bar{\pmb{y}}_p$ is an unbiased estimator of the person-wise mean $\pmb{\mu}_p$, and it follows also that over people the expected value of the person-wise sample mean correctly leads to the grand mean:

$
\mathcal{E}\left(\bar{\pmb{y}}_P\right) = \pmb{\mu}_*.
$

The variance of the person-wise sample mean becomes:

$
\begin{align*}
\mathrm{Var}\left( \bar{\pmb{y}}_P \right) &= \frac{1}{n_t^2} \pmb{E} \pmb{\Sigma} \pmb{E}^{\top} \\
&= \frac{1}{n_t^2} \sum_{i=1}^{n_t} \sum_{j=1}^{n_t} \pmb{\Sigma}^{(i,j)} \\
&= \frac{1}{n_t^2} \sum_{i=1}^{n_t} \sum_{j=1}^{n_t} \left[ \underset{\mathrm{bet}}{\pmb{\Sigma}} + \underset{\mathrm{within}}{\pmb{\Sigma}^{(i,j)}} \right] \\
&\not= \underset{\mathrm{bet}}{\pmb{\Sigma}} \quad \text{ if } \underset{\mathrm{within}}{\pmb{\Sigma}^{(i,j)}} \not= \pmb{O} \text{ for any $i$ and $j$}.
\end{align*}
$

As such, the variance–covariance structure of the person-wise sample means does not necessarily equate to the variance–covariance structure of the true person-wise sample means. Instead, it is a weighted sum, with the $\underset{\mathrm{bet}}{\pmb{\Sigma}}$ matrix weighted $n_t^2$ times and every possible $\underset{\mathrm{within}}{\pmb{\Sigma}^{(i,j)}}$ matrix weighted once. This means that we cannot expect analyzing relationships between person-wise sample means to lead to correct estimates of population between-person relationships, as within-person relationships can also influence results based on the analysis of person-wise sample means.

If a model is stationary, we can see that $\pmb{\Sigma}$ takes the form of a Toeplitz matrix with:

$
\begin{align*}
\underset{\mathrm{within}}{\pmb{\Sigma}^{\text{lag-}0}} &= \underset{\mathrm{within}}{\pmb{\Sigma}^{(1,1)}} = \underset{\mathrm{within}}{\pmb{\Sigma}^{(2,2)}} = \cdots = \underset{\mathrm{within}}{\pmb{\Sigma}^{(n_t,n_t)}} & \text{included $n_t$ times} \\
\underset{\mathrm{within}}{\pmb{\Sigma}^{\text{lag-}1}} &= \underset{\mathrm{within}}{\pmb{\Sigma}^{(1,2)}} = \underset{\mathrm{within}}{\pmb{\Sigma}^{(2,3)}} = \cdots = \underset{\mathrm{within}}{\pmb{\Sigma}^{(n_t-1,n_t)}} & \text{included $2 (n_t-1)$ times} \\
\underset{\mathrm{within}}{\pmb{\Sigma}^{\text{lag-}2}} &= \underset{\mathrm{within}}{\pmb{\Sigma}^{(1,3)}} = \underset{\mathrm{within}}{\pmb{\Sigma}^{(2,4)}} = \cdots = \underset{\mathrm{within}}{\pmb{\Sigma}^{(n_t-2,n_t)}} & \text{included $2 (n_t-2)$ times} \\
&\vdots \\
\underset{\mathrm{within}}{\pmb{\Sigma}^{\text{lag-}(n_t-1)}} &= \underset{\mathrm{within}}{\pmb{\Sigma}^{(1,n_t)}} & \text{included $2$ times}.
\end{align*}
$

We can reasonably assume also then that the covariance between observations far away in time will be neglishable small:

$
\underset{\mathrm{within}}{\pmb{\Sigma}^{\mathrm{lag-}k}} \rightarrow \pmb{O} \quad \text{for higher values of $k$},
$

as otherwise the time-series would not be stationary. As such, we can derive the following implications:

If $\underset{\mathrm{bet}}{\pmb{\Sigma}} = \pmb{O}$, all effects found between person-wise sample means are due to within-person covariances (or Type~1 errors).
These effects will typically be small, but can lead to large standardized parameters (e.g., partial correlation coefficients).
If $\underset{\mathrm{bet}}{\pmb{\Sigma}} \not= \pmb{O}$, we expect that $\underset{\mathrm{bet}}{\pmb{\Sigma}}$ will dominate $\mathrm{Var}\left( \bar{\pmb{y}}_P \right)$ for increasing time-series lengths $n_t$.

As such, we expect that analyzing relationships between sample means will lead to accurate inference on true between-person effects for increasing time-series lengths $n_t$ and pending on the strength of the between-person variance in the data generating mechanism. If there is no between-person variance, relationships between person-wise sample means will reflect within-person relationships regardless of the sample size ($n_p$) or length of time-series ($n_t$). Note that the number of subjects $n_p$ does not impact this bias. However, we can note that larger sample sizes $n_p$ do reduce the standard errors on the estimated relationships between sample means. This means that we can expect that increasing the sample size $n_p$ but retaining a low number of repeated measures per person $n_t$ might lead to an increase in Type 1 errors in estimated between-person relationships. The accuracy of the estimates of person-wise means can only be improved by increasing the time-series length per person. These expectations are also visible in Figure 1, which shows that the bias is worse when the between-person variance is lower, the time-series length is shorter and the within-person relationship is stronger.

Discussion

We showed that when analyzing a longitudinal dataset consisting of multiple people measured on multiple time points, the commonly used method of using the person-wise sample means as proxies for true person-wise means in order to uncover between person effects can be biased by withinperson effects. This has implications for the interpretation of models for the statistical relationships between person-wise means, and for methods that make use of person person-wise sample means, such as the popular R package mlVAR for multilevel graphical vector auto-regressive (VAR) models.

The main implication of these results is that when analyzing statistical relationships between personwise means by directly modeling them or as part of the multilevel VAR model in mlVAR, these relationships cannot be interpreted as pure between-person effects. Instead, the correlation will be a function of both the population-between person effect and within-person correlations, and which contributes to which degree is unknown when analyzing empirical data. We found that the bias will decrease as a function of the time-series lengths and the proportion of between-person variance, but also that situations are conceivable where the bias is considerably large in reasonable betweenperson variances and time-series lengths. Our results are related to those of Hamaker (2023) who showed that correlations between cross-sections of multiple time series (i.e., a single time point of each measure of each subject, instead of the within-person sample analyzed here) is also a function of both within-person and between-person effects.

One way in which this bias can be avoided is by jointly estimating the model parameters corresponding to within-person and between-person effects in a single step. For the multilevel VAR model, this can be done in two accessible software suites: the R package psychonetrics (Epskamp, 2020) and the Dynamic Structural Equation Modeling (DSEM) module in MPlus version 8 and higher (Asparouhov et al., 2018; McNeish & Hamaker, 2020). Both methods, however, also have practical downsides. The implementation of the model in psychonetrics only allows for estimating random means (no random network structures) and is only suitable for panel data in which the number of time points is relatively low (e.g., 3-10) and the number of variables is moderate (up to about 20-30 variables). The DSEM model in MPlus, on the other hand, is not freely available and that currently it is computationally feasible only for 5-6 variables. MPlus also does not allow the direct modeling of partial correlation structures. An important avenue for future work would therefore be to provide a free implementation of DSEM that scales to larger number of variables. Another possibility could be to still estimate the multilevel VAR model in two steps such as in the mlVAR package, but to iterate between the two steps until convergence. Studying such an approach could be promising, however, it will also have to deal with the considerable computational cost involved in estimating many multilevel models.

Conflicts of Interest

The authors report no competing interests.

Acknowledgements

We would like to thank Oisín Ryan and Lourens Waldorp for helpful discussions about the topic of this paper. We would like to thank Janne Adolf for carefully reviewing our paper and our reproducibility archive, and spotting a coding error in an earlier version of this manuscript. JMBH has been supported by the gravitation project ‘New Science of Mental Disorders’ (www.nsmd.eu), supported by the Dutch Research Council and the Dutch Ministry of Education, Culture and Science (NWO gravitation grant number 024.004.016). SE was supported by National University of Singapore grant A-8001098-00-00.

Materials

Code to reproduce the simulation shown in Figure 1 can be found at https://github.com/jmbh/person-wiseMeanCor.

Endnotes

[1] To understand Nickell’s bias, consider a hypothetical longitudinal analysis in which data are within-person centered using sample means and only two observations per person are available. In this case, if the first observation is higher (lower) than the second, the first observation will be above (below) the mean and the second observation will be below (above) the mean, leading to an auto-correlation of −1. This bias gradually dissipates with more measures per person. For mlVAR, a minimum of 20 measures per person is recommended (Burger et al., 2022).

[2] This also means that we do not simulate data with correlated innovations, which can also be characterized as a contemporaneous network. The effects in this contemporaneous network are within-person effects, however, and we expect relationships at the contemporaneous level to similarly impact relationships at the between-person level.

[3] However, the model implied within-person lag-0 network does include negative edges, explaining the observed bias in the between-person network. Interestingly, the model-implied marginal variance–covariance structure also does not include negative covariances.

[4] Epskamp (2020) shows that covariance of $\pmb{\xi}$ across people is the fixed-effect average of the true within-person covariances of $\pmb{\xi}$ across time for a given individual.

Readers of this article also read

Methods

Cross-lagged panel networks

by Anna Wysocki, Ian McCarthy, Riet van Bork, Angélique O. J. Cramer, & Mijke Rhemtulla Cited by 38

Methods

Changing networks: Moderated idiographic psychological networks

by Laura F. Bringmann, Sigert Ariens, Anja F. Ernst, Evelien Snippe, & Eva Ceulemans Cited by 14

Methods

Simplifying Bayesian analysis of graphical models for the social sciences with easybgm: A user-friendly R-package

by Karoline B. S. Huth, Sara Keetelaar, Nikola Sekulovski, Don van den Bergh, & Maarten Marsman Cited by 12

Methods

Comparing maximum likelihood and maximum pseudolikelihood estimators for the Ising model

by Sara Keetelaar, Nikola Sekulovski, Denny Borsboom, & Maarten Marsman Cited by 8

Methods

Discovering cyclic causal models in psychological research

by Kyuri Park, Lourens J. Waldorp, & Oisín Ryan Cited by 5

Methods

Relating network-instantiated constructs to psychological variables through network-derived metrics: An exploratory study

by Simran K. Johal & Mijke Rhemtulla Cited by 4

Methods

Sensitivity analysis of prior distributions in Bayesian graphical modeling: Guiding informed prior choices for conditional independence testing

by Nikola Sekulovski, Sara Keetelaar, Jonas Haslbeck, & Maarten Marsman Cited by 3

Methods

The effects of omitted variables and measurement error on cross-sectional network psychometric models

by Teague R. Henry & Ai Ye Cited by 3

Editor Curated

Frequently Asked Questions

What is the difference between within-person and between-person effects?
Between-person effects describe relationships across different people, typically looking at their average or stable traits. For example, “Are people who are on average more anxious also on average more depressed?” Within-person effects describe relationships within a single individual over time. For example, “When I feel more anxious than my personal average, do I also tend to feel more depressed at that same time?” This study investigates how these two different levels of analysis can get confused.
What is a person-wise mean in longitudinal data?
A person-wise mean (or person-mean) is an individual’s average score on a specific variable, calculated across all the time points in an intensive longitudinal study. For example, if a participant’s anxiety is measured 100 times over a month, their person-wise mean for anxiety would be the single average score of those 100 measurements.
Can you trust correlations between person-wise means?
Not always. This research warns that simply calculating the person-wise means for two variables (like anxiety and depression) and then correlating them can be highly misleading. The study shows that this observed correlation is not a pure measure of the between-person relationship. Instead, it is a “mixture” of the true between-person correlation and the within-person dynamics (how the variables interact over time within an individual).
How do within-person effects bias correlations between person-means?
This study by Haslbeck & Epskamp (2024) in advances.in/psychology demonstrates that the correlation observed between person-means (the between-person analysis) can be artificially inflated or deflated by the underlying within-person relationships. The bias becomes stronger (i.e., the observed correlation is more misleading) when:
1. The number of time points is small.
2. There is high temporal dependency (strong autocorrelation, or “inertia,” where a feeling like anxiety persists over time).
3. There is a strong within-person contemporaneous correlation (e.g., feeling anxious and depressed at the exact same time).
This bias decreases as the number of time points for each person gets larger.
What is the risk of correlating person-means from time-series data?
The primary risk is drawing false scientific conclusions. This study’s simulations show that the bias can be substantial. For example, a true medium between-person correlation could be incorrectly observed as a very large correlation simply because of the influence from the within-person dynamics. This bias can affect psychological theories and the results of common analysis algorithms that rely on person-means, leading to a flawed understanding of how variables are related.

References

Asparouhov, T., Hamaker, E. L., & Muthén, B. (2018). Dynamic structural equation models. Structural Equation Modeling: A Multidisciplinary Journal, 25(3), 359–388. https://doi.org/10.1080/10705511.2017.1406803

Bringmann, L. F., Vissers, N., Wichers, M., Geschwind, N., Kuppens, P., Peeters, F., Borsboom, D., & Tuerlinckx, F. (2013). A network approach to psychopathology: New insights into clinical longitudinal data. PloS one, 8(4), e60188. https://doi.org/10.1371/journal.pone.0060188

Burger, J., Hoekstra, R. H. A., Mansueto, A. C., & Epskamp, S. (2022). Network estimation from time series and panel data. In Network psychometrics with r (pp. 169–192). Routledge. https://doi.org/10.4324/9781003111238-13

Conner, T. S., & Barrett, L. F. (2012). Trends in ambulatory self-report: The role of momentary experience in psychosomatic medicine. Psychosomatic medicine, 74(4), 327–337. https://doi.org/10.1097/PSY.0b013e3182546f18

Du, X., Skjerdingstad, N., FreichJel, R., Ebrahimi, O. V., Hoekstra, R. H. A., & Epskamp,

S. (2024). Moving from exploratory to confirmatory network analysis: An evaluation of sem fitindices and cutoff values in network psychometrics. PsyArXiv preprint. https://doi. org/10.31234/osf.io/d76ab

Epskamp, S., Hoekstra, H. A., Burger, J., & Waldorp, L. J. (2022). Chapter 9: Longitudinal Design Choices: Relating Data to Analysis. In M. Isvoranu, S. Epskamp, L. J. Waldorp, & D. Borsboom (Eds.), Network Psychometrics with R: A guide for behavioral and social scientists. Routledge, Taylor & Francis Group. https://doi.org/10.4324/9781003111238-12

Epskamp, S. (2020). Psychometric network models from time-series and panel data. Psychometrika, 85(1), 206–231. https://doi.org/10. 1007/s11336-020-09697-3

Epskamp, S., Rhemtulla, M., & Borsboom, D. (2017). Generalized network psychometrics: Combining network and latent variable models. Psychometrika, 82, 904–927. https://doi. org/10.1007/s11336-017-9557-x

Epskamp, S., Waldorp, L. J., Mõttus, R., & Borsboom, D. (2018). The gaussian graphical model in cross-sectional and time-series data. Multivariate Behavioral Research, 53(4), 453–480. https://doi.org/10.1080/00273171. 2018.1454823

Hamaker, E. L. (2012). Why researchers should think “within-person”: A paradigmatic rationale. In Handbook of research methods for studying daily life (pp. 43–61). The Guilford Press New York, NY.

Hamaker, E. (2023). The curious case of the cross-sectional correlation. Multivariate Behavioral Research, 1–12. https://doi.org/10.1080/00273171.2022.2155930

Hamaker, E. L., Dolan, C. V., & Molenaar, P. C. (2002). On the nature of sem estimates of arma parameters. Structural Equation Modeling, 9(3), 347–368. https://doi.org/10.1207/ S15328007SEM0903_3

Hamaker, E. L., & Grasman, R. P. (2015). To center or not to center? investigating inertia with a multilevel autoregressive model. Frontiers in psychology, 5, 1492. https://doi.org/10.3389/ fpsyg.2014.01492

Hamaker, E. L., Kuiper, R. M., & Grasman, R. P. (2015). A critique of the cross-lagged panel model. Psychological methods, 20(1), 102–116. https://doi.org/10.1037/a0038889

Hamaker, E. L., & Wichers, M. (2017). No time like the present: Discovering the hidden dynamics in intensive longitudinal data. Current Directions in Psychological Science, 26(1), 10– 15. https://doi.org/10.1177/0963721416666518

Hoffman, L. (2015). Longitudinal analysis: Modeling within-person fluctuation and change. Routledge. https://doi.org/10.4324/ 9781315744094

Jordan, D. G., Winer, E. S., & Salem, T. (2020). The current status of temporal network analysis for clinical science: Considerations as the paradigm shifts? Journal of Clinical Psychology, 76(9), 1591–1612. https://doi.org/10.1002/ jclp.22957

Kuppens, P., Dejonckheere, E., Kalokerinos, E. K., & Koval, P. (2022). Some recommendations on the use of daily life methods in affective science. Affective Science, 3(2), 505–515. https://doi.org/10.1007/s42761-022-00101-0

Lauritzen, S. L. (1996). Graphical models (Vol. 17). Clarendon Press.

McNeish, D., & Hamaker, E. L. (2020). A primer on two-level dynamic structural equation models for intensive longitudinal data in mplus. Psychological methods, 25(5), 610–635. https://doi.org/10.1037/met0000250

Miller, G. (2012). The smartphone psychology manifesto. Perspectives on psychological science, 7(3), 221–237. https://doi.org/10.1177/1745691612441215

Molenaar, P. C. M. (2004). A Manifesto on Psychology as Idiographic Science: Bringing the Person Back Into Scientific Psychology, This Time Forever. Measurement: Interdisciplinary Research & Perspective, 2(4), 201–218. https://doi.org/10.1207/s15366359mea0204_1

Molenaar, P. C. (1985). A dynamic factor model for the analysis of multivariate time series. Psychometrika, 50(2), 181–202. https://doi.org/10.1007/BF02294246

Molenaar, P. C. (2017). Equivalent dynamic models. Multivariate behavioral research, 52(2), 242–258. https://doi.org/10.1080/00273171.2016.1277681

Nickell, S. (1981). Biases in dynamic models with fixed effects. Econometrica: Journal of the econometric society, 49(6), 1417–1426. https://doi.org/10.2307/1911408

Ryan, O., Haslbeck, J., & Waldorp, L. (2023). Non-stationarity in time-series analysis: Modeling stochastic and deterministic trends. PsyArXiv preprint. https://doi.org/10.31234/osf.io/z7ja2

Trull, T. J., & Ebner-Priemer, U. (2014). The role of ambulatory assessment in psychological science. Current directions in psychological science, 23(6), 466–470. https://doi.org/10.1177/0963721414550706

Appendix

A.1 The mlVAR algorithm: two-step multilevel graphical vector autoregression

A popular method in which we expect this effect to occur is the two-step multilevel graphical vector auto-regression algorithm implemented in the mlVAR package for R (Epskamp et al., 2018). We will term this model the mlVAR-model and the corresponding estimation algorithm the mlVARalgorithm for the remainder of this paper. The mlVAR-algorithm aims to estimate parameters corresponding to a multilevel lag-1 graphical vector auto-regressive model:

$
\pmb{y}_{t,p} = \pmb{\mu}_p + \pmb{B}_p \left(\pmb{y}_{t-1,p} – \pmb{\mu}_p \right) + \pmb{\zeta}_{t,p}.
$

Here, the $\pmb{B}_p$ matrix is a person-specific matrix of temporal coefficients, which is typically used to draw a temporal network structure. The variance-covariance structure of the innovation term $\pmb{\zeta}$ can be modeled as a Gaussian graphical model (GGM; Lauritzen, 1996)—a model of partial correlation coefficients (Epskamp et al., 2017):

$
\mathrm{Var}\left( \pmb{\zeta}_{T,p} \right) = \pmb{\Sigma}_p^{(\zeta)} = \pmb{\Delta}^{(\zeta)}_p \left( \pmb{I} – \pmb{\Omega}^{(\zeta)}_p \right)^{-1} \pmb{\Delta}^{(\zeta)}_p,
$

in which $\pmb{\Omega}_p$contains partial correlation coefficients—typically visualized as a person-specific contemporaneous network—and $\pmb{\Delta}_p$is a diagonal scaling matrix. When taking the person subscript as random, as needed for the sampling distribution of the person-specific sample mean, we can replace these matrices with their fixed-effects counterparts (Epskamp, 2020):

$
\begin{equation}
\mathrm{Var}\left( \pmb{\zeta}_{\dagger,P} \right) = \pmb{\Sigma}_*^{(\zeta)} = \pmb{\Delta}^{(\zeta)}_* \left( \pmb{I} – \pmb{\Omega}^{(\zeta)}_* \right)^{-1} \pmb{\Delta}^{(\zeta)}_*,
\end{equation}
\tag{4}
$

in which the $\dagger$ symbol stands for `any time-point’ and the asterisk subscript $*$ denotes the fixed-effect (average over people). The person-specific mean can also be modeled as a GGM:

$
\begin{equation}
\underset{\mathrm{bet}}{\pmb{\Sigma}} = \mathrm{Var}\left( \pmb{\mu}_{P} \right) = \pmb{\Sigma}^{(\mu)} = \pmb{\Delta}^{(\mu)} \left( \pmb{I} – \pmb{\Omega}^{(\mu)} \right)^{-1} \pmb{\Delta}^{(\mu)}.
\end{equation}
\tag{5}
$

Epskamp (2020) show that the implied within-person fixed-effect variance–covariance matrices of this model take the following forms:

$
\begin{align}
\mathrm{vec}\left(\underset{\mathrm{within}}{\pmb{\Sigma}^{\mathrm{lag-}0}} \right) &= \left( \pmb{I} \otimes \pmb{I} – \pmb{B}_* \otimes \pmb{B}_* \right) \mathrm{vec}\left( \pmb{\Sigma}^{(\zeta)}_{*} \right) \label{eq:lag0within}\\
\underset{\mathrm{within}}{\pmb{\Sigma}^{\mathrm{lag-}k}} &= \pmb{B}_* \underset{\mathrm{within}}{\pmb{\Sigma}^{\mathrm{lag-}(k-1)}} \quad \text{ if } k \not= 0 \nonumber
\end{align}
\tag{6}
$

Estimating this model directly through multivariate optimization, however, quickly grows prohibitively intractable for increasing values of $n_t$ and $n_v$.

Epskamp et al. (2018) propose to estimate the parameters of the mlVAR-model through a series of univariate multilevel regressions, by combining the work of Bringmann et al. (2013) on estimating multilevel temporal networks with the work of Hamaker and Grasman (2015) on separating withinand between-person effects. The mlVAR-algorithm comprises of two steps, each consisting of $n_v$ multilevel models to be estimated:

Step 1: Temporal and between-person effects are estimated by using each variable once as dependent variable, predicted by all within-person centered lagged variables (for the temporal effects) and person-wise sample means of each other variable (for the between-person effects).
Step 2: The estimated residuals from the Step 1 models are used in a new sequence of univariate multilevel models in which each residual is once treated as dependent variable predicted by the remaining residuals.

Clearly, the Step 1 models heavily rely on person-wise sample means to within-person center the lagged predictors as well as to estimate the between-person effects. We therefore expect the bias we demonstrate above to occur in the mlVAR-algorithm. Below, we study in simulations the extent of this bias.

Intellectual humility is linked to discernment but not response bias in this misinformation study

Research Article

Intellectual humility is associated with greater misinformation discernment and metacognitive insight but not response bias

September 13, 2024 (Cited by: 9)

Meta-analysis on intellectual humility, misinformation, and conspiracy theories

Review

Intellectual humility and misinformation receptivity: A meta-analytic review

September 27, 2024 (Cited by: 10)

Shifts and continuity in contemporary violent extremism: Moving toward a dynamic and developmental understanding?

The end of ideological violence

System-disillusioned youth radicalize: The role of political alienation in the formation of political radicalism

Observed correlations between person-means depend on within-person correlations

Abstract

Key Takeaways

Introduction

Within- and Between-Person Effects

Within-person Effects

Between-person Effects

Using person-wise Sample Means as Proxies for True person-wise Means

Illustrating Dependency of Between-Person Effects on Within-Person Correlations

Effects on Between-Person Correlations

Effects on Between-Person Network in multilevel VAR Model

Deriving how Between-Person Effects Depend on Within-Person Effects

General Framework

Person-wise Sample Means

Discussion

Conflicts of Interest

Acknowledgements

Materials

Endnotes

Cross-lagged panel networks

Changing networks: Moderated idiographic psychological networks

Simplifying Bayesian analysis of graphical models for the social sciences with easybgm: A user-friendly R-package

Comparing maximum likelihood and maximum pseudolikelihood estimators for the Ising model

Discovering cyclic causal models in psychological research

Relating network-instantiated constructs to psychological variables through network-derived metrics: An exploratory study

Sensitivity analysis of prior distributions in Bayesian graphical modeling: Guiding informed prior choices for conditional independence testing

The effects of omitted variables and measurement error on cross-sectional network psychometric models

Frequently Asked Questions

References

Appendix

Previous Post

Intellectual humility is associated with greater misinformation discernment and metacognitive insight but not response bias

Next Post

Intellectual humility and misinformation receptivity: A meta-analytic review