Assessing dynamic functional connectivity in heterogeneous samples
Abstract
Several methods have been developed to measure dynamic functional connectivity (dFC) in fMRI data. These methods are often based on a sliding-window analysis, which aims to capture how the brain's functional organization varies over the course of a scan. The aim of many studies is to compare dFC across groups, such as younger versus older people. However, spurious group differences in measured dFC may be caused by other sources of heterogeneity between people. For example, the shape of the haemodynamic response function (HRF) and levels of measurement noise have been found to vary with age. We use a generic simulation framework for fMRI data to investigate the effect of such heterogeneity on estimates of dFC. Our findings show that, despite no differences in true dFC, individual differences in measured dFC can result from other (non-dynamic) features of the data, such as differences in neural autocorrelation, HRF shape, connectivity strength and measurement noise. We also find that common dFC methods such as k-means and multilayer modularity approaches can detect spurious group differences in dynamic connectivity due to inappropriate setting of their hyperparameters. fMRI studies therefore need to consider alternative sources of heterogeneity across individuals before concluding differences in dFC.
Graphical abstract

Highlights
•Estimates of dFC may be affected by various sources of between-people heterogeneity.
•We simulated fMRI data to investigate the effect of such heterogeneity on dFC.
•Differences in estimated dFC resulted from non-dynamic features of the data.
•Choice of parameters in common methods produced spurious group differences in dFC.
Introduction
Brain connectivity can refer to a number of different types of relation between distinct regions in the brain. While structural connectivity refers to the anatomical links between brain regions, functional connectivity (FC) describes how activity in different regions is related over time. In functional magnetic resonance imaging (fMRI), this is commonly measured using the Pearson correlation between the fMRI time series in different brain regions (Biswal et al., 1995). Typically, one would calculate the correlation between two time series over the course of the whole fMRI scan. However, this approach may represent an average across informative fluctuations in FC. Indeed, recent evidence suggests that even in task-free, resting-states these functional connections change over the course of a scan (Allen et al., 2012, Chang and Glover, 2010, Kiviniemi et al., 2011). Moreover, measures of this dynamic functional connectivity (dFC) have been used in an attempt to identify biomarkers for schizophrenia (Sakoğlu et al., 2010) and Alzheimer's disease (Jones et al., 2012).
The most common way to measure dFC is to apply a sliding-window analysis (see Hutchison et al. (2013) for a review of dFC). Methods to analyse the changes in connectivity across windows vary in complexity. A simple approach characterises dFC as the standard deviation (SD) of the correlation values across time windows (Elton and Gao, 2015). Alternatively, one can pool data across individuals and use k-means clustering to identify recurring connectivity patterns (Allen et al., 2012), or “FC states”. Another class of methods applies network theory on an individual level. The brain can be characterised as a complex graph, with distinct brain regions corresponding to nodes, and functional connections corresponding to edges between nodes (Bullmore and Sporns, 2009). Differences in the properties of the resulting graphs can then be used as measures of dFC (Bassett et al., 2011;Bassett et al., 2013b). Crucially, both these classes of methods require a choice of hyperparameters, which have to be estimated from the data. For example, in a k-means analysis, the number of clusters k has to be prespecified.
Some of the challenges facing current methods, such as the choice of window width and the effect of data pre-processing, have already been discussed in Hutchison et al. (2013). Furthermore, Shakil et al. (2016) have shown that the width and offset of windows can have an effect on the detection of FC state transitions and duration in a k-means analysis. In this paper we address an additional issue, namely unaccounted heterogeneity between individuals. While the aim of group studies is to detect heterogeneity in true dFC, other sources of heterogeneity may have an impact on estimated dFC. Heterogeneity can arise in a variety of ways. For example, Arbabshirani et al. (2014a) found the autocorrelation of fMRI time series within brain regions to differ between healthy brains and those with schizophrenia, and autocorrelation is known to affect estimation of cross-correlation (Arbabshirani et al., 2014b). Although it is unclear whether this change in autocorrelation is due to neural or vascular factors, work with dynamic causal modeling has suggested that neural autocorrelation within some networks can vary between young and older participants (Tsvetanov et al., 2016). Another example of heterogeneity is variability in the haemodynamic response function (HRF). The shape of the HRF, which can be modelled as a finite impulse response kernel, has been found to vary between healthy patients and patients with schizophrenia (Hanlon et al., 2016) and also between age groups (Huettel et al., 2001, Aizenstein et al., 2004, D'Esposito et al., 1999). Even non-neural physiological noise levels might differ across groups, owing for example to greater within-scan head movement in old relative to young subjects (Geerligs et al., 2015).
Here, we aim to investigate how unaccounted heterogeneity impacts estimates of dFC. Building on previous work by Allen et al. (2012), we designed a simulation framework to generate data from a dynamic connectivity structure based on FC states. We characterise FC states as time periods in which brain regions can be grouped into specific sets, or “modules”. In a given FC state, regions are considered connected if, and only if, they are in the same module. In this framework, changes in connectivity structure then correspond to FC state transitions. Data were generated to investigate the effect of individual differences in neural autocorrelation, HRF shape, connectivity strength and measurement noise on estimated dFC. We specifically used the case of aging to illustrate how plausible age-related sources of heterogeneity could impact dFC estimates. Furthermore, we varied the number of FC states and frequency of FC state transitions, in order to explore the effect of hyperparameter selection on the results of popular dFC methods such as the k-means method used by Allen et al. (2012) and the multilayer modularity approach of Bassett et al. (2011). Our analysis follows a typical sliding-window dFC pipeline based on a number of recent studies (e.g. Allen et al., 2012, Elton and Gao, 2015, Sakoğlu et al., 2010). Our findings show that group-level differences in neural autocorrelation, HRF shape, connectivity strength, measurement noise, number of FC states and frequencies of FC state transitions can lead to systematic differences in observed dFC between simulated fMRI time-series data.
Methods: simulation framework
To demonstrate some of the issues associated with assessing dFC in heterogeneous samples, we developed a simulation framework based on Allen et al. (2012). For each type of heterogeneity, we report the effects of changing only one parameter at a time, in order to isolate its relative impact on the analysis, though we consider interactions between parameters in the Supplementary Material. Thus, each source of heterogeneity corresponded to a change in a single step of the data generation process. To illustrate the effects of neural autocorrelation, HRF, connectivity strength and measurement noise, it was sufficient to simulate data from a model with only four regions of interest (ROIs) - Base Simulation 1. In order to analyse the impact of changes in both number of FC states and frequency of FC state transitions, we increased the number of ROIs to 32 to allow for a greater variety of states - Base Simulation 2. We now describe the simulation framework in detail, illustrated schematically in Fig. 1, before outlining both Base Simulations and the specific variations for each source of heterogeneity.
The generic simulation framework used to generate fMRI data for a brain consisting of four regions. State i is denoted by Si and region j is denote by Rj. Two brain regions are connected at any given time if they are in the same module, which are distinguished here by colours. These panels show data for 40 TRs: the first 20 TRs are spent in S1 while the last 20 TRs are spent in S2.

We characterised FC states as time periods in which regions are partitioned into “modules”. For convenience, we denote state i as Si and region j as Rj. To simulate fMRI data for an individual, we first generated a FC state sequence to describe the changes in functional connectivity. This process is outlined in detail in the Base Simulations below. We used a sampling rate of TR = 2 s and generated binary neural event sequences of length T = 360 TRs for each module and each region. The module-specific event sequences drive the connectivity structure: a module-specific event is one which occurs for all regions within that module in the current FC state. In contrast, region-specific events are those which occur for single regions only, independent of other regions, and thus correspond to neural noise.
More precisely, a module-specific event occurred in a module at an individual time point with probability Pmod=0.5, independent of all other modules and time points. If a module has an event at time t, all regions within that module at time t have an event. For each region, we then superimposed a region-specific neural event sequence. A region-specific event occurred in a region at an individual time point with probability Preg=0.5, independent of all other regions and time points (except in the Neural autocorrelation simulation, where we explored the effect of autocorrelated region-specific events). We fixed the amplitude of region-specific events to be areg=1, and set the amplitude of the module-specific events to be amod=2. The full event sequences were then convolved with a haemodynamic response function (HRF; kernel length = 16 TRs) using the SPM12 software (http://www.fil.ion.ucl.ac.uk/spm) to produce fMRI-like time series, which were rescaled to have a SD of 1. White noise with SD σnoise=0.2 was then added. Finally, a high-pass Butterworth filter removing frequencies below 0.033 Hz was applied. This is based on the rule of thumb given by Leonardi and Ville (2015) which recommends removing frequency components below 1/w, where w is the window length in the sliding-window analysis.
Base Simulation 1 (4 ROIs)
In this setting, which corresponds to the framework illustrated in Fig. 1, we restricted dynamics to two FC states, S1 and S2. S1 corresponded to the partition {1,1,2,1}, so that R1, R2, R4 were grouped into module 1, and R3 was grouped by itself into module 2, while S2 corresponded to the partition {1,1,2,2}. We fixed the FC state sequence such that each individual spent half of the time in S1 and then transitioned to S2. This allowed for the comparison of dFC between three types of region pairs: connected (within-module e.g. R1-R2), unconnected (between-module e.g. R1-R3), and a dynamic connection (within-module to between-module e.g. R1-R4). We then generated fMRI-like data using the simulation framework described above.
Base Simulation 2 (32 ROIs)
In this setting, we generated a total of 9 FC states, each consisting of a partition of the 32 ROIs into exactly 5 modules. For each FC state we generated a module label for each region from the numbers {1,…,5} uniformly. If a FC state did not contain all 5 modules, we repeated this process. To ensure that no two FC states were too similar to each other, we computed the normalised mutual information (NMI) between each pair of state vectors, repeating the whole process if the maximum pairwise NMI exceeded 0.5. For each individual, we generated a random sequence of FC states under the assumption that a brain remained in a FC state for a fixed period of time before switching to any other FC state. Each FC state thus lasted a quarter of the total time period if three FC state transitions were specified, or half of the period if just one FC state transition was specified. We then generated fMRI-like data using the simulation framework described above.
Methods: specific simulations
For the first four simulations described here, we used Base Simulation 1 to generate the data. To measure dFC, we applied a sliding-window analysis. We used a tapered-cosine (Tukey) window of width w = 30 TRs with a total taper section of length 15 TRs. We slid the windows one time point at each step, yielding a total of 331 windows. We calculated pairwise Fisher-transformed Pearson correlation for each window and for each pair of regions. We then computed the SD of the time series of correlation values between each pair of regions. This measure is commonly defined as a proxy for dynamic functional connectivity. We also used the variance and the interquartile range of the correlation time series as alternative measures of dFC but these did not produce materially different results. For each set of parameters, we simulated 100 replicates in order to account for the randomness inherent in the data generation and also to assess the variability of our measure of dFC.
Neural autocorrelation
To investigate the effect of varying neural autocorrelation on the analysis of dFC, we used Base Simulation 1. To control the neural autocorrelation, we varied the generation of the region-specific neural event sequences. We modelled the binary sequences as Markov chains dependent on two parameters: the equilibrium probability of an event πreg and the lag-1 autocorrelation ρreg. The default value in Base Simulation 1 is the special case of πreg=0.5 and ρreg=0, indicating no autocorrelation, and is the value used in later simulations. For the purposes of this simulation, we kept the equilibrium probability of an event fixed at πreg=0.5, thus ensuring that the expected number of events was constant at 180. We generated data for ρreg=-0.8,-0.7,…,0.8 with the remainder of the simulation as described in Methods: simulation framework. We also performed this analysis with a range of values of Pmod, Preg, amod, and σnoise and after prewhitening the data - see Figs. S5 and S8 respectively.
Note that this region-specific signal can be considered a source of noise (as opposed to the module-specific events that drive the connectivity “signals”). The autocorrelation in this neural noise contributes to the temporal autocorrelation observed in the fMRI time series, which, once combined with the white noise measurement noise below, produces the “AR(1)+white noise” that characterises fMRI noise (at least after high-pass filtering; Friston et al., 2000). Nonetheless, in real fMRI data, there are other sources of coloured noise, such as those induced by respiratory and cardiac signals, and by head-movement (see e.g. Woolrich et al., 2001), which could also differ across groups.
Haemodynamic response function
To demonstrate the impact of the HRF on dFC, we generated data with various HRFs, two of which are shown in Fig. 2. We varied two of the HRF parameters, the dispersion of peak response (the width of the initial peak) and the delay of response, while the other 5 HRF parameters were held constant. In this simulation, we generated data using each HRF with peak dispersion σHRF=0.6,0.8,…,2.4 and response delay τHRF=5,5.2,…,9 s.

Connectivity strength
In our simulation framework, neural noise corresponds to the neural events that do not contribute to the connectivity structure. In other words, neural noise corresponds to the region-specific neural events, while the module-specific events are those which drive the connectivity structure. To investigate how connectivity strength affects dFC estimation, we varied the amplitude amod of the module-specific events relative to the region-specific events, which we fixed to have amplitude areg=1. In this simulation, we generated data for amod=0.5,1,…,5. We also performed this analysis with a range of values of Pmod, Preg, ρreg, and σnoise - see Fig. S6.
Measurement noise
To investigate how measurement noise affects dFC estimation, we varied the amount of white noise added to the fMRI-like time series. We generated data with white noise of standard deviation σnoise=0,0.1,…,2.5 times that of the signal. We also performed this analysis with a range of values of Pmod, Preg, ρreg, and amod - see Fig. S7.
k-means
A key assumption in many FC state-based methods is that there is a common set of FC states across individuals. In particular, it is assumed that, at rest, different participants transition within a comparable set of brain states. In this simulation, we investigated what happens when this assumption does not hold. Specifically, we simulated 32-ROI fMRI data, using Base Simulation 2, from two groups of 100 people: group 1 could visit 9 distinct FC states, and group 2 could only visit 6 of these 9 FC states. Individuals in both groups experienced 3 FC state transitions in the course of a scan. FC state sequences for both groups were thus of length 4, with sequences for group 2 restricted to the FC states {1,…,6}. For group 2, individuals were equally likely to be in each of the first six FC states. For group 1, in contrast, individuals were twice as likely to be in FC states 7, 8 and 9 than in the first six FC states. The remainder of the simulation for each individual then followed the generic simulation framework.
We estimated correlation matrices for each position of the sliding-window analysis, as in the previous simulations, to produce 331 (Fisher-transformed) correlation matrices of size 32×32. The upper triangular part of this correlation matrix was vectorised to yield 331 correlation vectors of length 496 per subject. The k-means clustering was then performed on the set of all these vectors, pooled across subjects, with the ℓ2 norm as distance measure. Centroids were initialised using the k-means++ algorithm in Matlab and analysis was repeated 40 times with different initial centroids to avoid sub-optimal clusterings. We investigated the performance of the clustering with k=1,…,12 for the two groups separately and combined.
For each k, the algorithm returned a sequence of FC state labels for each subject, and the centroid of each of the k FC states. As the recovered FC state labels are arbitrary, the labels do not necessarily match those of the true FC states. While one can permute the recovered FC state labels to maximise the overlap with the true FC state sequence, this becomes more difficult with multiple subjects. Additionally, a simple relabelling does not take into account that incorrectly labelled FC states are not equally wrong. For example, if in two distinct windows, a subject is in FC state 1, but a k-means analysis recovers FC states 2 and 3 respectively (after relabelling), it may be the case that FC state 2 is closer to FC state 1 than FC state 3.
To circumvent the mislabelling and to enable comparisons of performance across different values of k, we replaced FC state labels by correlation matrices. For the recovered FC state sequences, we used the corresponding FC state centroid as calculated by the k-means algorithm. For the true sequences, we replaced a FC state label by a ‘true’ correlation matrix for that state. Recall that, in the 360 TRs simulated, a brain experienced 3 FC state transitions so that each FC state lasted 90 TRs. For each FC state, we first calculated the correlation matrix for each window of width 90 TRs in which all time points are in that FC state. We then took the ‘true’ correlation matrix as the average of all the corresponding correlation matrices in the same FC state across all subjects.
At each time point, we then computed the centroid error as the ℓ2 distance from the ‘true’ correlation matrix to the centroid of the recovered FC state at that time. We thus used two measures of performance: the average number of detected FC state changes across subjects, and the mean centroid error across all time points and subjects.
Note that, in typical task-free fMRI analyses, we do not know the ground truth. In our simulation, however, we assumed that the states of all participants were drawn from a larger common pool of states. The k-means algorithm identifies comparable connectivity patterns and groups them into states. It does not take into account the order in which the states occur for an individual and so a state can occur at different times for different participants. This allowed us to cluster across individuals and time points.
Multilayer modularity
In many studies of dFC, a question of interest is whether groups differ in the degree to which connections between regions are static versus dynamic. Here, we examined how the frequency of state transitions could be detected using a multilayer modularity algorithm, and how the choice of parameters affected these results. To this end, we simulated 32-ROI data, using Base Simulation 2, from two groups of 50 people: individuals in group 1 experienced 3 FC state transitions, and group 2 experienced just 1 FC state transition in the course of a scan. FC state sequences for group 1 were thus of length 4, while FC state sequences for group 2 were of length 2. All individuals could visit the same 9 FC states. The remainder of the simulation for each individual then followed the framework described in Methods: simulation framework. We applied the same sliding-window analysis as in the previous simulations, again using a window of width w = 30 TRs, calculating pairwise (Fisher-transformed) Pearson correlation for each window and for each pair of regions. In this case, however, we slid windows in steps of 30 TRs (instead of 1 TR) resulting 12 non-overlapping windows of width 30 TRs. This is based on the multilayer-modularity approach used by Bassett et al. (2011).
In the multilayer-modularity approach, the brain is characterised as a multilayer network with nodes corresponding to brain regions in different windows, or “layers”. The non-overlapping sliding-window analysis yields a correlation matrix A with Aijl corresponding to the correlation between regions Ri and Rj in window l. For each partitioning of regions into modules, the following multilayer modularity index is defined as a measure of the quality of the partition:Q=12μ∑i=1Nreg∑j=1Nreg∑l=1Nwin∑r=1Nwin[(Aijl−γkilkjl2ml)δl,r+ωδi,jδl−r,1]δ(gil,gjr),
where γ and ω are hyperparameters, Nreg is the number of regions, Nwin is the number of windows, gil is the module assignment of region Ri in window l, kil=∑jAijl, 2ml=∑ijAijl, and 2μ=∑jl(kjl+∑rωδl−r,1). The δ function is defined such that δi,j≡δ(i,j)=1 if i = j and is equal to 0 otherwise. In our simulation, Nreg=32 and Nwin=12. The regions can then be partitioned into modules by attempting to maximise the modularity index Q using a generalised Louvain algorithm (Mucha et al., 2010).
We investigated the effect of varying the hyperparameters γ and ω on the accuracy of the subsequent partitions. Broadly speaking, γ controls the resolution of the partitioning within layers so that a high value of γ encourages regions to be grouped into smaller modules, thus increasing the total number of modules. On the other hand, ω influences the ‘stickiness’ of module assignments between consecutive layers. Thus, a high value of ω encourages fewer module changes for an individual region.
To assess the performance of the algorithm, we used two error measures. Firstly, we created “incidence matrices” of size 32×32×12, which contained an entry of 1 if the corresponding pair of regions had the same module label during a given window, and 0 otherwise. We then calculated the error in connectivity structure as the Hamming distance of the recovered incidence matrix (based on the multilayer modularity partitioning) to the true incidence matrix (based on the original module structure) at that time. The Hamming distance between two matrices of the same size is given by the number of elements at which the matrices differ. Secondly, we computed the mean flexibility for each subject, as defined by Bassett et al. (2011). The flexibility of a region is calculated by the number of times the region changes module assignment divided by the total possible number of module changes. The mean flexibility is then given by the mean region flexibility over all 32 regions. Note that the expected mean flexibility for an individual in G2 in our simulation is approximately 4/55 (1 state change out of a possible 11 with the probability of a region changing module at a state change of approximately 0.8), compared to 12/55 for an individual in G1. For each subject, we ran the algorithm for γ=0.75,1,…,2.5 and ω=0.25,0.5,…,4.5.
Results: issues and limitations
We first investigate whether unaccounted heterogeneity can impact estimates of dFC. Here we examine four sources of heterogeneity: individual differences in 1) neural autocorrelation, 2) shape of the HRF, 3) connectivity strength and 4) measurement noise. We consider a simple, yet common, measure of dynamic connectivity, namely the standard deviation (SD) of correlation values across sliding windows. We calculated this measure for three types of true connectivity: 1) static, positive connections between regions within the same module, 2) static, zero connections between regions in different modules, and 3) dynamic changes between positive and zero connections when a region switched from being in the same module to being in a different module as another region (see Methods: simulation framework). As expected, across all the simulations, estimated dFC for the dynamic connections is higher than the estimated dFC for both types of static connections.
Neural autocorrelation
In this simulation, we investigated the association between neural autocorrelation and estimated dFC, making sure that differences in neural autocorrelation were not associated with differences in ‘true’ dynamic connectivity. This was achieved by varying the autocorrelation ρreg of region-specific events but keeping the autocorrelation of the module-specific events fixed at zero. Recall that region-specific events are generated independently of the connectivity structure so, under our simulation framework, changing them should have no effect on dFC. Note that the underlying dFC structure is held constant across all iterations. We estimated dFC for three types of connection: a static, positive connection, a static zero connection, and a dynamic connection (from positive to zero) for ρreg=−0.8,−0.6,…,0.8.
Fig. 3 illustrates the impact of neural autocorrelation on estimated dFC. Although Fig. 3b shows that estimated dFC for the unconnected regions remains unaffected by changes in neural autocorrelation, Fig. 3a demonstrates that estimated dFC between the positively connected regions is higher as neural autocorrelation increases. In contrast, Fig. 3c shows that estimated dFC decreased between the dynamically-connected regions as the autocorrelation increased. In other words, the ability to detect a difference in dFC between dynamically connected and statically connected regions decreases with higher levels of neural autocorrelation. These observations are supported by the multiple regression in Fig. 3d: neural autocorrelation had a statistically significant effect on estimated dFC for the positively connected and dynamically connected regions, but not for the unconnected regions. These results suggest that observed dFC may vary substantially due to differences in neural autocorrelation, even though the true dFC was identical across individuals.
The impact of neural autocorrelation on estimated dFC, measured by the SD of the correlation time series, between (a) statically connected regions (within-module), (b) unconnected regions (between-module), and (c) dynamically connected regions (module change), with (d) the results of a multiple regression. Ri refers to Region i. Region pair R1-R4 has a dynamic connection so the true dFC should be higher than region pairs R1-R2 and R1-R3, which have a static connection. Estimated dFC between statically connected (within-module) regions increased with neural autocorrelation, while estimated dFC for dynamically connected regions (module change) decreased with increased neural autocorrelation. The neural autocorrelation is varied independently of the underlying connectivity structure, so changing it should have no effect on dFC. The multiple regression assesses the impact of neural autocorrelation on estimated dFC with a statistically significant effect indicated by a 95% confidence interval (CI) in bold type. The solid lines in (a) – (c) correspond to the fitted values of the multiple regression.

Haemodynamic response function
It has been shown that the haemodynamic response function (HRF) varies between different ages (Huettel et al., 2001, Aizenstein et al., 2004, D'Esposito et al., 1999) and disease states (Hanlon et al., 2016). To illustrate the effect of the HRF on observed dFC, we altered the HRF by varying the dispersion of the response σHRF and the delay of response τHRF. Note that the underlying dFC structure is held constant across all iterations. To measure dFC, we calculated the mean SD of the sliding-window correlation time series for the three types of region pairs (positively connected, unconnected, and dynamically connected) for σHRF=0.6,0.8,…,2.4 and τHRF=5,5.2,…,9 s.
Figs. 4a and 4b show that a more temporally dispersed HRF (as often observed for older individuals) resulted in increased dFC between regions, when in truth the connectivity remained constant. Increases in both the dispersion of peak response σHRF and the delay of response τHRF resulted in increased estimated dFC. Fig. 4c shows a similar effect for regions with a dynamic connection though, in this case, the increase was not as marked. Fig. 4d compares the observed dFC for two of these HRFs. Individuals in Group 1 had a HRF with peak dispersion 1 and a response delay of 6s (purple line in Fig. 2), which might represent a younger sample. In contrast, individuals in Group 2 had a HRF with peak dispersion 2 and a response delay of 8s (yellow line in Fig. 2), which might represent an older sample. For the static connections we see that, even though the true dFC was the same across groups, the observed dFC varied substantially between groups due to the shape of the HRF. These observations are supported by the multiple regression results in Fig. 4e: both the dispersion of peak response σHRF and the delay of response τHRF had statistically significant effects on estimated dFC for the statically connected regions, but only the delay of response τHRF had a statistically significant impact for the dynamically connected regions.
The impact of HRF shape on estimated dFC, measured by the SD of the correlation time series, between (a) statically connected regions (within-module), (b) unconnected regions (between-module), and (c) dynamically connected regions (module change), with (d) a comparison of estimated dFC for two groups and (e) a summary of results. Ri refers to Region i. The HRF was altered by varying two parameters: the dispersion of response σHRF and the delay of response τHRF. Individuals in G1 had an HRF with σHRF=1 and τHRF=6 while individuals in G2 had an HRF with σHRF=2 and τHRF=8. Each square in (a)–(c) corresponds to the mean (across individuals) estimated dFC for a pair of parameter values of (σHRF,τHRF), with yellow indicating higher dFC. The dashed lines in (a) – (c) indicate the parameter values of the two groups compared in (d), whose HRFs are shown in Fig. 3. A more dispersed HRF increased estimated dFC between all three types of region pairs, despite all individuals having identical dFC structure. We observed higher variability in dFC for the statically connected regions (within-module) as the dispersion and delay of the response increased. We saw a similar effect for the dynamically connected regions (module change), though this was less pronounced. While the box plots in (d) illustrate a single comparison of two groups, the multiple regression results in (e) summarise the impact of the two HRF parameters on estimated dFC with a statistically significant effect indicated by a 95% confidence interval (CI) in bold type.

Connectivity strength
In our simulation framework, connectivity strength corresponds to the amplitude of the module-specific events, since these drive the connectivity structure. In this simulation, we investigated the association between connectivity strength and estimated dFC. This was done by varying the amplitude of module-specific events amod while keeping the amplitude of the region-specific events fixed (i.e. the size of the connectivity “signal” versus region-specific neural “noise”). Note that the underlying dFC structure is held constant across all iterations. To measure dFC, we calculated the mean SD of the sliding-window correlation time series for the three types of region pairs (positively connected, unconnected, and dynamically connected) for amod=0.5,1,…,5.
Fig. 5 illustrates the impact of connectivity strength on estimated dFC. As expected, Fig. 5b shows that estimated dFC for the unconnected regions remains unaffected by changes in connectivity strength. However, Fig. 5a demonstrates that estimated dFC between the positively connected regions is moderately lower as connectivity strength increases. In contrast, Fig. 5c shows that estimated dFC increased between the dynamically-connected regions as connectivity strength increased. In particular, the ability to detect a difference in dFC between dynamically connected and statically connected regions decreases with lower connectivity strength. These observations are supported by the multiple regression results in Fig. 5d: connectivity strength had a statistically significant effect on estimated dFC for the positively connected and dynamically connected regions, but not for the unconnected regions. These results suggest that observed dFC may vary substantially due to differences in connectivity strength, even though the true dFC was identical across individuals.
The impact of connectivity strength on estimated dFC, measured by the SD of the correlation time series, between (a) statically connected regions (within-module), (b) unconnected regions (between-module), and (c) dynamically connected regions (module change), with (d) the results of a multiple regression. Ri refers to Region i. Region pair R1-R4 has a dynamic connection so the true dFC should be higher than region pairs R1-R2 and R1-R3, which have a static connection. Increased amplitude of module-specific events resulted in decreased observed dFC for the positive static connected pair (within-module) but increased dFC for the dynamically connected pair (module change). The effect on the unconnected pair (between-module) was small. The connectivity strength, corresponding to the amplitude of module-specific events, is varied independently of the underlying connectivity structure, so changing it should have no effect on dFC. The multiple regression assesses the impact of connectivity strength on estimated dFC with a statistically significant effect indicated by a 95% confidence interval (CI) in bold type. The solid lines in (a) – (c) correspond to the fitted values of the multiple regression.

Measurement noise
We also investigated the effects of varying amounts of measurement noise on observed connectivity dynamics by generating data with white noise of SD σnoise=0,0.1,…,2.5. Recall that the signal was rescaled to have SD 1, resulting in noise-to-signal ratios equal to σnoise (ignoring neural noise).
Fig. 6 shows the effect of varying measurement noise on the standard deviation of correlation. Figs. 6a–c show that for all three types of region pair, increasing the amount of measurement noise resulted in decreased observed dFC. This is supported by the multiple regression results in Fig. 6d: measurement noise had a statistically significant effect on estimated dFC for all three connection types. Thus, noisier data resulted in lower estimated dFC even for the pair of ROIs that experienced a true change in connectivity. When the amplitude reached a certain threshold (σnoise>2.0), the white noise dominated the underlying fMRI signal, resulting in a mean dFC of 0.2 for all three types of connectivity.
The impact of measurement noise on estimated dFC, measured by the SD of the correlation time series, between (a) statically connected regions (within-module), (b) unconnected regions (between-module), and (c) dynamically connected regions (module change), with (d) the results of a multiple regression. Ri refers to Region i. Region pair R1-R4 has a dynamic connection so the true dFC should be higher than region pairs R1-R2 and R1-R3, which have a static connection. Increased measurement noise resulted in decreased observed dFC for the three types of region pairs. This effect was particularly pronounced for the dynamically connected regions (module change). The multiple regression assesses the impact of measurement noise on estimated dFC with a statistically significant effect indicated by a 95% confidence interval (CI) in bold type. The solid lines in (a) – (c) correspond to the fitted values of the multiple regression.

In the Supplementary Material, we demonstrate that the effects of neural autocorrelation, connectivity strength and measurement noise on estimated dFC persist across a range of parameter values (see Figs. S5–S7 respectively).
k-means
In the previous sections, we demonstrated that group differences in connectivity dynamics, as measured with simple sliding window approaches, can be due to other factors such neural autocorrelation. However, even when such differences do not exist, some common dFC methods may still detect artifactual group differences in dFC owing to unaccounted heterogeneity in the dynamic connectivity structure.
To investigate the effect of heterogeneity in the number of FC states attainable, we generated a set of data for two groups of 50 individuals: those in G1 could reach 9 FC states, and those in G2 could reach only 6 of these 9 FC states (see k-means in Methods: specific simulations). If these FC states can be recovered accurately from the data, then a simple measure of dFC is the number of FC state transitions that occur. Importantly, in the simulations, the number of such transitions was identical across groups, namely three.
We used a k-means cluster analysis on the correlation matrices in an attempt to recover the underlying FC states. Fig. 7 illustrates the performance of the k-means analysis for values of the hyperparameter k=1,…,12. We ran the analysis for the two groups separately, and also for all 100 individuals together. Here, we perform the analysis with sliding-window width w = 30, though window widths w=60,90,120 TRs, yielded broadly similar results (see Figs. S1–S3). Further, Fig. S4 shows that, when k is correctly estimated or only slightly misspecified, it becomes more difficult to estimate the states correctly as window length increases.
The true number of FC state transitions in this simulation is 3. Individuals in G1 could reach 9 FC states while individuals in G2 could reach only 6 of these 9 FC states. (a) When k<9, the combined analyses underestimated the number of FC state transitions for G1 (solid yellow line), while the number of FC state transitions for G2 (solid purple line) was recovered more accurately. (b) The separate analysis showed an improvement in the error in number of FC state transitions for G1 (dotted yellow line) when k was underestimated. (c) Unless k was correctly estimated, the combined analysis yielded an incorrect group difference in number of FC state transitions. (d) For the separate analysis, incorrect group differences were only found when k was overestimated or grossly misspecified for either group. (e) In terms of recovered space-time connectivity structure, the combined analysis performed better for G2 (solid purple line) than G1 (solid yellow line) when k was underestimated. (f) If k was correctly specified or only slightly misspecified for either group, the separate analyses had a similar error in recovered connectivity structure. Asterisks in (a) – (d) indicate a statistically significant (p<0.05) difference from zero, according to a Wilcoxon signed-rank test, while asterisks in (e) – (f) indicate a statistically significant (p<0.05) difference between the two groups, according to a two-sample Wilcoxon rank-sum test.

Fig. 7a shows that when k<9, the typical (combined) k-means analysis underestimated the number of FC state transitions for G1, while the number of FC state transitions for G2 was recovered more accurately. Unless the correct value of k was estimated, Fig. 7c shows that the combined analysis leads to artifactual differences in dFC between groups. One might think that fitting the two groups separately would solve this problem. Fig. 7b demonstrates a modest improvement in the error in number of FC state transitions for G1 (yellow line) when k was underestimated, but a steep increase in error for G2 when k was overestimated. This deterioration when k>6 is to be expected because the k-means algorithm had to identify more FC states than are actually present in the G2. We see in Fig. 7d that for the separate analysis, incorrect group differences were again found when k was overestimated or grossly misspecified for either group.
Figs. 7e and 7f illustrate the differences between the groups for both analyses in mean centroid error, which is a measure of how well the complete space-time connectivity structure is recovered. In this case, the combined analysis performed better for G2 than G1 when k was underestimated (k<9), indicating that the recovered centroids were biased towards the 6 joint states. In contrast, if k was correctly specified or only slightly misspecified for either group, the separate analyses had a similar error in recovered connectivity structure. While these results suggest that the separate analyses did yield some improvement on the combined analysis, we caution that the problem of estimating k for both groups still needs to be addressed. As shown in Fig. 7c and 7d, without an accurate estimation of k, one is likely to incorrectly infer the size of differences in dFC across groups, even for cases where there is no true difference. A common method for estimating the number of clusters, k, is the elbow plot shown in Fig. 8. This demonstrates that it is not always straightforward to estimate k accurately: while we see a clear elbow for the separate analysis of G2 (purple dotted line), there is no obvious elbow for the separate analysis of G1 (yellow dotted line) or the combined analysis (solid black line).
The RSS plots for the separate analysis of G2 has a clear elbow at k = 6 but there is no obvious elbow for the separate analysis of G1 or the combined analysis of both groups.

Multilayer modularity
While the k-means method represents an approach which aggregates data across subjects in order to glean information about functional connectivity dynamics, other methods analyse fMRI data on a subject-by-subject basis. For example, the multilayer modularity approach (Bassett et al., 2011) characterises the correlation matrices for a single subject obtained from a sliding-window analysis as a multi-layered network. Each region in each window is assigned a module label by maximising a modularity index which depends on two hyperparameters γ and ω (see Multilayer modularity in Methods: specific simulations for details).
In this simulation, we again generated data for two groups of 50 individuals: those in G1 experienced one FC state transition while those in G2 experienced three FC state transitions. We applied the multilayer algorithm for all 100 individuals for γ=0.75,1,…,2.5 and ω=0.25,0.5,…,4. One measure of dFC in this approach is the mean “flexibility” of each brain region (see Multilayer modularity in Methods: specific simulations). Thus, we now simulated a true group difference in the number of FC state transitions, and examined how accurately the FC states were recovered as a function of the methods hyperparameters. We also examined the error in the true vs estimated mean flexibility for each group.
Fig. 9 illustrates the importance of parameter selection for the multilayer modularity approach. Figs. 9a and 9c show that, in terms of the complete space-time connectivity dynamics, the optimal value for ω differed between the two groups. A lower value for ω generally resulted in more changes in module assignment across consecutive time windows. Since an individual in G1 experienced fewer FC state transitions, brain regions had fewer changes in module assignment across the course of the scan. Thus for G1, a higher value for ω was more effective in recovering the spatio-temporal connectivity structure.
The effect of varying parameters γ and ω on performance of the multilayer modularity approach. Individuals in G1 experience 1 state transitions while individuals in G2 experienced 3 state transitions. Percentage error in connectivity structure is defined as the percentage of entries in the recovered incidence matrix that are equal to the corresponding entries of the true incidence matrix. A red cross indicates the pair of parameter values (γ,ω) which minimised the mean (across individuals) percentage error in connectivity structure. The flexibility of a region is defined as the number of times the region changes module assignment divided by the total possible number of module changes. (a) For G1, the optimal parameter values for recovering the connectivity structure were γ=1.25,ω=2. (b) Increasing ω resulted in decreased recovered flexibility for G1. (c) For G2, the optimal parameter values for recovering the connectivity structure were γ=1.5,ω=1. Thus, the optimal value for ω was markedly lower for G2 than for G1 while the optimal value for γ was slightly higher for G2 compared to G1. (d) Increasing ω also resulted in decreased recovered flexibility for G2, but at a greater rate than for G1.

We note that the optimal value for γ appears to be broadly the same for both groups. The parameter γ influences the resolution of the recovered network. A higher value for γ partitions the brain regions into more modules. Since ROIs were always partitioned into 5 modules for both groups, we would not expect the optimal value of γ to differ between the groups.
Figs. 9b and 9d show that larger values of ω yield higher recovered mean flexibility for both G1 and G2. This effect, however, does not occur at the same rate for both groups. Fig. 10 shows that different values of γ and ω result in different group differences in mean flexibility, to the extent that G1 is incorrectly found to be more flexible for some values of the hyperparameters. Note that the true difference in flexibility was approximately 0.14 (see Multilayer modularity in Methods: specific simulations) and this was not captured for any values of γ and ω. This suggests that caution should be taken when computing group differences, especially when an assumption of homogeneity is made.
The effect of varying parameters γ and ω in the multilayer modularity approach on the recovered group difference in mean (across regions) flexibility. Individuals in G1 experience 1 state transitions while individuals in G2 experienced 3 state transitions. The flexibility of a region is defined as the number of times the region changes module assignment divided by the total possible number of module changes. Different values of γ and ω yield different group differences in mean flexibility. Note that the true difference here is 0.14, which is not attained for any values of the parameters.

Discussion
In this article, we have illustrated some of the limitations of current dFC methods when dealing with heterogeneity. We used a generic simulation framework to isolate various sources of heterogeneity, and showed that observed connectivity dynamics may be due to factors other than true changes in connectivity.
To investigate the effects of individual differences in neural autocorrelation, HRF shape, connectivity strength and measurement noise, we used the SD of correlation values across sliding windows as our measure of dFC. We calculated this measure for three types of connectivity: static, positively connected; static, unconnected; and dynamically connected (positively connected to unconnected). Increased neural autocorrelation resulted in higher dFC for statically connected regions but lower dFC for dynamically connected regions. A more temporally dispersed HRF produced higher dFC for all three connectivity types. In contrast, increased measurement noise yielded lower dFC across the three types of connectivity. Increased connectivity strength resulted in higher dFC for the dynamically connected regions but lower dFC for the positively statically connected regions. Together, these findings demonstrate that individual differences in dFC can be caused by various properties of the fMRI signal that are unrelated to the underlying neural connectivity dynamics.
We also demonstrated that common dFC methods may detect artifactual group differences in dynamic connectivity due to the assumptions that are made. For example, in a k-means analysis, it is often assumed that all individuals may attain the same set of FC states. If the hyperparameter k is incorrectly estimated, an incorrect group difference in the number of FC state transitions experienced may be detected if one group can attain more FC states than the other group. On the other hand, if the correct k is specified, the recovery of the number of FC state transitions is highly accurate. This suggests that it is the k-means clustering operation itself (rather than the correlation measure or windowing procedure) that leads to the observed biases when inhomogeneous groups are compared. In particular, the biases are likely due to the tendency of the k-means algorithm to detect the most prevalent states. We note that these issues could in principle affect any FC state-based method which assumes homogeneity in attainable FC states across individuals.
More generally, care should be taken with any method that requires the selection of hyperparameters by the user. In particular, we demonstrated that group differences in mean flexibility detected by a multilayer modularity approach were regulated by the choice of hyperparameters. While one would expect individual-based methods such as the multilayer modularity approach to be more robust to heterogeneity, spurious group differences can nonetheless be found if hyperparameters are assumed constant across individuals.
One could attempt to optimise the choice of hyperparameters using the data. For example, in a k-means analysis, the number of clusters, k, could be estimated by a Variational Bayesian approach (Ghahramani et al., 1999). For the multilayer modularity approach, one could use cross-validation across independent scans in an attempt to maximise stability of the recovered connectivity structure, though this assumes that dynamics are invariant across scans on the same individual. Alternatively, Bassett et al. (2013a) suggest choosing the values of γ and ω which yield connectivity structure that is most different from particular null models. Hyperparameter optimisation could be investigated in future work, though our point is that such optimisation should allow for heterogeneity across individuals.
We focused on a number of likely sources of heterogeneity in fMRI signals, using effects of age to illustrate some of our examples. This is based on recent evidence of group differences in signal autocorrelation (Geerligs et al., 2016, Arbabshirani et al., 2014a), HRF shape (Hanlon et al., 2016, Huettel et al., 2001, Aizenstein et al., 2004, D'Esposito et al., 1999), and non-neural physiological noise levels (Geerligs et al., 2015, Mark et al., 2015). Nevertheless, our findings apply in any situation where such heterogeneity may arise between individuals. Furthermore, there may be other sources of heterogeneity not investigated here that could have spurious effects on observed dFC. For example, we only considered variability in 2 out of the 7 HRF parameters; it is plausible that the remaining parameters also have an effect on estimated dFC. Similarly, we assumed that brain regions partition into 5 modules in each FC state, whereas it is conceivable that this could differ among individuals.
We note that certain aspects of the simulation framework represent simplifications of the physical and physiological processes involved in fMRI neuroimaging. For example, the addition of white measurement noise is not realistic and noise related to head motion or vascular effects may have more regionally specific effects on connectivity estimates (Power et al., 2012). Therefore the noise simulation should be interpreted as a cautionary result, and not as an illustration of effects of real noise sources in fMRI. We also assumed that the HRF is the same for all regions, although the shape of the HRF has been shown to vary from region to region (e.g. Schacter et al., 1997). These assumptions, however, allowed us to isolate the impact of unaccounted heterogeneity. In particular, our current simulation framework had the distinct benefit of separating the underlying dFC structure from sources of heterogeneity such as neural autocorrelation, noise and HRF shape. We were thus able to manipulate dFC and these other sources of variation independently and show how observed dFC is affected. Further, in the Supplementary Material, we demonstrate that the effects of unrelated sources of variation persist across a range of parameter values, suggesting that these effects of dFC are not specific to this simulated dataset, but are in fact a more general phenomenon.
In this article, we do not address what drives these changes in functional connectivity. Recent observations suggest that dFC can be explained in terms of sparse brief events (Allan et al., 2015, Karahanoğlu and van de Ville, 2015, Liu and Duyn, 2013, Tagliazucchi et al., 2012). Our simulation framework is based on work by Allen et al. (2012) which attempts to find periods of recurrent patterns of functional connectivity, or FC states, across time and individuals. It may be that these periods are longer than the underlying neural processes due to the temporal limitations of fMRI. Many FC state-based studies work on the basis of certain assumptions about FC states regarding, for example, their discreteness and typical duration. Generalisations of these approaches exist: Yaesoubi et al. (2015) separate FC states across distinct frequency profiles, while Miller et al. (2016) characterises functional connectivity using meta-state analysis, which allows different FC states to occur simultaneously. However, little is actually known about the nature of FC states and future work is required to better understand how these neural processes drive observed FC states.
To illustrate the effects of heterogeneity in the number of attainable states and the number of state changes, we simulated binary differences between two groups of individuals. This represents a simplification since, in reality, it is likely that differences between individuals fall on a continuous spectrum and so we caution against dichotomising between groups. It should also be emphasised that while here we have isolated the impacts of different sources of heterogeneity, in reality they may appear in combination. Future work could investigate how different types of heterogeneity interact, or even counteract, to produce differences in observed dFC.
Our work does not specify precisely how the different aspects of an analysis pipeline affect artifactual differences in observed dFC. Many of the sources of heterogeneity presented in this paper are also likely to affect estimates of static functional connectivity, as measured by correlation. For example, static FC decreases with higher white noise and is modulated by HRF shape (Li et al., 2009). Therefore, connectivity estimates for each window in a dFC analysis will be affected by unmodelled heterogeneity and combine (possibly in a non-linear fashion) to influence the final dFC estimates. Recent work has aimed to disentangle changes in FC from changes in signal level (Duff et al., 2017), but an important next step is to determine the precise manner in which observed dFC is affected by the different parts of the dFC analysis pipeline, with the aim to develop methods that resolve the issues raised in this article.
Although we have chosen to illustrate the above points with only a few methods, the issues should in general extend to other approaches. For example, we used Fisher-transformed Pearson correlation as our basic measure of functional connectivity because this is currently the most commonly used metric. Alternative connectivity measures, such as coherence or multiplication of temporal derivatives (Shine et al., 2015), may be less susceptible to certain types of unaccounted heterogeneity: for example, it has been shown that coherence is robust against variability in the shape of the HRF between regions (Ashby, 2011). Nonetheless, the issues of hyperparameter selection in dFC methods still need to be addressed regardless of the connectivity metric used. With this aim, we provide the Matlab code used for the present simulations here: http://www.mrc-bsu.cam.ac.uk/software/miscellaneous-software/. We encourage interested readers to explore alternative metrics and dFC analysis methods.
Concluding remarks
We use simulated fMRI data to demonstrate the effect of various sources of heterogeneity on observed dFC. Our results show that individual differences in dFC may be due to non-dynamic features of the data. The choice of hyperparameters in common methods is also important: these are often assumed constant across individuals, which can result in spurious group differences in dFC. We recommend that future studies consider implicit assumptions of homogeneity in their analysis.
Footnotes
Appendix ASupplementary data associated with this article can be found in the online version at doi:10.1016/j.neuroimage.2017.05.065.
Acknowledgements
The authors thank Steven M. Hill for his helpful comments on a draft of this manuscript. BL, SRW and RNH are supported by the
The Cam-CAN corporate author consists of the project principal personnel: Lorraine K Tyler, Carol Brayne, Edward T Bullmore, Andrew C Calder, Rhodri Cusack, Tim Dalgleish, John Duncan, Richard N Henson, Fiona E Matthews, William D Marslen-Wilson, James B Rowe, Meredith A Shafto; Research Associates: Karen Campbell, Teresa Cheung, Simon Davis, Linda Geerligs, Rogier Kievit, Anna McCarrey, Abdur Mustafa, Darren Price, David Samu, Jason R Taylor, Matthias Treder, Kamen Tsvetanov, Janna van Belle, Nitin Williams; Research Assistants: Lauren Bates, Tina Emery, Sharon Erzinlioglu, Andrew Gadie, Sofia Gerbase, Stanimira Georgieva, Claire Hanley, Beth Parkin, David Troy; Affiliated Personnel: Tibor Auer, Marta Correia, Lu Gao, Emma Green, Rafael Henriques; Research Interviewers: Jodie Allen, Gillian Amery, Liana Amunts, Anne Barcroft, Amanda Castle, Cheryl Dias, Jonathan Dowrick, Melissa Fair, Hayley Fisher, Anna Goulding, Adarsh Grewal, Geoff Hale, Andrew Hilton, Frances Johnson, Patricia Johnston, Thea Kavanagh-Williamson, Magdalena Kwasniewska, Alison McMinn, Kim Norman, Jessica Penrose, Fiona Roby, Diane Rowland, John Sargeant, Maggie Squire, Beth Stevens, Aldabra Stoddart, Cheryl Stone, Tracy Thompson, Ozlem Yazlik; and administrative staff: Dan Barnes, Marie Dixon, Jaya Hillman, Joanne Mitchell, Laura Villis.
Appendix ASupplementary data
References
- 1. The bold hemodynamic response in healthy agingJ. Cognit. Neurosci.1652004786793[PubMed][Google Scholar]
- 2. Functional connectivity in MRI is driven by spontaneous bold eventsPLoS One1042015e0124577[PubMed][Google Scholar]
- 3. Tracking whole-brain connectivity dynamics in the resting stateCereb. Cortex2432012663676[PubMed][Google Scholar]
- 4. Arbabshirani, M.R., Castro, E., Calhoun, V.D., 2014a. Accurate classification of schizophrenia patients based on novel resting-state fMRI features. In: Proceedings of the 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp. 6691–6694, 〈https://doi.org/10.1109/EMBC.2014.6945163〉.
- 5. Impact of autocorrelation on functional connectivityNeuroImage1022014294308[PubMed][Google Scholar]
- 6.
Statistical Analysis of fMRI Data 2011MIT PressCambridge, MA - 7. Robust detection of dynamic community structure in networksChaos2312013013142[PubMed][Google Scholar]
- 8. Dynamic reconfiguration of human brain networks during learningProc. Natl. Acad. Sci.10818201176417646[PubMed][Google Scholar]
- 9. Task-based core-periphery organization of human brain dynamicsPLoS Comput. Biol.992013e1003171[PubMed][Google Scholar]
- 10. Functional connectivity in the motor cortex of resting human brain using echo-planar MRIMagn. Reson. Med.3441995537541〈http://dx.doi.org/10.1002/mrm.1910340409〉[PubMed][Google Scholar]
- 11. Complex brain networks: graph theoretical analysis of structural and functional systemsNat. Rev. Neurosci.102009186198[PubMed][Google Scholar]
- 12. Time-frequency dynamics of resting-state brain connectivity measured with fMRINeuroImage50120108198[PubMed][Google Scholar]
- 13. The effect of normal aging on the coupling of neural activity to the bold hemodynamic responseNeuroImage1011999614[PubMed][Google Scholar]
- 14. Duff, E., Makin, T., Smith, S.M., Woolrich, M.W., 2017. Disambiguating brain functional connectivity, bioRxiv. 〈https://doi.org/10.1101/103002〉.
- 15. Task-related modulation of functional connectivity variability and its behavioral correlationsHum. Brain Mapp.368201532603272[PubMed][Google Scholar]
- 16. To smooth or not to smooth?: bias and efficiency in fmri time-series analysisNeuroImage1222000196208〈https://doi.org/10.1006/nimg.2000.0609〉[PubMed][Google Scholar]
- 17. Functional connectivity and structural covariance between regions of interest can be measured more accurately using multivariate distance correlationNeuroImage13520161631〈https://doi.org/10.1016/j.neuroimage.2016.04.047〉[PubMed][Google Scholar]
- 18. State and trait components of functional connectivity: individual differences vary with mental stateJ. Neurosci.354120151394913961〈https://doi.org/10.1523/JNEUROSCI.1324-15.2015〉[PubMed][Google Scholar]
- 20. Hemodynamic response function abnormalities in schizophrenia during a multisensory detection taskHum. Brain Mapp.3722016745755[PubMed][Google Scholar]
- 21. The effects of aging upon the hemodynamic response measured by functional {MRI}NeuroImage1312001161175[PubMed][Google Scholar]
- 22. Dynamic functional connectivity: promise, issues, and interpretationsNeuroImage802013360378[PubMed][Google Scholar]
- 23. Non-stationarity in the “resting brain's” modular architecturePLoS One762012e39731[PubMed][Google Scholar]
- 24. Transient brain activity disentangles fMRI resting-state dynamics in terms of spatially and temporally overlapping networksNat. Commun.62015[Google Scholar]
- 25. A sliding time-window ica reveals spatial variability of the default mode network in timeBrain Connect.142011339347[PubMed][Google Scholar]
- 26. On spurious and real fluctuations of dynamic functional connectivity during restNeuroImage1042015430436[PubMed][Google Scholar]
- 27. Review of methods for functional brain connectivity detection using fMRIComput. Med. Imaging Graph.3322009131139[PubMed][Google Scholar]
- 28. Time-varying functional network information extracted from brief instances of spontaneous brain activityProc. Natl. Acad. Sci.11011201343924397[PubMed][Google Scholar]
- 29. Metabolic and vascular origins of the bold effect: implications for imaging pathology and resting-state brain functionJ. Magn. Reson. Imaging4222015231246[PubMed][Google Scholar]
- 30. Higher dimensional meta-state analysis reveals reduced resting fMRI connectivity dynamism in schizophrenia patientsPLoS One1132016(e0149849)[Google Scholar]
- 31. Community structure in time-dependent, multiscale, and multiplex networksScience32859802010876878[PubMed][Google Scholar]
- 32. Spurious but systematic correlations in functional connectivity MRI networks arise from subject motionNeuroimage593201221422154[PubMed][Google Scholar]
- 33. A method for evaluating dynamic functional network connectivity and task-modulation: application to schizophreniaMagn. Reson. Mater. Phys. Biol. Med.2352010351366[Google Scholar]
- 34. Late onset of anterior prefrontal activity during true and false recognition: an event-related fMRI studyNeuroImage641997259269[PubMed][Google Scholar]
- 35. Evaluation of sliding window correlation performance for characterizing dynamic functional connectivity and brain statesNeuroImage1332016111128[PubMed][Google Scholar]
- 36. Estimation of dynamic functional connectivity using multiplication of temporal derivativesNeuroImage1222015399407[PubMed][Google Scholar]
- 37. Criticality in large-scale brain fMRI dynamics unveiled by a novel point process analysisFront. Physiol.32012[Google Scholar]
- 38. Extrinsic and intrinsic brain network connectivity maintains cognition across the lifespan despite accelerated decay of regional brain activationJ. Neurosci.3611201631153126[PubMed][Google Scholar]
- 39. Temporal autocorrelation in univariate linear modeling of fMRI dataNeuroImage146200113701386[PubMed][Google Scholar]
- 40. Dynamic coherence analysis of resting fMRI data to jointly capture state-based phase, frequency, and time-domain informationNeuroImage1202015133142[PubMed][Google Scholar]