A feature-based information-theoretic approach for detecting interpretable, long-timescale pairwise interactions from time series (2404.05929v1)
Abstract: Quantifying relationships between components of a complex system is critical to understanding the rich network of interactions that characterize the behavior of the system. Traditional methods for detecting pairwise dependence of time series, such as Pearson correlation, Granger causality, and mutual information, are computed directly in the space of measured time-series values. But for systems in which interactions are mediated by statistical properties of the time series (`time-series features') over longer timescales, this approach can fail to capture the underlying dependence from limited and noisy time-series data, and can be challenging to interpret. Addressing these issues, here we introduce an information-theoretic method for detecting dependence between time series mediated by time-series features that provides interpretable insights into the nature of the interactions. Our method extracts a candidate set of time-series features from sliding windows of the source time series and assesses their role in mediating a relationship to values of the target process. Across simulations of three different generative processes, we demonstrate that our feature-based approach can outperform a traditional inference approach based on raw time-series values, especially in challenging scenarios characterized by short time-series lengths, high noise levels, and long interaction timescales. Our work introduces a new tool for inferring and interpreting feature-mediated interactions from time-series data, contributing to the broader landscape of quantitative analysis in complex systems research, with potential applications in various domains including but not limited to neuroscience, finance, climate science, and engineering.
- L. Peel, T. P. Peixoto, and M. De Domenico, Statistical inference links data and theory in network science, Nature Communications 13, 6794 (2022).
- S. Lefèvre, D. Vasquez, and C. Laugier, A survey on motion prediction and risk assessment for intelligent vehicles, ROBOMECH Journal 1, 1 (2014).
- C. W. J. Granger, Investigating causal relations by econometric models and cross-spectral methods, in Essays in Econometrics: Collected Papers of Clive W. J. Granger (Harvard University Press, USA, 2001) pp. 31–47.
- T. Schreiber, Measuring information transfer, Physical Review Letters 85, 461 (2000).
- M. Ursino, G. Ricci, and E. Magosso, Transfer entropy as a measure of brain connectivity: A critical analysis with the help of neural mass models, Frontiers in Computational Neuroscience 14, 45 (2020).
- T. Dimpfl and F. J. Peter, Using transfer entropy to measure information flows between financial markets, Studies in Nonlinear Dynamics and Econometrics 17, 85 (2013).
- Y. Li and D. E. Giles, Modelling volatility spillover effects between developed stock markets and asian emerging stock markets, International Journal of Finance & Economics 20, 155 (2015).
- M. Bonnefond, S. Kastner, and O. Jensen, Communication between brain areas based on nested oscillations, eNeuro 4, ENEURO.0153 (2017).
- J. T. Lizier, The local information dynamics of distributed computation in complex systems, 2013th ed., Springer theses (Springer, Berlin, Germany, 2012).
- T. Edinburgh, S. J. Eglen, and A. Ercole, Causality indices for bivariate time series data: A comparative review of performance, Chaos: An Interdisciplinary Journal of Nonlinear Science 31, 083111 (2021).
- R. Bellman, Dynamic programming, Science 153, 34 (1966).
- D. Donoho, High-dimensional data analysis: The curses and blessings of dimensionality, in AMS Math Challenges Lecture (2000) pp. 1–32.
- M. Staniek and K. Lehnertz, Symbolic transfer entropy, Physical Review Letters 100, 158101 (2008).
- D. P. Shorten, R. E. Spinney, and J. T. Lizier, Estimating transfer entropy in continuous time between neural spike trains or other event-based data, PLOS Computational Biology 17, 1 (2021).
- B. D. Fulcher, Feature-based time-series analysis, in Feature Engineering for Machine Learning and Data Analytics (CRC Press, 2018).
- T. Henderson and B. D. Fulcher, An empirical evaluation of time-series feature sets, in 2021 International Conference on Data Mining Workshops (ICDMW) (2021) pp. 1032–1038.
- B. D. Fulcher, M. A. Little, and N. S. Jones, Highly comparative time-series analysis: The empirical structure of time series and their methods, Journal of The Royal Society Interface 10, 20130048 (2013).
- B. D. Fulcher and N. S. Jones, Hctsa: A computational framework for automated time-series phenotyping using massive feature extraction, Cell Systems 5, 527 (2017).
- M. O’Hara-Wild, R. Hyndman, and E. Wang, Feasts: Feature Extraction and Statistics for Time Series (2021).
- J. T. Lizier, Jidt: An information-theoretic toolkit for studying the dynamics of complex systems, Frontiers in Robotics and AI 1, 10.3389/frobt.2014.00011 (2014).
- P. F. Verdes, Assessing causality from multivariate time series, Physical Review E 72, 026222 (2005).
- S. Holm, A simple sequentially rejective multiple test procedure, Scandinavian Journal of Statistics 6, 65 (1979).
- A. Kraskov, H. Stögbauer, and P. Grassberger, Estimating mutual information, Physical Review E 69, 066138 (2004).
- B. Goswami, A brief introduction to nonlinear time series analysis and recurrence plots, Vibration 2, 332 (2019).
- T. Schreiber and A. Schmitz, Surrogate time series, Physica D: Nonlinear Phenomena 142, 346 (2000).
- B. P. M. M. S. J. Leybourne and A. R. Tremayne, Can economic time series be differenced to stationarity?, Journal of Business & Economic Statistics 14, 435 (1996).
- M. Cassidy and W. Penny, Bayesian nonstationary autoregressive models for biomedical signal analysis, IEEE Transactions on Biomedical Engineering 49, 1142 (2002).
- George.E.P. Box and G. M. Jenkins, Time Series Analysis: Forecasting and Control (Holden-Day, 1976) Chap. 3.
- B. Venables and B. Ripley, Modern Applied Statistics with S (Springer, 2002).
- D. Feldman, The spike-timing dependence of plasticity, Neuron 75, 556 (2012).
- W. M. Usrey, J.-M. Alonso, and R. C. Reid, Synaptic interactions between thalamic inputs to simple cells in cat visual cortex, Journal of Neuroscience 20, 5461 (2000).
- T. Schreiber, Detecting and analyzing nonstationarity in a time series using nonlinear cross predictions, Physical Review Letters 78, 843 (1997).
- L. A. Aguirre and C. Letellier, Nonstationarity signatures in the dynamics of global nonlinear models, Chaos: An Interdisciplinary Journal of Nonlinear Science 22, 033136 (2012).
- V. Griffith and C. Koch, Quantifying synergistic mutual information, in Guided Self-Organization: Inception (Springer Berlin Heidelberg, Berlin, Heidelberg, 2014) pp. 159–190.
- R. Quax, O. Har-Shemesh, and P. M. A. Sloot, Quantifying synergistic information using intermediate stochastic variables, Entropy 19, 10.3390/e19020085 (2017).
- C. J. Rozell and D. H. Johnson, Analyzing the robustness of redundant population codes in sensory and feature extraction systems, Neurocomputing 69, 1215 (2006), computational Neuroscience: Trends in Research 2006.