Quantifying predictive uncertainty of aphasia severity in stroke patients with sparse heteroscedastic Bayesian high-dimensional regression (2309.08783v4)
Abstract: Sparse linear regression methods for high-dimensional data commonly assume that residuals have constant variance, which can be violated in practice. For example, Aphasia Quotient (AQ) is a critical measure of language impairment and informs treatment decisions, but it is challenging to measure in stroke patients. It is of interest to use high-resolution T2 neuroimages of brain damage to predict AQ. However, sparse regression models show marked evidence of heteroscedastic error even after transformations are applied. This violation of the homoscedasticity assumption can lead to bias in estimated coefficients, prediction intervals (PI) with improper length, and increased type I errors. Bayesian heteroscedastic linear regression models relax the homoscedastic error assumption but can enforce restrictive prior assumptions on parameters, and many are computationally infeasible in the high-dimensional setting. This paper proposes estimating high-dimensional heteroscedastic linear regression models using a heteroscedastic partitioned empirical Bayes Expectation Conditional Maximization (H-PROBE) algorithm. H-PROBE is a computationally efficient maximum a posteriori estimation approach that requires minimal prior assumptions and can incorporate covariates hypothesized to impact heterogeneity. We apply this method by using high-dimensional neuroimages to predict and provide PIs for AQ that accurately quantify predictive uncertainty. Our analysis demonstrates that H-PROBE can provide narrower PI widths than standard methods without sacrificing coverage. Narrower PIs are clinically important for determining the risk of moderate to severe aphasia. Additionally, through extensive simulation studies, we exhibit that H-PROBE results in superior prediction, variable selection, and predictive inference compared to alternative methods.
- Alfons, A., Croux, C., and Gelper, S. (2013), “Sparse least trimmed squares regression for analyzing high-dimensional large data sets,” The Annals of Applied Statistics, 7, 226–248.
- Begoli, E., Bhattacharya, T., and Kusnezov, D. (2019), “The need for uncertainty quantification in machine-assisted medical decision making,” Nature Machine Intelligence, 1, 20–23.
- Belloni, A., Chernozhukov, V., and Wang, L. (2014), “Pivotal Estimation via Square-Root LASSO in Nonparametric Regression,” The Annals of Statistics, 42, 757–788.
- Blanchard, G. and Roquain, E. (2009), “Adaptive FDR control under independence and dependence,” Journal of Machine Learning Research, 10, 2837 –2831.
- Buonaccorsi, J. P. (1995), “Prediction in the presence of measurement error: General discussion and an example predicting defoliation,” Biometrics, 1562–1569.
- Castillo, I. and Roquain, É. (2020), “On spike and slab empirical Bayes multiple testing,” The Annals of Statistics, 48, 2548–2574.
- Chiou, H. T., Guo, M., and Ing, C. K. (2020), “Variable selection for high-dimensional regression models with time series and heteroscedastic errors,” Journal of Econometrics, 216, 118–136.
- Daye, Z. J., Chen, J., and Li, H. (2012), “High-Dimensional Heteroscedastic Regression with an Application to eQTL Data Analysis,” Biometrics, 68, 316–326.
- Efron, B. (2008), “Microarrays, Empirical Bayes and the Two-Group Model,” Statistical Science, 23, 1–22.
- Johnson, L., Basilakos, A., Yourganov, G., Cai, B., Bonilha, L., Rorden, C., and Fridriksson, J. (2019), “Progression of Aphasia Severity in the Chronic Stages of Stroke,” American Journal of Speech-Language Pathology, 28, 639–649.
- Lee, J., Ko, M., Park, S., and Kim, G. (2021), “Prediction of Aphasia Severity in Patients with Stroke Using Diffusion Tensor Imaging,” Brain Sciences, 11, 304.
- Lei, J., G’Sell, M., Rinaldo, A., Tibshirani, R. J., and Wasserman, L. (2018), “Distribution-free predictive inference for regression,” Journal of the American Statistical Association, 113, 1094–1111.
- Liang, F., Paulo, R., Molina, G., Clyde, M. A., and Berger, J. O. (2008), “Mixtures of g priors for Bayesian variable selection,” Journal of the American Statistical Association, 103, 410–423.
- Liu, C., Rubin, D. B., and Wu, Y. N. (1998), “Parameter expansion to accelerate EM: The PX-EM algorithm,” Biometrika, 85, 755–770.
- Martin, R. and Tang, Y. (2020), “Empirical Priors for Prediction in Sparse High-dimensional Linear Regression.” Journal of Machine Learning Research, 21, 1–30.
- McLain, A. C. and Zgodic, A. (2021), “Fitting high-dimensional linear regression models with probe,” https://github.com/alexmclain/UNHIDEM.
- McLain, A. C., Zgodic, A., and Bondell, H. (2022), “Sparse high-dimensional linear regression with a partitioned empirical Bayes ECM algorithm.” https://arxiv.org/abs/2209.08139.
- Meng, X. L. and Rubin, D. B. (1993), “Maximum likelihood estimation via the ECM algorithm: A general framework,” Biometrika, 80, 267–278.
- Minka, T. and Lafferty, J. (2002), “Expectation-propagation for the generative aspect model,” in Proceedings of the Eighteenth conference on Uncertainty in artificial intelligence, pp. 352–359.
- Nachev, P., Coulthard, E., Jäger, H. R., Kennard, C., and Husain, M. (2008), “Enantiomorphic normalization of focally lesioned brains,” Neuroimage, 39, 1215–1226.
- Odekar, A. and Hallowell, B. (2005), “Comparison of Alternatives to Multidimensional Scoring in the Assessment of Language Comprehension in Aphasia,” American Journal of Speech-Language Pathology, 14, 337–345.
- Parker, P., Holan, S., and Wills, S. (2021), “A general Bayesian model for heteroskedastic data with fully conjugate full-conditional distributions,” Journal of Statistical Computation and Simulation, 91, 3207–3227.
- Risser, A. H. and Spreen, O. (1985), “The western aphasia battery,” Journal of clinical and experimental neuropsychology, 7, 463–470.
- Rorden, C., Bonilha, L., Fridriksson, J., Bender, B., and Karnath, H.-O. (2012), “Age-specific CT and MRI templates for spatial normalization,” Neuroimage, 61, 957–965.
- Rousseeuw, P. and Van Driessen, K. (2006), “Computing LTS regression for large data sets,” Data Mining and Knowledge Discovery, 12, 29–45.
- Schlather, M., Malinowski, A., Menck, P. J., Oesting, M., and Strokorb, K. (2015), “Analysis, Simulation and Prediction of Multivariate Random Fields with Package RandomFields,” Journal of Statistical Software, 63, 1–25.
- Storey, J. D. (2007), “The optimal discovery procedure: a new approach to simultaneous significance testing,” Journal of the Royal Statistical Society: Series B (Methodological), 69, 347–368.
- Teghipco, A., Newman-Norlund, R., Fridriksson, J., Rorden, C., and Bonilha, L. (2023), “Distinct brain morphometry patterns revealed by deep learning improve prediction of aphasia severity,” Research Square.
- Temlyakov, V. (2000), “Weak greedy algorithms,” Advances in Computational Mathematics, 12, 213––227.
- Tibshirani, R. and Foygel, R. (2019), “Conformal prediction under covariate shift.” Advances in neural information processing systems.
- Vehtari, A., Gelman, A., Sivula, T., Jylänki, P., Tran, D., Sahai, S., Blomstedt, P., Cunningham, J. P., Schiminovich, D., and Robert, C. P. (2020), “Expectation Propagation as a Way of Life: A Framework for Bayesian Inference on Partitioned Data,” Journal of Machine Learning Research, 21, 1–53.
- Wang, H., Li, G., and Jiang, G. (2007), “Robust Regression Shrinkage and Consistent Variable Selection Through the LAD-Lasso,” Journal of Business and Economic Statistics, 25, 347–355.
- Wang, X. and Leng, C. (2016), “High dimensional ordinary least squares projection for screening variables,” Journal of the Royal Statistical Society. Series B (Statistical Methodology), 78, 589–611.
- Yourganov, G., Smith, K., Fridriksson, J., and Rorden, C. (2015), “Predicting aphasia type from brain damage measured with structural MRI,” Cortex, 73, 203–2015.
- Zhou, K., Li, K.-C., and Zhou, Q. (2021), “Honest confidence sets for high-dimensional regression by projection and shrinkage,” Journal of the American Statistical Association, 1–20.
- Zhou, L. and Zou, H. (2021), “Cross-Fitted Residual Regression for High-Dimensional Heteroscedasticity Pursuit,” Journal of the American Statistical Association, 0, 1–10.
- Ziel, F. (2016), “Iteratively reweighted adaptive lasso for conditional heteroscedastic time series with applications to AR-ARCH type processes,” Computational Statistics and Data Analysis, 100, 773–793.
- Zou, K., Chen, Z., Yuan, X., Shen, X., Wang, M., and Fu, H. (2023), “A review of uncertainty estimation and its application in medical imaging,” Meta-Radiology, 1, 100003.
- Anja Zgodic (4 papers)
- Ray Bai (20 papers)
- Jiajia Zhang (23 papers)
- Yuan Wang (251 papers)
- Chris Rorden (6 papers)
- Alexander McLain (2 papers)