Transfer Learning with Random Coefficient Ridge Regression (2306.15915v1)
Abstract: Ridge regression with random coefficients provides an important alternative to fixed coefficients regression in high dimensional setting when the effects are expected to be small but not zeros. This paper considers estimation and prediction of random coefficient ridge regression in the setting of transfer learning, where in addition to observations from the target model, source samples from different but possibly related regression models are available. The informativeness of the source model to the target model can be quantified by the correlation between the regression coefficients. This paper proposes two estimators of regression coefficients of the target model as the weighted sum of the ridge estimates of both target and source models, where the weights can be determined by minimizing the empirical estimation risk or prediction risk. Using random matrix theory, the limiting values of the optimal weights are derived under the setting when $p/n \rightarrow \gamma$, where $p$ is the number of the predictors and $n$ is the sample size, which leads to an explicit expression of the estimation or prediction risks. Simulations show that these limiting risks agree very well with the empirical risks. An application to predicting the polygenic risk scores for lipid traits shows such transfer learning methods lead to smaller prediction errors than the single sample ridge regression or Lasso-based transfer learning.
- Transfer learning. In Handbook of research on machine learning applications and trends: algorithms, methods, and techniques, pages 242–264. IGI Global, 2010.
- Gene ontology based transfer learning for protein subcellular localization. BMC bioinformatics, 12:44, 2011.
- Deep convolutional neural networks for computer-aided detection: Cnn architectures, dataset characteristics and transfer learning. IEEE transactions on medical imaging, 35(5):1285–1298, 2016.
- Transfer learning approaches to improve drug sensitivity prediction in multiple myeloma patients. IEEE Access, 5:7381–7393, 2017.
- Integrative analysis of multi-omics data for discovery and functional studies of complex human diseases. In Advances in genetics, volume 93, pages 147–190. 2016.
- A statistical framework for cross-tissue transcriptome-wide association analysis. Nature genetics, 51(3):568–576, 2019.
- Horizontal and vertical integrative analysis methods for mental disorders omics data. Scientific Reports, pages 1–12, 2019. ISSN 2045-2322. doi:10.1038/s41598-019-49718-5. URL http://dx.doi.org/10.1038/s41598-019-49718-5.
- Hal Daumé III. Frustratingly easy domain adaptation. In Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics, pages 256–263, 2007.
- Transfer learning in heterogeneous collaborative filtering domains. Artificial intelligence, 197:39–55, 2013.
- Transfer learning for high-dimensional linear regression: Prediction, estimation, and minimax optimality. Journal of Royal Statistical Society, series B, 2022.
- Polygenic scores via penalized regression on summary statistics. Genetic epidemiology, 41(6):469–480, 2017.
- The personal and clinical utility of polygenic risk scores. Nature Reviews Genetics, 19:581–590, 2018.
- J. Pattee and W. Pan. Penalized regression and model selection methods for polygenic scores on summary statistics. PLoS Comput Biol, 16(10):e1008271, 2020.
- R. de Vlaming and P.J. Groenen. The current and future use of ridge regression for prediction in quantitative genetics. Biomed Res Int., page 2015:143712, 2015.
- Nature Genetics, 53:1097–1103, 2021.
- Core greml for estimating covariance between random effects in linear mixed models for complex trait analyses. Nature Communication, 11:4208, 2020.
- Estimation of pleiotropy between complex diseases using snp-derived genomic relationships and restricted maximum likelihood. Bioinformatics, 28(19):2540–2542, 2012.
- High-dimensional asymptotics of prediction: Ridge regression and classification. The Annals of Statistics, 46(1):247–279, 2018.
- One-shot distributed ridge regression in high dimensions. In Hal Daumé III and Aarti Singh, editors, Proceedings of the 37th International Conference on Machine Learning, volume 119 of Proceedings of Machine Learning Research, pages 8763–8772. PMLR, 13–18 Jul 2020.
- Cross-trait prediction accuracy of high-dimensional ridge-type estimators in genome-wide association studies. arXiv preprint arXiv:1911.10142, 2019.
- Asymptotics of ridge (less) regression under general source condition. In International Conference on Artificial Intelligence and Statistics, pages 3889–3897. PMLR, 2021.
- Denny Wu and Ji Xu. On the optimal weighted \\\backslash\ ell_2𝑒𝑙𝑙_2ell\_2italic_e italic_l italic_l _ 2 regularization in overparameterized linear regression. Advances in Neural Information Processing Systems, 33:10112–10123, 2020.
- Dimension free ridge regression. arXiv preprint arXiv:2210.08571, 2022.
- Surprises in high-dimensional ridgeless least squares interpolation. The Annals of Statistics, 50(2):949 – 986, 2022. doi:10.1214/21-AOS2133. URL https://doi.org/10.1214/21-AOS2133.
- Anisotropic local laws for random matrices. Probability Theory and Related Fields, 169:257–352, 2017.
- Spectral convergence for a general class of random matrices. Statistics & probability letters, 81(5):592–602, 2011.