Maximizing Model Generalization for Machine Condition Monitoring with Self-Supervised Learning and Federated Learning (2304.14398v2)
Abstract: Deep Learning (DL) can diagnose faults and assess machine health from raw condition monitoring data without manually designed statistical features. However, practical manufacturing applications remain extremely difficult for existing DL methods. Machine data is often unlabeled and from very few health conditions (e.g., only normal operating data). Furthermore, models often encounter shifts in domain as process parameters change and new categories of faults emerge. Traditional supervised learning may struggle to learn compact, discriminative representations that generalize to these unseen target domains since it depends on having plentiful classes to partition the feature space with decision boundaries. Transfer Learning (TL) with domain adaptation attempts to adapt these models to unlabeled target domains but assumes similar underlying structure that may not be present if new faults emerge. This study proposes focusing on maximizing the feature generality on the source domain and applying TL via weight transfer to copy the model to the target domain. Specifically, Self-Supervised Learning (SSL) with Barlow Twins may produce more discriminative features for monitoring health condition than supervised learning by focusing on semantic properties of the data. Furthermore, Federated Learning (FL) for distributed training may also improve generalization by efficiently expanding the effective size and diversity of training data by sharing information across multiple client machines. Results show that Barlow Twins outperforms supervised learning in an unlabeled target domain with emerging motor faults when the source training data contains very few distinct categories. Incorporating FL may also provide a slight advantage by diffusing knowledge of health conditions between machines.
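The Barlow Twins objective named in the abstract can be illustrated with a small sketch. This is not the authors' implementation. It assumes the commonly described form of the loss: two augmented views of the same batch are embedded, each embedding dimension is standardized across the batch, and the cross-correlation matrix between the views is pushed toward the identity. The function names, the `lambd` weight of 5e-3, and the toy inputs are illustrative assumptions, and plain Python lists stand in for tensors.

```python
import math


def standardize(batch, eps=1e-8):
    """Give each embedding dimension zero mean and unit variance across the batch."""
    n, d = len(batch), len(batch[0])
    out = [[0.0] * d for _ in range(n)]
    for j in range(d):
        col = [row[j] for row in batch]
        mean = sum(col) / n
        std = math.sqrt(sum((v - mean) ** 2 for v in col) / n)
        for i in range(n):
            out[i][j] = (col[i] - mean) / (std + eps)
    return out


def barlow_twins_loss(z_a, z_b, lambd=5e-3):
    """Sketch of a Barlow Twins-style loss for two views (lists of N rows, D dims).

    lambd is an assumed trade-off weight between the invariance term
    (diagonal of the cross-correlation matrix) and the redundancy-reduction
    term (off-diagonal entries).
    """
    n, d = len(z_a), len(z_a[0])
    a, b = standardize(z_a), standardize(z_b)
    loss = 0.0
    for i in range(d):
        for j in range(d):
            # Cross-correlation between dimension i of view A and dimension j of view B.
            c_ij = sum(a[k][i] * b[k][j] for k in range(n)) / n
            if i == j:
                loss += (1.0 - c_ij) ** 2   # views should agree on each feature
            else:
                loss += lambd * c_ij ** 2   # different features should be decorrelated
    return loss


# Toy embeddings (hypothetical values): two decorrelated features vs. two redundant ones.
decorrelated = [[1, 1], [1, -1], [-1, 1], [-1, -1]]
redundant = [[1, 1], [1, 1], [-1, -1], [-1, -1]]
print(barlow_twins_loss(decorrelated, decorrelated))
print(barlow_twins_loss(redundant, redundant))
```

In this toy case the loss is near zero when both views agree and the features are decorrelated, and it grows when the features duplicate one another. No labels enter the computation, which is why such an objective can be trained on source data containing very few health conditions.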