Temporal Detection of Anomalies via Actor-Critic Based Controlled Sensing (2201.00879v2)
Abstract: We address the problem of monitoring a set of binary stochastic processes and generating an alert when the number of anomalies among them exceeds a threshold. For this, the decision-maker selects and probes a subset of the processes to obtain noisy estimates of their states (normal or anomalous). Based on the received observations, the decisionmaker first determines whether to declare that the number of anomalies has exceeded the threshold or to continue taking observations. When the decision is to continue, it then decides whether to collect observations at the next time instant or defer it to a later time. If it chooses to collect observations, it further determines the subset of processes to be probed. To devise this three-step sequential decision-making process, we use a Bayesian formulation wherein we learn the posterior probability on the states of the processes. Using the posterior probability, we construct a Markov decision process and solve it using deep actor-critic reinforcement learning. Via numerical experiments, we demonstrate the superior performance of our algorithm compared to the traditional model-based algorithms.
- Y.-D. Lee and W.-Y. Chung, “Wireless sensor network based wearable smart shirt for ubiquitous health and activity monitoring,” Sens. Actuators B Chem., vol. 140, no. 2, pp. 390–395, Jul. 2009.
- W.-Y. Chung and S.-J. Oh, “Remote monitoring system with wireless sensors module for room environment,” Sens. Actuators B Chem., vol. 113, no. 1, pp. 64–70, Jan. 2006.
- G. Joseph, A. B. Zoubi, C. R. Murthy, and V. J. Mathews, “Anomaly imaging for structural health monitoring exploiting clustered sparsity,” in Proc. IEEE ICASSP. IEEE, 2019, pp. 4255–4259.
- M. Naghshvar and T. Javidi, “Information utility in active sequential hypothesis testing,” in IEEE Allerton, Oct. 2010, pp. 123–129.
- I. Cleland, M. Han, C. Nugent, H. Lee, S. McClean, S. Zhang, and S. Lee, “Evaluation of prompted annotation of activity data recorded from a smart phone,” Sensors, vol. 14, no. 9, pp. 15 861–15 879, Sep. 2014.
- M. Han, Y.-K. Lee, S. Lee et al., “Comprehensive context recognizer based on multimodal sensors in a smartphone,” Sensors, vol. 12, no. 9, pp. 12 588–12 605, Sep. 2012.
- S. Aminikhanghahi and D. J. Cook, “A survey of methods for time series change point detection,” Knowl. Inf. Syst., vol. 51, no. 2, pp. 339–367, May 2017.
- S. Liu, M. Yamada, N. Collier, and M. Sugiyama, “Change-point detection in time-series data by relative density-ratio estimation,” Neural Netw., vol. 43, pp. 72–83, Jul. 2013.
- D. Agudelo-España, S. Gomez-Gonzalez, S. Bauer, B. Schölkopf, and J. Peters, “Bayesian online prediction of change points,” in Proc. Conf. on Uncertain. in Artif. Intell., Aug. 2020, pp. 320–329.
- E. Erdemir, P. L. Dragotti, and D. Gündüz, “Active privacy-utility trade-off against a hypothesis testing adversary,” in IEEE ICASSP, Jun. 2021, pp. 2660–2664.
- C. Zhong, M. C. Gursoy, and S. Velipasalar, “Deep actor-critic reinforcement learning for anomaly detection,” in Proc. Globecom, Dec. 2019.
- G. Joseph, M. C. Gursoy, and P. K. Varshney, “Anomaly detection under controlled sensing using actor-critic reinforcement learning,” in Proc. IEEE Inter. Workshop SPAWC, May 2020.
- H. Chernoff, “Sequential design of experiments,” Ann. Math. Stat., vol. 30, no. 3, pp. 755–770, Sep. 1959.
- S. A. Bessler, “Theory and applications of the sequential design of experiments, k-actions and infinitely many experiments. part i. theory,” Stanford Univ CA Applied Mathematics and Statistics Labs, Tech. Rep., 1960.
- S. Nitinawarat, G. K. Atia, and V. V. Veeravalli, “Controlled sensing for multihypothesis testing,” IEEE Trans. Autom. Control, vol. 58, no. 10, pp. 2451–2464, May 2013.
- M. Naghshvar and T. Javidi, “Active sequential hypothesis testing,” Ann. Stat., vol. 41, no. 6, pp. 2703–2738, 2013.
- B. Huang, K. Cohen, and Q. Zhao, “Active anomaly detection in heterogeneous processes,” IEEE Trans. Inf. Theory, vol. 65, no. 4, pp. 2284–2301, Aug. 2018.
- D. Kartik, E. Sabir, U. Mitra, and P. Natarajan, “Policy design for active sequential hypothesis testing using deep learning,” in Proc. Allerton, Oct. 2018, pp. 741–748.
- G. Joseph, C. Zhong, M. C. Gursoy, S. Velipasalar, and P. K. Varshney, “Anomaly detection via controlled sensing and deep active inference,” in Proc. IEEE Globecom, Dec. 2020.