2000 character limit reached
RLStop: A Reinforcement Learning Stopping Method for TAR (2405.02525v2)
Published 3 May 2024 in cs.IR
Abstract: We present RLStop, a novel Technology Assisted Review (TAR) stopping rule based on reinforcement learning that helps minimise the number of documents that need to be manually reviewed within TAR applications. RLStop is trained on example rankings using a reward function to identify the optimal point to stop examining documents. Experiments at a range of target recall levels on multiple benchmark datasets (CLEF e-Health, TREC Total Recall, and Reuters RCV1) demonstrated that RLStop substantially reduces the workload required to screen a document collection for relevance. RLStop outperforms a wide range of alternative approaches, achieving performance close to the maximum possible for the task under some circumstances.
- Measure-based metasearch. In Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval. 571–572.
- Providing More Efficient Access To Government Records: A Use Case Involving Application of Machine Learning to Improve FOIA Review for the Deliberative Process Privilege. arXiv preprint arXiv:2011.07203 (2020).
- Reem Bin-Hezam and Mark Stevenson. 2023. Combining Counting Processes and Classification Improves a Stopping Rule for Technology Assisted Review. In Findings of the Association for Computational Linguistics: EMNLP 2023. Association for Computational Linguistics, Singapore, 2603–2609. https://doi.org/10.18653/v1/2023.findings-emnlp.171
- Max W Callaghan and Finn Müller-Hansen. 2020. Statistical stopping criteria for automated screening in systematic reviews. Systematic Reviews 9, 1 (2020), 1–14.
- Gordon Cormack and Maura Grossman. 2015. Autonomy and Reliability of Continuous Active Learning for Technology-Assisted Review. arXiv preprint arXiv:1504.06868 (apr 2015). arXiv:1504.06868
- Gordon V Cormack and Maura R Grossman. 2014. Evaluation of machine-learning protocols for technology-assisted review in electronic discovery. In Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval. 153–162.
- Gordon V Cormack and Maura R Grossman. 2016a. Engineering Quality and Reliability in Technology-Assisted Review. In Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval. 75–84.
- Gordon V. Cormack and Maura R. Grossman. 2016b. Engineering quality and reliability in technology-assisted review. In Proceedings of the 39th International ACM SIGIR Conference on Research and Development in Information Retrieval. Association for Computing Machinery, Inc, 75–84. https://doi.org/10.1145/2911451.2911510
- Gordon V Cormack and Maura R Grossman. 2016c. Scalability of continuous active learning for reliable high-recall text classification. In Proceedings of the 25th ACM international on conference on information and knowledge management. 1039–1048.
- Giorgio Maria Di Nunzio. 2018. A study of an automatic stopping strategy for technologically assisted medical reviews. In European Conference on Information Retrieval. Springer, 672–677.
- Eugene Lodewick Grant and Richard S Leavenworth. 1980. Statistical quality control. Vol. 7. McGraw-Hill New York.
- TREC 2016 Total Recall Track Overview. In Proceedings of The Twenty-Fifth Text REtrieval Conference, TREC 2016 (NIST Special Publication, Vol. 500-321). National Institute of Standards and Technology (NIST).
- Cochrane Handbook for Systematic Reviews of Interventions. John Wiley & Sons.
- Noah Hollmann and Carsten Eickhoff. 2017. Ranking and Feedback-based Stopping for Recall-Centric Document Retrieval. In Working Notes of CLEF 2017 - Conference and Labs of the Evaluation Forum. 7–8.
- SWIFT-Active Screener: Accelerated document screening through active learning and integrated recall estimation. Environment International 138 (2020), 105623. https://www.sciencedirect.com/science/article/pii/S0160412019314023
- CLEF 2017 Technologically Assisted Reviews in Empirical Medicine Overview. In CEUR workshop proceedings, Vol. 1866.
- CLEF 2018 Technologically Assisted Reviews in Empirical Medicine Overview. In CEUR workshop proceedings, Vol. 2125.
- CLEF 2019 Technology Assisted Reviews in Empirical Medicine Overview. In CEUR workshop proceedings, Vol. 2380.
- Certifying One-Phase Technology-Assisted Reviews. In Proceedings of the 30th ACM International Conference on Information & Knowledge Management.
- RCV1: A New Benchmark Collection for Text Categorization Research. Journal of Machine Learning Research 5 (2004), 361–397. https://doi.org/10.5555/1005332.1005345
- Dan Li and Evangelos Kanoulas. 2020. When to Stop Reviewing in Technology-Assisted Reviews: Sampling from an Adaptive Distribution to Estimate Residual Relevant Documents. ACM Trans. on Information Systems 38, 4 (2020), 1–36. https://doi.org/10.1145/3411755
- When to stop making relevance judgments? A study of stopping methods for building information retrieval test collections. Journal of the Association for Information Science and Technology 70, 1 (2019), 49–60.
- How the accuracy and confidence of sensitivity classification affects digital sensitivity review. ACM Transactions on Information Systems (TOIS) 39, 1 (2020), 1–34.
- Playing Atari with Deep Reinforcement Learning. arXiv:1312.5602 [cs.LG]
- Alessio Molinari and Andrea Esuli. 2023. SALτ𝜏\tauitalic_τ: efficiently stopping TAR by improving priors estimates. Data Mining and Knowledge Discovery (2023), 1–34.
- A reinforcement learning framework for relevance feedback. In Proceedings of the 43rd international acm sigir conference on research and development in information retrieval. 59–68.
- Rodrigo Nogueira and Kyunghyun Cho. 2017. Task-oriented query reformulation with reinforcement learning. arXiv preprint arXiv:1704.04572 (2017).
- Stable-Baselines3: Reliable Reinforcement Learning Implementations. Journal of Machine Learning Research 22, 268 (2021), 1–8. http://jmlr.org/papers/v22/20-1364.html
- Optimizing query evaluations using reinforcement learning for web search. In The 41st International ACM SIGIR Conference on Research & Development in Information Retrieval. 1193–1196.
- Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347 (2017).
- Pinpointing needles in giant haystacks: use of text mining to reduce impractical screening workload in extremely large scoping reviews. Research Synthesis Methods 5, 1 (2014), 31–49.
- Mark Stevenson and Reem Bin-Hezam. 2023. Stopping Methods for Technology-assisted Reviews Based on Point Processes. ACM Transactions on Information Systems 42, 3 (2023), 1–37.
- Richard S. Sutton and Andrew G. Barto. 2018. Reinforcement learning: An introduction. MIT Press. http://incompleteideas.net/book/RLbook2020.pdf
- Gymnasium. https://doi.org/10.5281/zenodo.8127026
- Ronald J Williams. 1992. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine learning 8 (1992), 229–256.
- Heuristic Stopping Rules for Technology-Assisted Review. In Proceedings of the 21st ACM Symposium on Document Engineering 2021 (DocEng ’21). Article 31, 10 pages. https://doi.org/10.1145/3469096.3469873
- On minimizing cost in legal document review workflows. In Proceedings of the 21st ACM Symposium on Document Engineering 2021 (DocEng ’21). Article 30, 10 pages. https://doi.org/10.1145/3469096.3469872
- TAR on social media: A framework for online content moderation. In 2nd international conference on Design of Experimental Search & Information REtrieval Systems (DESIRES 2021). 147–155.
- Zhe Yu and Tim Menzies. 2019. FAST2: An Intelligent Assistant for Finding Relevant Papers. Expert Systems with Applications 120 (2019), 57–71.
- Multi page search with reinforcement learning to rank. In Proceedings of the 2018 ACM SIGIR international conference on theory of information retrieval. 175–178.
- Jianghong Zhou and Eugene Agichtein. 2020. Rlirank: Learning to rank with reinforcement learning for dynamic search. In Proceedings of The Web Conference 2020. 2842–2848.
- Justin Zobel. 1998. How Reliable Are the Results of Large-Scale Information Retrieval Experiments?. In Proceedings of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 307–314.
- Reem Bin-Hezam (4 papers)
- Mark Stevenson (30 papers)