Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 134 tok/s
Gemini 2.5 Pro 41 tok/s Pro
GPT-5 Medium 27 tok/s Pro
GPT-5 High 24 tok/s Pro
GPT-4o 81 tok/s Pro
Kimi K2 205 tok/s Pro
GPT OSS 120B 432 tok/s Pro
Claude Sonnet 4.5 37 tok/s Pro
2000 character limit reached

Relevance feedback strategies for recall-oriented neural information retrieval (2311.15110v1)

Published 25 Nov 2023 in cs.CL

Abstract: In a number of information retrieval applications (e.g., patent search, literature review, due diligence, etc.), preventing false negatives is more important than preventing false positives. However, approaches designed to reduce review effort (like "technology assisted review") can create false negatives, since they are often based on active learning systems that exclude documents automatically based on user feedback. Therefore, this research proposes a more recall-oriented approach to reducing review effort. More specifically, through iteratively re-ranking the relevance rankings based on user feedback, which is also referred to as relevance feedback. In our proposed method, the relevance rankings are produced by a BERT-based dense-vector search and the relevance feedback is based on cumulatively summing the queried and selected embeddings. Our results show that this method can reduce review effort between 17.85% and 59.04%, compared to a baseline approach (of no feedback), given a fixed recall target

Definition Search Book Streamline Icon: https://streamlinehq.com
References (7)
  1. Chai, C.P.: Comparison of text preprocessing methods. Natural Language Engineering pp. 509–553 (2023)
  2. Deolalikar, V.: How valuable is your data? A quantitative approach using data mining. In: 2015 IEEE International Conference on Big Data (Big Data). pp. 1248–1253. IEEE (2015)
  3. Jaccard, P.: Nouvelles recherches sur la distribution florale. Bull. Soc. Vaud. Sci. Nat. pp. 223–270 (1908)
  4. Manning, C.D.: Introduction to information retrieval. Syngress Publishing (2008)
  5. Piramuthu, O.B.: Multiple choice online algorithms for technology-assisted reviews. In: Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing. pp. 639–645 (2023)
  6. Rocchio, J.J.: Document Retrieval System-Optimization and Evaluation. DIR 2009 Dutch-Belgian Information Retrieval Workshop p. 99 (2009)
  7. Roitblat, H.L.: Probably Reasonable Search in eDiscovery. arXiv e-prints p. 2201 (2022)

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Lightbulb Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube