Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

TERMinator: A system for scientific texts processing (2209.14854v1)

Published 29 Sep 2022 in cs.CL

Abstract: This paper is devoted to the extraction of entities and semantic relations between them from scientific texts, where we consider scientific terms as entities. In this paper, we present a dataset that includes annotations for two tasks and develop a system called TERMinator for the study of the influence of LLMs on term recognition and comparison of different approaches for relation extraction. Experiments show that LLMs pre-trained on the target language are not always show the best performance. Also adding some heuristic approaches may improve the overall quality of the particular task. The developed tool and the annotated corpus are publicly available at https://github.com/iis-research-team/terminator and may be useful for other researchers.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Elena Bruches (5 papers)
  2. Olga Tikhobaeva (1 paper)
  3. Yana Dementyeva (1 paper)
  4. Tatiana Batura (10 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.