Topological and Semantic Graph-based Author Disambiguation on DBLP Data in Neo4j (1901.08977v1)

Published 25 Jan 2019 in cs.IR, cs.DL, and cs.SI

Abstract: In this work, we introduce a novel method for entity resolution, specifically author disambiguation, in bibliographic networks. The method is based on a two-step network traversal that uses topological similarity measures to rate candidate nodes. Topological similarity is widely used in the Link Prediction application domain to assess the likelihood of an unknown link. A similarity function can be a good approximation of equality and can therefore be used to disambiguate, based on the hypothesis that authors with many common co-authors are similar. Our method was tested on a graph-based representation of the public DBLP Computer Science database. The results obtained are extremely encouraging in terms of Precision, Accuracy, and Specificity. Further advantages are the locality of the disambiguation assessment, which avoids the need to know the global network, and the use of only a small amount of data, e.g. author name and paper title (i.e., co-authorship data).
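The idea of rating candidates by shared co-authors can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not name the similarity measures used, so Jaccard similarity over co-author sets is assumed here as one common topological measure, and the "two-step traversal" is interpreted as collecting co-authors of co-authors. The function names and the sample author names are hypothetical.

```python
from collections import defaultdict


def build_coauthor_graph(papers):
    """Map each author name to the set of names sharing a paper with it."""
    graph = defaultdict(set)
    for authors in papers:
        for a in authors:
            graph[a].update(b for b in authors if b != a)
    return graph


def jaccard(graph, a, b):
    """Jaccard similarity of the co-author sets of a and b (assumed measure)."""
    na, nb = graph[a] - {b}, graph[b] - {a}
    union = na | nb
    return len(na & nb) / len(union) if union else 0.0


def rank_candidates(graph, author):
    """Assumed two-step traversal: candidates are co-authors of co-authors.

    Only the local neighbourhood of `author` is visited, so the global
    network is never needed.
    """
    candidates = set()
    for co in graph[author]:
        candidates |= graph[co]
    candidates.discard(author)
    scored = [(c, jaccard(graph, author, c)) for c in candidates]
    # Highest similarity first; ties broken alphabetically for determinism.
    return sorted(scored, key=lambda pair: (-pair[1], pair[0]))


# Hypothetical data: each inner list is the author list of one paper.
papers = [
    ["J. Smith", "A. Rossi", "B. Chen"],
    ["John Smith", "A. Rossi", "B. Chen"],
    ["A. Rossi", "C. Diaz"],
]
graph = build_coauthor_graph(papers)
ranked = rank_candidates(graph, "J. Smith")
```

In this toy data, "J. Smith" and "John Smith" never appear on the same paper but share all their co-authors, so "John Smith" is ranked first as the most likely duplicate. A real system would still need a threshold on the score and a name-compatibility check before merging two nodes.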

Authors (3)
  1. Valentina Franzoni (6 papers)
  2. Michele Lepri (1 paper)
  3. Alfredo Milani (5 papers)
Citations (7)
