Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Similarity Learning for Authorship Verification in Social Media (1908.07844v1)

Published 20 Aug 2019 in cs.CL, cs.LG, and stat.ML

Abstract: Authorship verification tries to answer the question if two documents with unknown authors were written by the same author or not. A range of successful technical approaches has been proposed for this task, many of which are based on traditional linguistic features such as n-grams. These algorithms achieve good results for certain types of written documents like books and novels. Forensic authorship verification for social media, however, is a much more challenging task since messages tend to be relatively short, with a large variety of different genres and topics. At this point, traditional methods based on features like n-grams have had limited success. In this work, we propose a new neural network topology for similarity learning that significantly improves the performance on the author verification task with such challenging data sets.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Benedikt Boenninghoff (10 papers)
  2. Robert M. Nickel (7 papers)
  3. Steffen Zeiler (8 papers)
  4. Dorothea Kolossa (33 papers)
Citations (40)

Summary

We haven't generated a summary for this paper yet.