Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Improving Document Binarization via Adversarial Noise-Texture Augmentation (1810.11120v2)

Published 25 Oct 2018 in cs.CV

Abstract: Binarization of degraded document images is an elementary step in most of the problems in document image analysis domain. The paper re-visits the binarization problem by introducing an adversarial learning approach. We construct a Texture Augmentation Network that transfers the texture element of a degraded reference document image to a clean binary image. In this way, the network creates multiple versions of the same textual content with various noisy textures, thus enlarging the available document binarization datasets. At last, the newly generated images are passed through a Binarization network to get back the clean version. By jointly training the two networks we can increase the adversarial robustness of our system. Also, it is noteworthy that our model can learn from unpaired data. Experimental results suggest that the proposed method achieves superior performance over widely used DIBCO datasets.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Ayan Kumar Bhunia (63 papers)
  2. Aneeshan Sain (40 papers)
  3. Partha Pratim Roy (64 papers)
  4. Ankan kumar Bhunia (14 papers)
Citations (29)

Summary

We haven't generated a summary for this paper yet.