Cascade Network with Deformable Composite Backbone for Formula Detection in Scanned Document Images

Khurram Azeem Hashmi; Alain Pagani; Marcus Liwicki; Didier Stricker; Muhammad Zeshan Afzal

doi:10.20944/preprints202107.0165.v1

Submitted: 05 July 2021

Posted: 06 July 2021


Abstract

This paper presents a novel architecture for detecting mathematical formulas in document images, an important step for reliable information extraction in several domains. Cascade Mask R-CNN networks were recently introduced for object detection in computer vision. We propose the following modifications to the existing Cascade Mask R-CNN architecture. First, the proposed network uses deformable convolutions instead of conventional convolutions in the backbone network to better localize regions of interest. Second, it uses a dual backbone of ResNeXt-101 with composite connections at the parallel stages. Finally, the proposed network is end-to-end trainable. We evaluate the proposed approach on the ICDAR-2017 POD and Marmot datasets. It achieves state-of-the-art performance on ICDAR-2017 POD at the higher IoU threshold with an F1-score of 0.917, reducing the relative error by 7.8%. Moreover, it achieves a correct-detection accuracy of 81.3% on embedded formulas in the Marmot dataset, a relative error reduction of 30%.
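The evaluation terms used in the abstract (IoU threshold, F1-score, relative error reduction) can be illustrated with a minimal Python sketch. This is not the authors' evaluation code or the official ICDAR-2017 POD protocol: the greedy one-to-one matching, the function names, and the definition of relative error reduction as the fractional drop in (1 - score) are all assumptions made for illustration.

```python
from typing import List, Tuple

# A box is (x_min, y_min, x_max, y_max); this layout is an assumption.
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def f1_at_iou(preds: List[Box], truths: List[Box], threshold: float) -> float:
    """F1-score where a prediction counts as correct if its best unmatched
    ground-truth box has IoU >= threshold (greedy matching, hypothetical)."""
    matched = set()
    tp = 0
    for p in preds:
        best_j, best_iou = -1, 0.0
        for j, t in enumerate(truths):
            if j in matched:
                continue
            value = iou(p, t)
            if value > best_iou:
                best_j, best_iou = j, value
        if best_j >= 0 and best_iou >= threshold:
            matched.add(best_j)
            tp += 1
    if tp == 0:
        return 0.0
    precision = tp / len(preds)
    recall = tp / len(truths)
    return 2 * precision * recall / (precision + recall)


def relative_error_reduction(old_score: float, new_score: float) -> float:
    """Fraction by which the error (1 - score) shrinks; assumed definition."""
    old_error = 1.0 - old_score
    new_error = 1.0 - new_score
    return (old_error - new_error) / old_error


if __name__ == "__main__":
    # Toy boxes, not taken from any dataset.
    predictions = [(0, 0, 10, 10), (20, 20, 30, 30)]
    ground_truth = [(0, 0, 10, 10), (21, 20, 31, 30)]
    for thr in (0.6, 0.9):
        print(thr, f1_at_iou(predictions, ground_truth, thr))
```

In the toy example the second prediction overlaps its ground-truth box with an IoU of about 0.82, so it counts as correct at a threshold of 0.6 but not at 0.9. This is why scores reported at a higher IoU threshold reward tighter localization.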

Keywords: Formula detection; Cascade Mask R-CNN; Mathematical expression detection; document image analysis; deep neural networks; computer vision.

Subject: Computer Science and Mathematics - Algebra and Number Theory

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permits free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
