Research Hub

대학 자원

대학 인프라와 자원을 공유해 공동 연구와 기술 활용을 지원합니다.

Loading...

논문 리스트

2015
Local Similarity based Document Layout Analysis using Improved ARLSA Local Similarity based Document Layout Analysis using Improved ARLSA
한국콘텐츠학회
김수형 외 1명
논문정보
Publisher
International Journal of Contents
Issue Date
2015-06-30
Keywords
-
Citation
-
Source
-
Journal Title
-
Volume
11
Number
2
Start Page
15
End Page
19
DOI
ISSN
17386764
Abstract
In this paper, we propose an efficient document layout analysis algorithm that includes table detection. Typical methods of document layout analysis use the height and gap between words or columns. To correspond to the various styles and sizes of documents, we propose an algorithm that uses the mean value of the distance transform representing thickness and compare with components in the local area. With this algorithm, we combine a table detection algorithm using the same feature as that of the text classifier. Table candidates, separators, and big components are isolated from the image using Connected Component Analysis (CCA) and distance transform. The key idea of text classification is that the characteristics of the text parallel components that have a similar thickness and height. In order to estimate local similarity, we detect a text region using an adaptive searching window size. An improved adaptive run-length smoothing algorithm (ARLSA) was proposed to create the proper boundary of a text zone and non-text zone. Results from experiments on the ICDAR2009 page segmentation competition test set and our dataset demonstrate the superiority of our dataset through f-measure comparison with other algorithms.

저자 정보

이름 소속
김수형 인공지능학부