Loading...
2015
Local Similarity based Document Layout Analysis using Improved ARLSA
Local Similarity based Document Layout Analysis using Improved ARLSA
한국콘텐츠학회
김수형 외 1명
논문정보
- Publisher
- International Journal of Contents
- Issue Date
- 2015-06-30
- Keywords
- -
- Citation
- -
- Source
- -
- Journal Title
- -
- Volume
- 11
- Number
- 2
- Start Page
- 15
- End Page
- 19
- DOI
- ISSN
- 17386764
Abstract
In this paper, we propose an efficient document layout analysis algorithm that includes table detection. Typical methods of document layout analysis use the height and gap between words or columns. To correspond to the various styles and sizes of documents, we propose an algorithm that uses the mean value of the distance transform representing thickness and compare with components in the local area. With this algorithm, we combine a table detection algorithm using the same feature as that of the text classifier. Table candidates, separators, and big components are isolated from the image using Connected Component Analysis (CCA) and distance transform. The key idea of text classification is that the characteristics of the text parallel components that have a similar thickness and height. In order to estimate local similarity, we detect a text region using an adaptive searching window size. An improved adaptive run-length smoothing algorithm (ARLSA) was proposed to create the proper boundary of a text zone and non-text zone. Results from experiments on the ICDAR2009 page segmentation competition test set and our dataset demonstrate the superiority of our dataset through f-measure comparison with other algorithms.
- 전남대학교
- KCI
- International Journal of Contents
저자 정보
| 이름 | 소속 |
|---|---|
| 김수형 | 인공지능학부 |