Detail of Publication
Text Language | English |
---|---|
Authors | Sheraz Ahmed, Koichi Kise, Masakazu Iwamura, Marcus Liwicki, and Andreas Dengel |
Title | Automatic Ground Truth Generation of Camera Captured Documents Using Document Image Retrieval |
Journal | Proc. 12th International Conference on Document Analysis and Recognition (ICDAR 2013) |
Pages | pp.528-532 |
Location | Washington, DC, USA |
Reviewed or not | Reviewed |
Presentation type | Oral |
Month & Year | August 2013 |
Abstract | In this paper a novel method for automatic ground truth generation of camera captured document images is proposed. Currently, no dataset is available for camera captured documents. It is very difficult to build these datasets manually, as it is very laborious and costly. The proposed method is fully automatic, allowing building the very large scale (i.e., millions of images) labeled camera captured documents dataset, without any human intervention. Evaluation of samples generated by the proposed approach shows that 99.98% of the images are correctly labeled. Novelty of the proposed approach lies in the use of document image retrieval for automatic labeling, especially for camera captured documents, which contain different distortions specific to camera, e.g., blur, occlusion, perspective distortion, etc. |
DOI | 10.1109/ICDAR.2013.111 |
- Following files are available.
- Entry for BibTeX
@InProceedings{Ahmed2013, author = {Sheraz Ahmed and Koichi Kise and Masakazu Iwamura and Marcus Liwicki and Andreas Dengel}, title = {Automatic Ground Truth Generation of Camera Captured Documents Using Document Image Retrieval}, booktitle = {Proc. 12th International Conference on Document Analysis and Recognition (ICDAR 2013)}, year = 2013, month = aug, pages = {528--532}, DOI = {10.1109/ICDAR.2013.111}, location = {Washington, DC, USA} }