文献の詳細
論文の言語 | 英語 |
---|---|
著者 | Wei Fan, Koichi Kise, and Masakazu Iwamura |
論文名 | Automatic Character Labeling for Camera Captured Document Images |
論文誌名 | Proc. 2016 The International Conference on Image Processing (ICIP 2016) |
ページ | pp.3284-3288 |
発表場所 | Phoenix, Arizona, USA |
査読の有無 | 有 |
年月 | 2016年9月 |
要約 | Character groundtruth for camera captured documents is crucial for training and evaluating advanced OCR algorithms. Manually generating character level groundtruth is a time consuming and costly process. This paper proposes a robust groundtruth generation method based on document retrieval and image registration for camera captured documents. We use an elastic non-rigid alignment method to fit the captured document image which relaxes the flat paper assumption made by conventional solutions. The proposed method allows building very large scale labeled camera captured documents dataset, without any human intervention. We construct a large labeled dataset consisting of 1 million camera captured Chinese character images. Evaluation of samples generated by our approach showed that 99.99% of the images were correctly labeled, even with different distortions specific to cameras such as blur, specularity and perspective distortion. |
DOI | 10.1109/ICIP.2016.7532967 |
- 次のファイルが利用可能です.
- BibTeX用エントリー
@InProceedings{Fan2016, author = {Wei Fan and Koichi Kise and Masakazu Iwamura}, title = {Automatic Character Labeling for Camera Captured Document Images}, booktitle = {Proc. 2016 The International Conference on Image Processing (ICIP 2016)}, year = 2016, month = sep, pages = {3284--3288}, DOI = {10.1109/ICIP.2016.7532967}, location = {Phoenix, Arizona, USA} }