文献の詳細
論文の言語 | 英語 |
---|---|
著者 | Masakazu Iwamura, Takahiro Matsuda, Naoyuki Morimoto, Hitomi Sato, Yuki Ikeda and Koichi Kise |
論文名 | Downtown Osaka Scene Text Dataset |
論文誌名 | Proc. 2nd International Workshop on Robust Reading (ECCV 2016 Workshops, Part I) |
Vol. | 9913 |
Series | Lecture Notes in Computer Science (LNCS) |
ページ | pp.1-16 |
出版社 | Springer |
発表場所 | Amsterdam, Netherlands |
査読の有無 | 有 |
発表の種類 | 口頭発表 |
年月 | 2016年10月 |
要約 | This paper presents a new scene text dataset named Downtown Osaka Scene Text Dataset (in short, DOST dataset). The dataset consists of sequential images captured in shopping streets in downtown Osaka with an omnidirectional camera. Unlike most of existing datasets consisting of scene images intentionally captured, DOST dataset consists of uncontrolled scene images; use of an omnidirectional camera enabled us to capture videos (sequential images) of whole scenes surrounding the camera. Since the dataset preserved the real scenes containing texts as they were, in other words, they are scene texts in the wild. DOST dataset contained 32,147 manually ground truthed sequential images. They contained 935,601 text regions consisting of 797,919 legible and 137,682 illegible. The legible regions contained 2,808,340 characters. The dataset is evaluated using two existing scene text detection methods and one powerful commercial end-to-end scene text recognition method to know the difficulty and quality in comparison with existing datasets. |
DOI | 10.1007/978-3-319-46604-0_32 |
- 次のファイルが利用可能です.
- BibTeX用エントリー
@InProceedings{Iwamura2016, author = {Masakazu Iwamura and Takahiro Matsuda and Naoyuki Morimoto and Hitomi Sato and Yuki Ikeda and Koichi Kise}, title = {Downtown Osaka Scene Text Dataset}, booktitle = {Proc. 2nd International Workshop on Robust Reading (ECCV 2016 Workshops, Part I)}, year = 2016, month = oct, volume = {9913}, pages = {1--16}, DOI = {10.1007/978-3-319-46604-0_32}, publisher = {Springer}, location = {Amsterdam, Netherlands} }