Chars74K

by No License

Chars74K

The Chars74K dataset consists of 64 classes (0-9, A-Z, a-z), 7705 characters obtained from natural images, 3410 hand drawn characters using a tablet PC, 62992 synthesised characters from computer fonts. This gives a total of over 74K images (which explains the name of the dataset).In the English language, Latin script (excluding accents) and Hindu-Arabic numerals are used. For simplicity we call this the English characters set.T. E. de Campos, B. R. Babu and M. Varma. Character recognition in natural images. In Proceedings of the International Conference on Computer Vision Theory and Applications (VISAPP), Lisbon, Portugal, February 2009.Bibtex | Abstract | PDF

Dataset Attributes

Label SVG
CategoriesText, Detection