OSDN > Buscar Software > External Sites > SourceForge.net > Sanskrit / Hindi - Tesseract OCR > los archivos de la lista de descargas > Descargar

Sanskrit / Hindi - Tesseract OCR

Download of san.mnt-v7-traineddataonly.zip (san.mnt-v7-traineddataonly.zip ( external link: SF.net): 8,963,389 octetos) will begin shortly. If not so, click link on the left.

File Information

File Size: 8,963,389 octetos
MD5: b138a3fcf880a832287e5e62c7672044

Where do you want to go next?

Go to the project page on OSDN View another version

Opinión

Promedio

0.0

0 total

5 Estrellas	0
4 Estrellas	0
3 Estrellas	0
2 Estrellas	0
1 Estrella	0

Your rating

Review this project

Descripción del Proyecto

Tesseract OCR 3.02 provides hin.traineddata for recognizing texts in devanagari scripts. However the Hindi training texts, images and box files are not provided, so it is difficult to improve the accuracy by further improving the traineddata. It is noted that recognition is more accurate and faster if the training is done with the same /similar font as used in the text to be OCRed.

I am experimenting with different fonts and training texts and will post the traineddata files for various devanagari fonts in the hope that these can be used to OCR the various scanned books with devanagari text.

Currently traineddata file for Sanskrit2003 font and another similar font used in a book are uploaded here.

See DocumentationWiki for more details.

Sanskrit / Hindi - Tesseract OCR

File Information

Where do you want to go next?

Opinión

Your rating on Sanskrit / Hindi - Tesseract OCR

Descripción del Proyecto