This UDF provides text capturing support for applications and controls using Tesseract - an OCR engine currently developed by Google.
Tesseract was originally developed as proprietary software at Hewlett-Packard between 1985 until 1995. After ten years without any development taking place, Hewlett Packard and UNLV released it as open source in 2005. Tesseract is currently developed by Google and released under the Apache License, Version 2.0.
Tesseract is considered one of the most accurate free software OCR engines currently available. It was one of the top 3 engines in the 1995 UNLV Accuracy test.
My main goal in developing this UDF is to provide AutoIT users with a free Screen OCR solution that competes with other commercial (payed) technologies like Microsoft Office Document Imaging (MODI) and Textract.
REQUIREMENTS:
AutoIt3 3.2 or higherTesseract 2.01 or aboveINSTALLATION:
To install Tesseract:
Run the file http://web.aanet.com.au/seangriffin/Tesseract201.exe.Follow the installation instructions.LIST OF FUNCTIONS:
DEMONSTRATION:
<Under Construction>
EXAMPLES:
_TesseractControlCapture.au3_TesseractControlFind.au3
DOWNLOAD:
Latest Version - v0.6 (17/03/09)
Tesseract.au3