Question:
My system needs to process some images and extract key text data from them. The images will all follow a consistent layout.
Where can I start? Is there any API that can help me?
Answer:
The most common free OCR (optical character recognition) engine I know of is Tesseract. There are two ports of it for .NET.
I recommend converting the images to black and white before passing them to Tesseract. If possible, keep the text horizontal, and avoid text that is too small, since it can get lost.
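To give an idea of what the black-and-white step involves, here is a minimal, language-agnostic sketch written in Python with no external libraries. It assumes the image is already loaded as rows of (r, g, b) tuples, and the function names and the fixed threshold of 128 are my own assumptions for illustration. In a real project you would more likely use an imaging library, and unevenly lit images may need an adaptive threshold instead of a fixed one.

```python
def to_grayscale(rgb_pixels):
    """Convert rows of (r, g, b) tuples to rows of 0-255 luminance values."""
    return [
        # Standard luma weights: green contributes most to perceived brightness.
        [round(0.299 * r + 0.587 * g + 0.114 * b) for (r, g, b) in row]
        for row in rgb_pixels
    ]


def binarize(gray_pixels, threshold=128):
    """Map every gray value to pure black (0) or pure white (255).

    The threshold of 128 is an assumed default; tune it for your images.
    """
    return [
        [255 if value >= threshold else 0 for value in row]
        for row in gray_pixels
    ]


# Tiny 2x2 "image": black, white, light gray, dark gray.
sample = [
    [(0, 0, 0), (255, 255, 255)],
    [(200, 200, 200), (50, 50, 50)],
]
black_and_white = binarize(to_grayscale(sample))
```

The same two steps (grayscale, then threshold) carry over to .NET; only the way you read and write pixels changes.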
Another tip is to configure a list of characters that Tesseract is allowed to read. Depending on how the text looks, it may mistake some characters for others.
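As a rough illustration of the allowed-characters tip, the sketch below builds a call to the Tesseract command-line tool with the `tessedit_char_whitelist` variable set. It assumes the `tesseract` executable is installed and on the PATH; the helper names, the file name, and the digit-only whitelist are hypothetical. How well the whitelist is honored has varied between Tesseract versions and engine modes, so check it against the version you install. The .NET ports should let you set the same variable, but check their documentation for the exact call.

```python
import subprocess


def build_tesseract_command(image_path, allowed_chars, lang="eng"):
    """Build the argument list for the tesseract CLI (hypothetical helper).

    "stdout" as the output base makes tesseract print the recognized text
    instead of writing it to a file.
    """
    return [
        "tesseract", image_path, "stdout",
        "-l", lang,
        # Restrict recognition to the given characters only.
        "-c", f"tessedit_char_whitelist={allowed_chars}",
    ]


def read_text(image_path, allowed_chars):
    """Run tesseract and return the recognized text (requires tesseract installed)."""
    result = subprocess.run(
        build_tesseract_command(image_path, allowed_chars),
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.strip()


# Example: an image that should only contain digits (file name is made up).
command = build_tesseract_command("invoice_number.png", "0123456789")
```

With a digit-only whitelist, for instance, a letter "O" in the image can no longer come back as a letter, which removes one common source of confusion.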