Is an API guaranteed, or just speculation? I don't know enough about the topic. Regarding EliteOCR, I understand that it's presently focused on commodities, but I'm trying to think ahead with this.

You may be interested in http://sourceforge.net/projects/capture2text/ which is a very powerful program written in AHK that makes calls to libraries like tesseract and leptonica for OCR. In its current form, the user presses WIN+Q to start drawing a box around part of the screen, and it even shows the parsed text in real time at the top of the screen. A click saves the parsed data to the clipboard, but you can modify the AHK script to do anything you want with it. If you look at the script, it can easily be modified to run entirely from the command line, with args specifying where to grab imagery from the screen, without going through any of the manual selection process. There's also an output folder that lets you see both the original and binarized images from the most recent capture.

I've done some testing, and it's definitely able to get the job done. My problem is that I'm new to all this OCR, so I'm not exactly sure how to train this sucker. I started with https://code.google.com/p/tesseract-ocr/wiki/TrainingTesseract3 but quickly got overwhelmed. seeebek's trainer UI is pretty awesome by comparison. It sounds like the training results from EliteOCR may be usable by Capture2Text, but I just don't know enough right now to be sure.

Of course, all this Capture2Text stuff is moot if seeebek ever decides to provide a simplified version of EliteOCR that just parses a portion of the screen based on coordinate args, rather than trying to solve one whole problem (commodities only) in one chunk. Then I imagine we could still take advantage of his great trainer and have the flexibility to parse anything desired. But maybe I'm oversimplifying things =)
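To make the "coordinate args" idea a bit more concrete, here is a rough Python sketch of the pre-OCR step: take a region given as `x,y,width,height`, crop it out of a grayscale frame, and binarize it. This is only an illustration, not how Capture2Text or EliteOCR actually do it. The function names, the `x,y,width,height` format, and the fixed threshold of 128 are all my own assumptions, and the frame is just a nested list of 0-255 values standing in for a real screenshot.

```python
def parse_region(text):
    """Parse a hypothetical 'x,y,width,height' argument into four ints."""
    parts = text.split(",")
    if len(parts) != 4:
        raise ValueError("expected x,y,width,height")
    x, y, width, height = (int(p) for p in parts)
    if x < 0 or y < 0 or width <= 0 or height <= 0:
        raise ValueError("x and y must be >= 0, width and height must be > 0")
    return x, y, width, height


def crop(pixels, region):
    """Return the sub-grid of a row-major grayscale frame covered by region."""
    x, y, width, height = region
    return [row[x:x + width] for row in pixels[y:y + height]]


def binarize(pixels, threshold=128):
    """Map each grayscale value to 255 (>= threshold) or 0 (below it).

    The fixed threshold is an assumption made for this sketch.
    """
    return [[255 if p >= threshold else 0 for p in row] for row in pixels]


# Tiny stand-in for a captured frame: 3 rows of 4 grayscale pixels.
frame = [
    [10, 200, 30, 250],
    [90, 128, 127, 0],
    [255, 255, 0, 0],
]

region = parse_region("1,0,2,2")   # (1, 0, 2, 2)
patch = crop(frame, region)        # [[200, 30], [128, 127]]
binary = binarize(patch)           # [[255, 0], [255, 0]]
```

The binarized patch is what you would then hand to the OCR engine. Keeping the crop and the threshold as separate steps makes it easy to dump both images for inspection, similar to the original/binarized pair in Capture2Text's output folder.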
It can be done with the options I mentioned above, and I'm sure I can manage it somehow, but I'm hesitant since you're already talking about making changes to your UI, which is to be expected. How would you prefer to digest any info parsed from such a script?
Thinking ahead is good, but we will get an API (at some point) to read all that data. For me, the time needed to support any other OCR would be nearly the same. So I am enhancing TCE in the areas that are not covered by the data collection.