07/12/2014, 11:38 PM
(Edited. 07/12/2014 at 11:42 PM)
1st: WE (the non-tool-creators) have to thank you and all the other tool-creators for putting their free time into this. We are fighting space ships, while you are fighting data.
2nd: if you need more files, just say so (best to PM me). I now have about 4000 files for you to test, and the number is constantly growing.
07/12/2014, 11:39 PM
07/12/2014, 11:52 PM
(Edited. 08/12/2014 at 12:32 AM)
08/12/2014, 12:05 AM
Originally Posted by
When trying to run the BPC Feeder from EliteOCR 0.3.2.1, I am getting the error:
Line 13697 (File
Error: Variable must be of type "Object".
Are you having any issues?
I know that you are still working on it, so if anything, I am reporting this as a possible bug.
Keep up the awesome work!
08/12/2014, 12:19 AM
(Edited. 08/12/2014 at 12:24 AM)
@seeebek: first feedback to your newest version:
1.) The Excel export now contains floating-point values with "." as the decimal delimiter. All values end with ".0", which is quite unnecessary as there are only integers.
That causes problems for those of us (like me) who use Excel to process your CSV output. In some areas of this planet the decimal delimiter is "," and not ".". ;-)
It would be nice if you could remove this again, as it makes no sense anyway.
2.) I guess the BPC Feeder doesn't work (I haven't tested it yet), as there is still the problem with the "sell error". That needs to be fixed by slopey, I think. Am I right?
3.) A small suggestion to improve usability: can you increase the font size of the OCR result fields? They are quite small to read compared to the OCR graphics above them. A bigger display of all these fields would help with identifying mismatches.
And now the major point:
4.) Thank you very much for this tool. It's really awesome and helps A LOT!!!!!
5.) Found a small bug: the setting "Remove duplicates in table" doesn't work any longer.
08/12/2014, 12:27 AM
Yes, but if you look at the .ini file in the settings in the feeder folder, you will see that the file path is wrong.
But I figure it's not really finished yet, so I will probably not play with it until I know more.
08/12/2014, 12:52 AM
(Edited. 08/12/2014 at 1:07 AM)
My Excel export looks OK, with no delimiters at all. Could it be a config setting in your Excel?
EDIT: My Excel export works fine, but if I do a CSV export I get the same result as Conehead, with delimiters.
Excel Export - Fine
CSV Export - Floating/Delimiters
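For what it's worth, here is a minimal sketch of how a CSV export could write whole numbers without the trailing ".0". It assumes the exporter is written in Python and holds the values as floats; `format_cell`, `rows_to_csv`, the ";" delimiter and the sample row are all made up for illustration and are not taken from EliteOCR itself.

```python
import csv
import io


def format_cell(value):
    """Write whole-number floats as plain integers; leave everything else alone."""
    if isinstance(value, float) and value.is_integer():
        return str(int(value))
    return str(value)


def rows_to_csv(rows, delimiter=";"):
    """Serialize rows to CSV text.

    The ';' field delimiter is an assumption: it avoids clashing with ','
    in locales where ',' is the decimal delimiter.
    """
    buffer = io.StringIO()
    writer = csv.writer(buffer, delimiter=delimiter, lineterminator="\n")
    for row in rows:
        writer.writerow([format_cell(cell) for cell in row])
    return buffer.getvalue()


# Hypothetical sample row: commodity name followed by three numeric columns.
sample = [["Gold", 9401.0, 9500.0, 12345.0]]
print(rows_to_csv(sample))
```

With only integers in the file, Excel has no decimal delimiter to misread, whichever regional setting is active.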
08/12/2014, 12:54 AM
I had changed the path already. I get the error with the correct path.
Originally Posted by
08/12/2014, 12:56 AM
Yes, same here, I just noticed.
08/12/2014, 2:27 AM
(Edited. 08/12/2014 at 4:02 AM)
Just a quick update on what all of the data you guys sent in today has helped achieve:
The yellow rows are labels.
So out of 3.3k digits tested, it got 14 of them incorrect. Not too bad of a start, eh? :-)
Shouldn't take much longer to get even more accurate :-)
Edit: I went through the ones it got wrong to see if I could notice a pattern. It turns out it's better at telling 8/9/6 apart than some humans: some of the input files were labeled wrong ;-)
With those relabeled correctly, the real accuracy is starting to get awesome :-)
Wooo .24% error = 99.76% accuracy. That means it only gets one wrong out of ~416 :-)
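The arithmetic behind those numbers can be checked with a few lines. This is only a sketch: the total of 3300 stands in for the "3.3k" quoted above, which is a rounded figure, and the function names are made up for illustration.

```python
def error_stats(wrong, total):
    """Return (error rate, accuracy) as fractions between 0 and 1."""
    error_rate = wrong / total
    return error_rate, 1.0 - error_rate


def one_wrong_in(error_rate):
    """How many digits are read, on average, per single mistake."""
    return 1.0 / error_rate


# First run: 14 wrong out of roughly 3300 digits (3300 is an approximation).
rate, accuracy = error_stats(14, 3300)
print(f"{rate:.2%} error, {accuracy:.2%} accuracy")

# After relabeling: 0.24% error, i.e. one mistake in roughly 416 to 417 digits.
print(f"one wrong out of about {one_wrong_in(0.0024):.1f}")
```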
Yeah, there are definitely going to be some incorrect entries in the OCR stuff...
08/12/2014, 4:14 AM
(Edited. 08/12/2014 at 4:20 AM)
This is the k-nearest neighbors algorithm, which is fast to train but really susceptible to noise, so it's a great place to start to get a feel for data cohesion/cleanliness.
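For anyone curious what k-nearest looks like in practice, here is a toy sketch. It is not the actual classifier: the 3x3 glyphs, the Hamming distance and the names `hamming` and `knn_predict` are all assumptions picked to keep the example tiny.

```python
from collections import Counter


def hamming(a, b):
    """Count the pixels that differ between two equally sized bitmaps."""
    return sum(x != y for x, y in zip(a, b))


def knn_predict(train, sample, k=3):
    """Label a sample by majority vote among its k closest training bitmaps.

    train is a list of (pixels, label) pairs. "Training" is just storing
    them, which is why k-nearest is fast to set up.
    """
    nearest = sorted(train, key=lambda item: hamming(item[0], sample))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


# Toy 3x3 glyphs, flattened row by row (made up for illustration).
ZERO = (1, 1, 1, 1, 0, 1, 1, 1, 1)
ONE = (0, 1, 0, 0, 1, 0, 0, 1, 0)

train = [
    (ZERO, "0"),
    ((1, 1, 1, 1, 0, 1, 1, 1, 0), "0"),  # zero with one pixel flipped
    ((0, 1, 1, 1, 0, 1, 1, 1, 1), "0"),  # zero with one pixel flipped
    (ONE, "1"),
    ((0, 1, 0, 0, 1, 0, 0, 1, 1), "1"),  # one with one pixel flipped
    ((1, 1, 0, 0, 1, 0, 0, 1, 0), "1"),  # one with one pixel flipped
]

noisy_zero = (1, 1, 1, 1, 0, 1, 0, 1, 1)
print(knn_predict(train, noisy_zero))
```

Because every vote comes straight from a stored example, a single mislabeled training image can flip nearby predictions, which is the noise sensitivity mentioned above.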
The plan, if anyone is interested, is to first get a good accuracy rate with k-nearest (which we are quickly approaching), and then transition over to a pure neural net or, if I have time, preferably a convolutional neural net, both of which are much more noise tolerant. We'd have the most advanced video game OCR market data scraping program in the world hahaha :-)
And with an accuracy rate of >= 99.9%, we'd be in the area of trusting enough to have it automatically recognize
Here are some more training images.
- - - - - Additional Content Posted / Auto Merge - - - - -
Originally Posted by
Any chance that exporting to CSV is triggering decimals to be turned on in Excel? Can you go to the formatting options in Excel and turn off decimal places?