T O P I C R E V I E W |
JohnnyT |
Posted - Aug 28 2014 : 11:23:01 I am writing out pages of PDF's to BMP's. I've automated the OCR process, but to avoid producing "junk" text, I'd like to be able to scrap a page which contains a photograph, or preferably ignore embedded photographs and capture only text for that page.
Is there a method for scanning an image, discerning photos/testing line by line and then only OCRing the "valid" areas of text?
TIA,
John |
2 L A T E S T R E P L I E S (Newest First) |
JohnnyT |
Posted - Sep 04 2014 : 07:55:55 Thanks Nigel.
I'll give that a try. John |
xequte |
Posted - Sep 03 2014 : 19:18:52 Hi John
No, I'm afraid there is not a specfic function for that. You may be able to use getRegions and ignore pages that return 0 regions.
Nigel Xequte Software www.xequte.com nigel@xequte.com
|
|
|