ImageEn for Delphi and C++ Builder

 

ImageEn Forum
 Scanned Document PDF

coderboy

Canada
2 Posts

Posted - Oct 24 2023 : 08:15:14
Is there a way to determine if a page in a PDF is an image from a scanned paper document?

aleatprog

132 Posts

Posted - Oct 24 2023 : 10:25:08
Hi coder,

Try extracting the textual content using TIEPdfViewerInteraction.GetText. If the page has no textual content, it may be a scan.

Ale
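
A minimal sketch of the check suggested above, in C++ since ImageEn also targets C++ Builder. It assumes the page text has already been extracted into a std::string, for example with TIEPdfViewerInteraction.GetText; that call is not shown because its exact signature is not given in this thread. The helper names hasMeaningfulText and mayBeUnrecognizedScan are hypothetical.

```cpp
#include <algorithm>
#include <cctype>
#include <string>

// Hypothetical helper: true if the extracted page text contains at least
// one non-whitespace character.
bool hasMeaningfulText(const std::string& pageText) {
    return std::any_of(pageText.begin(), pageText.end(),
                       [](unsigned char ch) { return !std::isspace(ch); });
}

// A page without meaningful text *may* be a scan. It could equally be a
// blank page or a page made only of vector drawings, so treat the result
// as a hint, not a verdict.
bool mayBeUnrecognizedScan(const std::string& pageText) {
    return !hasMeaningfulText(pageText);
}
```

Ignoring whitespace matters because text extraction can return only line breaks or spaces for a page that has no real text.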

coderboy

Canada
2 Posts

Posted - Oct 24 2023 : 13:22:04
Hi Ale,

Some PDFs have text embedded in the page along with the image, so that wouldn't work. I need to check whether the page contains an image so that it can be flagged for someone to review the textual content or to add it.

Thanks.

xequte

38608 Posts

Posted - Oct 25 2023 : 00:25:48
Hi

If you email me, you can test a beta that lets you retrieve a list of all the objects on the page (just the type of each object). That would tell you whether the page contains any images, but nothing more than that.

Nigel
Xequte Software
www.imageen.com
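
A sketch of how a list of object types could be used to flag pages for review. The enum loosely mirrors PDFium's page-object type constants (FPDF_PAGEOBJ_TEXT, FPDF_PAGEOBJ_PATH, FPDF_PAGEOBJ_IMAGE, FPDF_PAGEOBJ_SHADING, FPDF_PAGEOBJ_FORM). How the beta actually exposes the list is not described in this thread, so the std::vector input and every name below are assumptions for illustration only.

```cpp
#include <algorithm>
#include <vector>

// Assumed stand-in for the per-object types reported for one page.
enum class PageObjectType { Unknown, Text, Path, Image, Shading, Form };

enum class PageKind {
    NoImage,        // no image objects on the page
    ImageOnly,      // image(s) but no text: possibly a scan without OCR
    ImageWithText   // image(s) plus text: possibly an OCRed scan
};

// Classify a page purely from the types of its objects.
PageKind classifyPage(const std::vector<PageObjectType>& types) {
    const bool hasImage =
        std::find(types.begin(), types.end(), PageObjectType::Image) != types.end();
    const bool hasText =
        std::find(types.begin(), types.end(), PageObjectType::Text) != types.end();

    if (!hasImage) return PageKind::NoImage;
    return hasText ? PageKind::ImageWithText : PageKind::ImageOnly;
}

// Flag every page that contains an image so a person can review or add
// the textual content.
bool needsReview(const std::vector<PageObjectType>& types) {
    return classifyPage(types) != PageKind::NoImage;
}
```

Since only object types are available, an image-bearing page is not necessarily a scan (it may just hold a logo or photo), which is why the sketch flags pages for human review instead of deciding on its own.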

aleatprog

132 Posts

Posted - Oct 25 2023 : 08:53:13
That would be a nice feature for the next official update. For multilayer PDFs, maybe in the future it could also indicate which layer the image is in, so that for OCRed scans the images lying above or below the text can be processed separately.

Ale

xequte

38608 Posts

Posted - Oct 25 2023 : 23:09:45
Hi Ale

We'll see how it goes, but we will be limited by the functionality that PDFium provides.

Nigel
Xequte Software
www.imageen.com