Jayanthi Krishnaraj (JayanthiK4197)
Bank Of America
Software Architect
Bank Of America
JayanthiK4197 Member since 2018 8 posts
Bank Of America
Posted: August 23, 2019
Last activity: August 23, 2019
Posted: 23 Aug 2019 10:28 EDT
Last activity: 23 Aug 2019 17:00 EDT

Can Document OCR not get text out of a scanned pdf?

I have tried processToXml and ProcessToPdf and tried putting ProcessToPdf before each of these and thried everything with and without ocrImagesAndText being true. everything just returns false. I am trying to get text out of a pdf produced by scanning a paper document, but there are even some pdfs the regular pdf connector can read that document ocr cannot, unless I just cannot sort out how to use it. I can make it get text from images in word documents and it can get text out of a pdf I make by doing a print to pdf, so I know I am not doing everything wrong. can this component actually not get text from a scanned pdf?

Robotic Process Automation
Moderation Team has archived post, This thread is closed to future replies. Content and links will no longer be updated. If you have the same/similar Question, please write a new Question.