Pega IVA : Email Attachments : OCR extract from Word, Excel, Powerpoint filetypes

Question

SARANGAN

Member since 2013

8 posts

Wipro Technologies

Posted: Apr 8, 2019

Last activity: Apr 15, 2019

Posted: 8 Apr 2019 10:29 EDT
Last activity: 15 Apr 2019 8:31 EDT

Closed

Pega IVA : Email Attachments : OCR extract from Word, Excel, Powerpoint filetypes

Report

As per OCR support article, it is possible to extract data from email attachments across several filetypes, including PDFs.

Is it possible to extract data from Word, Excel, Powerpoint and other office documents ?

https://community1.pega.com/exchange/components/pega-ocr

To see attachments, please log in.

Pega Intelligent Virtual Assistant

Like (0)
Share this page Facebook Twitter LinkedIn Email Copying... Copied!

Posted: 5 years ago

Posted: 15 Apr 2019 8:31 EDT

MariuszGrabowski

PEGA

replied to SARANGAN

Report

Dear Sarangan,

You may want to take a look on pySetTextExtractionCapabilities which controls document types. This is not limited to OCR component, in other words if you want to retrieve text from docx or xlsx files you may do it.

However if your scenario is that you have in a xlsx document a picture which you want to OCR then it is not supported. OCR component works with images (e.g. jpg) and as a container for images it can process pdf files.

Hope it helps.

Best regards, Mariusz

To see attachments, please log in.

Like (0)

Get Started with Community

Question

Pega IVA : Email Attachments : OCR extract from Word, Excel, Powerpoint filetypes

Need help or want to help others?

Experience the benefits of Support Center when you log in.

Question

Pega IVA : Email Attachments : OCR extract from Word, Excel, Powerpoint filetypes

Related content:

Need help or want to help others?

Experience the benefits of Support Center when you log in.

We'd prefer it if you saw us at our best.