Posted: 9 Oct 2017 14:07 EDT Last activity: 13 Oct 2017 5:14 EDT
Parsing Excel from stream
I have an excel saved in DB, while processing it is moved to clipboard. I am trying to parse the excel stream but MSOParseExcelFile takes filename. Is there any OOTB option to parse an Excel stream. I tweaked the MSOParselExcel file to take ByteInputStream but its not working.
Yes its saved in DB. The reason to do it is to have this file available to agents running on all nodes. Due to security we cannot have shared folders between nodes. Please consider a node as 1 physical node, not a virtual node.
When you say it isn't working - do you receive an error that you could share here ? (Is the Binay BLOB not being found, or not being decoded correctly or is the File showing as corrupt or something else?)
I have attached the document which explains what I am doing...I have couple of other performance questions revolving this POC.
Task 1 : Make Excel stream parsing using OOTB.
Task 2 : If excel has 40,000 rows it would be better I load first 500 rows, process them and load next 500. This would reduce the memory usage and ensures it works fine on multiple file processings.
This post is for task 1. Once we figure out the solution for task 1 next work will be on task 2. We have third party libraries like Apache POI which can do both task 1 and task 2. Instead of adding these libraries and build a custom solution in Pega I am thinking to expose Apache POI solution as an integration point for Pega.
Before that I am researching if there is any feasible option in Pega. If Task 1 works, there is a chance to request users to break down the large excel file as an alternate solution for Task2.