Posted: 12 Aug 2019 13:58 EDT Last activity: 15 Aug 2019 22:30 EDT
What image type does the PDF connector look for?
I have a situation where the images in PDF documents are not within the margins of the page. Because of this, when running document OCR, the tool is only OCRing what is visible. However, if I copy and paste the image out of the PDF into MS Paint or similar, the whole image appears. Therefore I am looking to use the PDF connectors GetImages method, to extract a collection of images from the PDF and then run the Document OCR tool over this collection. However, when I run GetImages or Imagecount, it always return without any images. When I run the same PDF through the OCR tool, and set the property to only OCR images, it finds the images. Therefore we believe that the GetImages method only looks for specific image types and the documentOCR component looks for a broader type of images.