Read multiple PDFs in different format


#1

Hi All,

Recently I went through article regarding reading PDFs using Xpaths. I wanted to check if there is any option to read pdfs in loop which are in different format (changing invoices).

Thanks,
Narendra


#2

Hi @narendra_purTG

I believe your invoice PDFs are plain scanned images so XPath will not help for reading them. You can use Xpath when you have the detailed XML information in the PDF.
For plain images you need to use the OCR functionality. If you have few invoice formats that dont change you could program each format to be read via OCR. If the formats are changing and you have many variations (which is normally the case in invoice processing) the programming of each one of them will be a lot of work and I would not recommend it.


#3

Hi Tim,

My PDFs are text invoices.

Thanks,
Narendra