OCR accuracy when reading PDF

ocr

#1

Hi
I am using OCR to read a value in a PDF file. The value is “L00”, but OCR returns “LOO”. Is there any way to correct this?

Thanks!


#3

Can you check with a number datatype instead of a String?
Thanks


#4

Hi @sharad_kumar, the OCR result can only be stored in a string variable.


#5

What should the result look like?

L zero zero?


#6

Yes, L zero zero


#7

Oh, I haven't worked with this. Thanks for the information. If I want to extract data from a PDF that contains numbers, what should I do? Do I still have to use a String in that case?


#8

@timriewe,

In your case I'd recommend adding a “post-processing” step: a Replace Text action that turns the letter O into the digit 0.
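
Outside the tool, the same idea could be sketched in Python roughly like this. This is only an illustration, not the Replace Text action itself: the function name `fix_zeroes` and the pattern (one uppercase letter followed by exactly two digits) are my assumptions based on the “L00” example, so adjust them to your real value format.

```python
import re

# Assumed format: one uppercase letter followed by two characters that
# should be digits but may have been read by OCR as the letter O.
CODE_PATTERN = re.compile(r"\b([A-Z])([0-9Oo]{2})\b")


def fix_zeroes(text: str) -> str:
    """Replace the letter O with the digit 0 inside tokens matching the pattern."""

    def repair(match: re.Match) -> str:
        prefix, digits = match.group(1), match.group(2)
        # Only the part that is supposed to be numeric gets corrected.
        return prefix + digits.replace("O", "0").replace("o", "0")

    return CODE_PATTERN.sub(repair, text)


print(fix_zeroes("Value: LOO"))
```

Restricting the replacement to tokens of the expected shape keeps longer words such as “LOOK” untouched, but a real three-letter word like “TOO” would still be changed, so it is safer to run this only on the extracted field rather than on the whole page.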


#9

Yes, that’s what I did, but I thought there would be some other solution.
Thanks!


#10

More intelligent OCR tuning and post-processing are available in the SPA product.


#11

You extract the value as a string and then convert that variable into a number, for example via a clipboard operation. Before converting, trim any whitespace and make sure the extracted string contains nothing but digits.
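
As a rough illustration of that check in Python (not the clipboard operation itself; the function name `to_number` is made up for this example, and it assumes whole numbers only):

```python
import re


def to_number(raw: str) -> int:
    """Trim whitespace, verify the string is digits only, then convert."""
    cleaned = raw.strip()
    # Reject anything that is not purely ASCII digits, e.g. an O misread for 0.
    if not re.fullmatch(r"[0-9]+", cleaned):
        raise ValueError(f"not a plain number: {raw!r}")
    return int(cleaned)


print(to_number("  42 "))
```

Failing loudly on a value like “4O2” makes OCR misreads visible instead of silently producing a wrong number.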


#12

Thanks for this.