Unable to capture text in PDFs for some fields when the field falls on the next page.

sheebanancy90

New Member
The PDFs all use the same template. I'm reading the text with OCR, and it captures everything correctly except in some PDFs where a field falls on the next page (the field order stays the same). I'm trying to capture FirstName, LastName and Email, and in those PDFs the Email falls on the following page, still in the same order of fields.

I'm thinking of trying the Relative (parent) method, where the first name would be the parent of the corresponding field. Will that work even if the field's position shifts to the next page?
 

VJR

Well-Known Member
Hi sheebanancy90,

It looks like you are using Region Mode. If the Email is on the next page, why aren't you spying the Email from the next page? Perhaps you don't know in advance whether the Email falls on the same page or the next.

As a sample, suppose you have the data in the following format:
Email Label: Email Text containing the address

then you can read the Email Label portion once you are done with the LastName, and check what text Read Text with OCR retrieves from it. If it retrieves the text "Email Label", the Email is on the same page and you can now fetch the 'Email Text containing the address' portion.
If it is not "Email Label", then do the same check at the beginning of the next page.
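The check above can be sketched in Python, purely as an illustration of the logic and not of any tool's API. Everything here is an assumption: `read_region` is a hypothetical stand-in for the OCR read of one spied region on one page, and the region names are placeholders for whatever regions you define.

```python
from typing import Callable, Optional

EMAIL_LABEL = "Email Label"  # the label text expected next to the email value


def locate_email(read_region: Callable[[int, str], str], page: int) -> Optional[str]:
    """Return the email text, trying the current page first, then the next one.

    read_region(page_no, region_name) is a hypothetical OCR read of one region.
    """
    # Slot 1: right after LastName on the current page.
    # Slot 2: at the top of the following page.
    candidates = [
        (page, "label_after_last_name", "value_after_last_name"),
        (page + 1, "label_top_of_page", "value_top_of_page"),
    ]
    for page_no, label_region, value_region in candidates:
        label = read_region(page_no, label_region).strip()
        # Compare case-insensitively, since OCR output can vary in casing.
        if label.casefold().startswith(EMAIL_LABEL.casefold()):
            return read_region(page_no, value_region).strip()
    return None  # label found in neither slot; the layout differs from the template
```

In the real workflow this would be a decision on the OCR result rather than code, but the order of checks is the same: label on the current page first, then the top of the next page.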

If this is not workable on your PDF, you might have to try something similar, adapted to the PDF format you have, since Region Mode generally requires the data to be in the same format/order.
 