Sundeep4sandy

New Member
Hello,

*****I'm trying to OCR pdf in region mode and I kept it as size to fit(it should be sized to fit in ADOBE READER for development purpose) as i need to take lots of data for example :the name of the company region.
example:
The actual pdf contains the name as : Americhem Inc.
I'm getting the name as : Amedchem Inc.

****I tried using page segmentation, scale and tried various combinations but I'm not getting the data which is present in the pdf

**** How to read Bolded data like this BILL TO becomes as BI.L TO
 
Last edited:

cs.andras

Active Member
Hi,
OCR certainly has its limitations. Especially when it comes to those characters. You might be stuck because of the font you are using. Is font-smoothing turned off on the computer where the script is running? That might enhance the results. Also, I see you tried to modify scale... that is a good starting point but don't be afraid to experiment on higher (24-32+) settings as well. I had some successes with this before.
 

Sundeep4sandy

New Member
Hello guys,

I've attached the Reader where i'm trying to send all kinds of page segmentations and scale. But i'm unable to get accurate results..any help would be a major help in this
 

Attachments

  • reader.PNG
    reader.PNG
    33.6 KB · Views: 212
  • page segmentation.PNG
    page segmentation.PNG
    9.8 KB · Views: 209

cs.andras

Active Member
To be honest I don't know what page segmentation is, probably a new function of BluePrism's OCR capability. I'm sadly using 4.2, due to be upgraded here in my company. Don't give up, someone ought to be able to help out.
 

RPA_1

Member
If it's a true PDF you can also use send keys to get the data and extract details accordingly using in string function.
Also, not sure but v6.2 has added some conginite capabilities in BP (customer made vbos) which you can leverage for this purpose .
 
Top