Removing sensitive content from PDFs

gil.silva

Active Member
Hello Hema_rao,

Questions:
1. How would you do it manually?
2. Can the PDF be converted to Word? (without losing the format)
3. Can the text be extracted and then create a new PDF from it?
4. Do the PDF contain images and other content which can't be converted to plain text?
 

Sachin_Kharmale

Active Member
1)PDF file has abc@gmail.com, we should hide ***@gmail.com
2)We should also hide URL's
3)Company logo

What procedure we have to use to implement this in blueprism?
You need to convert into Word and then you can replace the text.
Or there is another lib which will work on pdf like itextsharp you can search for that also may be it will help you.
 
Top