Redaction using the Adobe PDF Java Toolkit

Sample of the Week:

Joel Geraci“Redaction” is a legal term of art that means to obfuscate parts of a document. In legal proceedings, relevant documents must be disclosed between litigants. However, some documents, or even parts of documents, contain references (names, numbers, or other information) that are is not subject to disclosure. Trade secrets, social security numbers of non-relevant individuals, the names of minors, and some confidential and non-relevant medical information are all commonly redacted from evidentiary documents.

Redacted-Document

Redacting paper documents is pretty straightforward; you grab a Sharpie and start crossing out text. When you’re done, you photocopy the paper and you’re good to go.
Redacting electronic documents can be just as easy… if you’re using the right tools… and you use them correctly. Continue reading

Extracting Images using the Adobe PDF Java Toolkit

Sample of the Week:

Joel GeraciExtracting images from PDF files was once thought impossible. As a matter of fact, there was a time when PDF was considered the “Roach Motel of file formats;” information went in but it never came out. That was never actually true… but the phrase was so pithy that PDF’s reputation as being static and locked caught on. But as I said, nothing could be further from the truth. There are many tools available that can extract text, convert PDF to other formats like .DOCX or .SVG and PDF can be placed into other layout applications like InDesign. This article will focus on images.
Continue reading

HSM Certification of PDF with the Adobe PDF Java Toolkit

Sample of the Week:

Joel GeraciInformation Assurance is at the top of mind for every developer and IT manager these days so certifying your PDF files is more important than ever. Many business transactions in regulated industries, like financial services, pharmaceuticals, manufacturing, and governmental organizations, require a high level of assurance when documents are distributed electronically. Information Assurance, at least where PDF is concerned has two primary components, document authenticity and document integrity. Basically, did the document come from the organization that it claims to come from and can you confirm that it has not been modified in transit?

Adobe Acrobat allows authors to manually “certify” a document with a hidden digital signature so their recipients can verify it’s authenticity without modifying the appearance of the page. This can be critical for evidentiary documentation but is also important for branded documents, bank statements, regulations etc.

Continue reading

AddImages: A New Service for the Datalogics PDF WebAPI

We continue to add capabilities to the Datalogics PDF WebAPI  with the new AddImages service. AddImages does exactly what it says it does; it allows developers to programmatically add any number of images to any number of pages in a PDF file.

tripleframepreview-fThe AddImages service can be used in conjunction with other services like FillForm to create customized, graphically rich, documents on demand that combine static content,  images, and data such as invoices, advertisements, and receipts.

Continue reading