Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.
Info

The PDF Extractor transformation plugin is available in the Hub.

This transformation leverages the Apache PDF Box library to extract text and metadata from a PDF file. It is usually used in conjunction with the Whole File Reader plugin since it requires the entire contents of the PDF to be loaded into a single message and passed into the transform. Due to this, there may be memory issues when loading extremely large PDF files.

...