Pdf

This filter extracts plain text from a PDF file.
It is based on the ledongthuc/pdf library.

Parameters

| Parameter | Type | Default | Description |
|-----------|------|---------|-------------|
| target | STRING | "main" | The field of the Message to use as the filter's input (either the main field or an extra field) |
| filename | STRING | empty | The filename of the PDF file to parse (supports Golang templates) |

... | pdf(target="{{ .extra_field }}") | ...
The filename parameter overrides target. The two are mutually exclusive, so specify only one of them.
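
Since filename supports Golang templates, the following minimal sketch shows how such a template might be expanded using Go's standard text/template package. The helper renderFilename, the field map, and the /tmp path are hypothetical names chosen for illustration only; the filter's real internals and the exact data it exposes to templates may differ.

```go
package main

import (
	"fmt"
	"strings"
	"text/template"
)

// renderFilename is a hypothetical helper (not part of the filter's API).
// It expands a Go template against a map of message fields and returns
// the resulting filename.
func renderFilename(tmpl string, fields map[string]string) (string, error) {
	// missingkey=error makes a reference to an unknown field fail
	// instead of silently producing an empty string.
	t, err := template.New("filename").Option("missingkey=error").Parse(tmpl)
	if err != nil {
		return "", err
	}
	var sb strings.Builder
	if err := t.Execute(&sb, fields); err != nil {
		return "", err
	}
	return sb.String(), nil
}

func main() {
	// Assumed message data: a single extra field named "extra_field".
	fields := map[string]string{"extra_field": "report"}

	name, err := renderFilename("/tmp/{{ .extra_field }}.pdf", fields)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	fmt.Println(name) // prints "/tmp/report.pdf"
}
```

In this sketch, a template that refers to a field missing from the map returns an error rather than an empty filename, which makes misconfigured pipelines easier to spot.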

Output

The propagated Message contains the plain text of the input PDF file (fulltext will be set to the file name received as input).

Examples
