This filter allows you to extract plain text from a PDF file.
Based on the ledongthuc/pdf library.
Parameters
Parameter | Type | Default | Description |
---|---|---|---|
target | STRING | “main” | the field of the Message that should be used for the filter (it could be the main or and extra field) |
filename | STRING | empty | the filename of the PDF file to parse (supports Golang templates) |
... | pdf(target="{{ .extra_field }}") | ...
The
filename
field override the target
. They are mutually exclusive, so you can specify only one of them.Output
The propagated Message contains the plain text of the input PDF file (fulltext
will be set to the file name received as input).
Examples
Soon…