Working with Templates Leave feedback

Regex-based fields

Use TemplateRegexPosition to locate values by pattern and extract only the matched group:

Python

from groupdocs.parser import Parser
from groupdocs.parser.templates import Template, TemplateField, TemplateRegexPosition

template = Template([
    TemplateField(
        TemplateRegexPosition(r"Invoice Number\s+(?<value>[A-Z0-9\-]+)"),
        "InvoiceNumber",
    )
])

with Parser("./invoice.pdf") as parser:
    data = parser.parse_by_template(template)
    if data:
        invoice_number = data["InvoiceNumber"].page_area.text
        print(f"Invoice number: {invoice_number}")

invoice.pdf

The following sample file is used in this example: invoice.pdf

Tables with detector parameters

Define table bounds and column separators with TemplateTableParameters when you need to extract line items:

Python

from groupdocs.parser import Parser
from groupdocs.parser.data import Rectangle, Point, Size
from groupdocs.parser.templates import (
    Template,
    TemplateItem,
    TemplateTable,
    TemplateTableParameters,
)

table_area = Rectangle(Point(175.0, 350.0), Size(400.0, 200.0))
columns = [185.0, 370.0, 425.0, 485.0, 545.0]

table = TemplateTable(
    TemplateTableParameters(table_area, columns),
    "Details",
    0,  # restrict to the first page; omit to scan all pages
)

template = Template([table])

with Parser("./invoice.pdf") as parser:
    data = parser.parse_by_template(template)
    if data:
        details = data["Details"].page_area
        print(f"Rows extracted: {details.row_count}")

invoice.pdf

The following sample file is used in this example: invoice.pdf

Tips

Combine regex, fixed, and linked positions in one template to anchor values reliably.
Keep field names unique (case-insensitive).
Reuse TemplateTableLayout when the same table structure appears on multiple pages—see the API reference for layout helpers.
If a template item cannot be located, the corresponding field/table is empty; handle None in your code.

We value your opinion. Your feedback will help us improve our documentation.

Working with Templates Leave feedback

On this page

Regex-based fields

Tables with detector parameters

Tips

Was this page helpful?

Any additional feedback you'd like to share with us?

Please tell us how we can improve this page.

Thank you for your feedback!

On this page