Executive Summary
This briefing details PDF/A-3, the third part of the ISO 19005 standard for long-term PDF archiving, and its transformative role in electronic invoicing and tax compliance. PDF/A-3’s key innovation is its ability to embed files of any format (e.g., XML, CSV, Excel) directly within a PDF/A document. This enables “hybrid” documents, particularly invoices, that are both human-readable (the PDF visual layer) and machine-readable (the embedded structured data). This dual capability is crucial for meeting evolving e-invoicing mandates, especially in the European Union with standards like ZUGFeRD and Factur-X, which embed EN 16931-compliant XML within a PDF/A-3 file. While offering significant benefits in compliance, automation, and long-term preservation, organizations must consider potential risks like embedded format obsolescence and ensure rigorous validation. PDF/A-3 acts as a critical bridge between traditional document formats and automated data processing in a compliance-driven world.
- Introduction to PDF/A-3
PDF/A-3 is the third iteration of the ISO 19005 standard, specifically designed for long-term electronic document preservation. Building upon earlier PDF/A versions, PDF/A-3 introduced a “groundbreaking feature: the ability to embed files of any format (XML, CSV, Excel, etc.) directly inside a PDF/A document.” This means a single PDF/A-3 file can serve a “dual purpose,” holding both a visually consistent, human-readable document and supplementary, machine-readable data. This “hybrid” approach is particularly pivotal in the realm of electronic invoicing and tax compliance, where documents must satisfy both business readability and automated data processing requirements.
- Evolution of PDF/A Standards: Context for PDF/A-3
PDF/A (“PDF/Archive”) is a family of ISO standards dedicated to ensuring that digital documents can be reproduced accurately and consistently decades into the future. Each version builds upon its predecessors while adhering to core archival principles like self-containment (embedding all necessary fonts, colors, etc.) and prohibiting features unsuitable for long-term archiving (e.g., encryption, external references).
- PDF/A-1 (2005): The first and most restrictive version, based on PDF 1.4. It disallowed modern features like JPEG2000 images, transparency, and any attachments. Its primary focus was on ensuring the PDF stood alone.
- PDF/A-2 (2011): Based on PDF 1.7, this version expanded capabilities, allowing transparency, layers, JPEG2000 compression, and digital signatures. Crucially, it permitted embedding other PDF/A-compliant files as attachments, enabling the bundling of related PDF/A documents.
- PDF/A-3 (2012): Based on the same PDF 1.7 baseline as PDF/A-2, PDF/A-3’s “only change is controversial but transformative: support for embedding files of any type (XML, Excel, etc.) inside the PDF/A document.” This was a direct response to a “desire to have a machine-readable component” within archived documents, aligning with real-world business needs to preserve original source data alongside the final document.
- PDF/A-4 (2020): The latest version, based on PDF 2.0, also allows embedded files (PDF/A-4f) and introduces new conformance levels. However, as of now, PDF/A-3 remains the “prevalent format for hybrid e-documents” due to established standards and widespread software support built around its 2012-2025 era use cases.
- Key Features and the “Hybrid” Concept
The defining characteristic of PDF/A-3 is its “Any file format as attachment” capability. Unlike its predecessors, it permits embedding files of any type (e.g., XML, CSV, Word, Excel), creating a “self-contained document that holds the visual content and supplementary data together for future-proof archiving.”
This capability directly enables the “hybrid” document concept. For example, in e-invoicing, a hybrid invoice means “the method of embedding structured electronic invoice data inside a PDF using the PDF/A-3 format.” This allows the PDF to serve as the “legible invoice for people, while the attached XML enables automated processing by software.” The embedded XML is typically linked as an “Associated Files” entry, often tagged with a relationship like “Data” or “Alternative,” making it visible in the attachments panel of compliant PDF viewers.
- Pivotal Role in E-Invoicing and Tax Compliance
PDF/A-3 has become a cornerstone for electronic invoicing, particularly in the EU, by allowing a single file to satisfy both human readability and machine processing requirements.
4.1. Hybrid E-Invoices: ZUGFeRD & Factur-X
European e-invoicing standards like Germany’s ZUGFeRD (2014) and the joint Franco-German Factur-X explicitly utilize PDF/A-3. These formats combine a visual PDF invoice with an embedded XML file that adheres to the EN 16931 EU standard for semantic data models.
- A Factur-X invoice is defined as “a file in PDF/A-3 format… both the directly readable representation of the invoice and the envelope for the structured data file.”
- This approach is lauded for its “transition-friendliness,” allowing recipients who are not yet fully automated to read the PDF, while automated systems can ingest the XML.
4.2. EU Regulatory Context and ViDA
The EU has been a primary driver for hybrid e-invoices:
- Directive 2014/55/EU mandated that EU public administrations accept invoices compliant with EN 16931. While the directive primarily specified UBL or UN/CEFACT CII XML as syntaxes, the PDF/A-3 approach, by embedding compliant XML, effectively meets the requirement.
- Looking ahead, the “VAT in the Digital Age (ViDA)” proposal (2022) will redefine e-invoicing to mean “issued in a structured electronic format that allows for automatic processing,” effectively disqualifying traditional unstructured PDFs. However, PDF/A-3 hybrid invoices are compliant as “the embedded XML is fully machine-readable.” This positions PDF/A-3 as a crucial “compliance path that combines structure with the familiar PDF format.”
4.3. Global Usage and Compliance
While prominently used in the EU, PDF/A-3’s utility extends globally:
- In Saudi Arabia, e-invoicing regulations require a human-readable invoice alongside a compliant XML; some solutions generate PDF/A-3 files with embedded XML to satisfy this.
- In Latin America, where pure XML invoices are prevalent, some businesses use PDF/A-3 to attach the mandated XML to a PDF for internal archiving or customer convenience.
- In the U.S., sectors like pharmaceuticals and government archives (e.g., NARA) use PDF/A-3 for forms and records management.
4.4. Ensuring Compliance and Auditability
A critical aspect of PDF/A-3 for invoices is the mandated consistency between the PDF content and the embedded data. Tax authorities expect the human-readable invoice to match the XML precisely, ensuring the “PDF can serve as a legally valid original invoice copy.” This facilitates a robust audit trail, allowing auditors to “open the PDF to see the invoice exactly as issued, and also extract the XML to run it through validation tools or cross-check calculations.”
Furthermore, PDF/A-3 supports digital signatures (PAdES), allowing a single signature to cover the entire file, including all attachments. This provides integrity and authenticity guarantees, crucial for legal acceptance.
- Business Benefits of PDF/A-3
PDF/A-3 offers compelling advantages for document workflows, particularly in financial contexts:
- Single-File Convenience: It ensures that “the data ‘travels with’ the document as an atomic unit,” eliminating the risk of losing or mismatching separate human-readable and machine-readable files.
- Human & Machine Synergy: Business users can view the familiar PDF, while “back-end systems can parse the attached XML for automatic booking, validation, or reporting,” reducing manual re-keying and errors.
- Regulatory Compliance: It helps meet both long-term archiving and e-invoicing regulations simultaneously, as “PDF/A-3, being an ISO-standard archival PDF, meets stringent record-keeping rules,” while the embedded XML provides data for tax authorities.
- Long-Term Preservation & Future-Proofing: PDF/A standards ensure consistent document reproduction for decades. By embedding standard formats like EN 16931-compliant XML, the data also remains interpretable long-term.
- Enhanced Workflows and Integration: It streamlines processes, for instance, by allowing accounts payable systems to automatically extract XML for ERP integration, “reducing manual typing and errors.”
- Packing Additional Related Documents: While primarily used for invoice XML, PDF/A-3 allows embedding multiple files, enabling a “complete record set together” within one PDF package.
This synergy bridges the gap between unstructured documents and pure data streams, enabling digital invoices that are “Visible and readable by both machines and people.”
- Risks and Considerations
While beneficial, organizations must address potential challenges:
- Format Obsolescence of Attachments: The PDF/A standard doesn’t guarantee the long-term readability of embedded files. Best practice is to “use widely adopted, standard formats (XML, CSV, plain text, etc.)” to mitigate this risk.
- Validation and Standards Compliance: Creating truly compliant PDF/A-3 files requires careful validation. Tools like PDF Tools AG 3-Heights® PDF Validator or open-source veraPDF are recommended to ensure both PDF/A-3 conformance and XML schema compliance (e.g., EN 16931).
- File Size Consideration: Embedding large files can increase PDF size, potentially impacting email or storage. For typical XML invoice data, this is usually negligible.
- Tooling and Training: Users and systems need to be equipped to handle embedded files (e.g., accessing attachments panels in PDF viewers). “As hybrid invoices become more widespread, tool support is becoming standard.”
- Security Considerations: While PDF/A-3 prohibits launching executables, general cybersecurity practices (e.g., antivirus scanning) should apply to extracted attachments, as with any external files.
- Future Trends
While PDF/A-4 (based on PDF 2.0) also supports embedded files, PDF/A-3 remains dominant for hybrid e-documents due to established software and standards. PDF/A-3 is expected to “play a key role in compliance for the next several years, particularly as countries implement new e-invoicing mandates and need a bridge from paper/pdf to full digital.”
- Conclusion
PDF/A-3 is a powerful standard that harmonizes human-readable documents with machine-processable data. Its capacity to embed any file type, particularly structured XML, has made it indispensable for modern e-invoicing and tax compliance, exemplified by hybrid invoice standards like ZUGFeRD and Factur-X. By enabling businesses to meet stringent regulatory requirements for both archiving and data reporting within a single, secure, and user-friendly format, PDF/A-3 serves as a vital enabler for digital transformation in global commerce and tax administration.
Other articles in this serie of ”E-Invoicing and E-Reporting Explained”
- Clearance Models
- EN16931 – European E-Invoicing Standard
- X‑Rechnung Invoices in Germany
- ZUGFeRD Invoices in Germany
- Factur-X in France

Article
PDF/A‑3: The Hybrid PDF Standard for Data-Embedded, Archivable Documents
PDF/A-3 is the third part of the ISO 19005 standard for long-term PDF archiving. It introduced a groundbreaking feature: the ability to embed files of any format (XML, CSV, Excel, etc.) directly inside a PDF/A document. This means a single PDF/A-3 file can hold both a human-readable document and machine-readable data. Below, we explain what PDF/A-3 is, how it differs from earlier PDF/A versions, and why it’s pivotal in electronic invoicing and tax compliance – enabling “hybrid” invoices that satisfy both business readability and automated data processing requirements. [en.wikipedia.org], [en.wikipedia.org]
- PDF/A-3: Embedded Data in Archival PDFs
- Any file format as attachment: Unlike its predecessors, PDF/A-3 permits embedding files of any type (e.g. XML, CSV, Word, Excel) within a PDF. This creates a self-contained document that holds the visual content and supplementary data together for future-proof archiving.
- Hybrid E-Invoices (ZUGFeRD & Factur-X)
- Machine + human-readable: European e-invoicing standards like Germany’s ZUGFeRD and France’s Factur-X use PDF/A-3 to embed a structured XML invoice (compliant with the EN 16931 EU standard) inside a PDF. The result is a hybrid invoice: the PDF serves as the legible invoice for people, while the attached XML enables automated processing by software.
- Compliance & Archiving Benefits
- One file, dual purpose: A PDF/A-3 invoice preserves the exact visual layout sent to a customer, and carries the full invoice dataset for audits or VAT reporting. Both representations are kept in sync within one document, maintaining a reliable audit trail. PDF/A’s strict requirements (all fonts embedded, no external content) ensure the document remains viewable decades later, satisfying legal archiving mandates. Additionally, PDF/A-3 supports digital signatures (PAdES), so the PDF (including its XML attachment) can be electronically signed to guarantee integrity and authenticity.
Understanding the PDF/A Family and PDF/A-3’s Role
PDF/A (“PDF/Archive”) is a series of ISO standards (19005) defining PDF subsets for long-term electronic document preservation. PDF/A ensures that a document can be reproduced in the same way in the future by prohibiting features unsuitable for archiving (like encryption or missing fonts) and requiring self-containment (all fonts, colors, etc. are embedded in the file). There have been several versions – PDF/A-1, -2, -3 (and recently PDF/A-4) – each building on newer PDF technologies while preserving the archival principles. [en.wikipedia.org]
PDF/A-1 (2005) was the first version, based on PDF 1.4. It’s the most restrictive: it disallows modern features such as JPEG2000 images, transparency layers, and attachments of any kind. PDF/A-2 (2011), based on PDF 1.7, introduced support for newer features provided they remain self-contained. PDF/A-2 allows embedding other PDF/A files as attachments (e.g. bundling multiple related PDFs) and permits transparency and JPEG2000 compression. It also added a new conformance level ‘U’ (Unicode) to ensure text can be copied/searchable without requiring full semantic tagging. [apryse.com], [en.wikipedia.org] [en.wikipedia.org], [apryse.com] [apryse.com]
PDF/A-3 (2012) is virtually identical to PDF/A-2 in technical requirements and PDF 1.7 baseline – with one crucial change: it permits embedding files of any format as attachments inside the PDF/A document. This was a response to user demands for preserving original source data (XML, spreadsheets, etc.) together with the document. Critics noted this could undermine long-term accessibility (since future compatibility of the embedded files isn’t guaranteed by the PDF/A standard). However, proponents argued that carefully choosing durable, standard formats (XML, CSV, plain text, etc.) mitigates this risk, and the benefit of a unified human+machine-readable document is immense for many industries. In fact, PDF/A-3’s design was “inspired by a desire to have a machine-readable component” within archived documents – a capability soon embraced by the business community, especially for invoices and records that pair a visual representation with structured data. [apryse.com] [pdf-tools.com]
To put the versions in perspective, the table below highlights key differences:
- PDF/A Version: PDF/A-1
Year / ISO Standard: 2005 (ISO 19005-1)
PDF Base: PDF 1.4
Key Features & Limitations: First archival PDF standard. Strictest rules: no transparency or modern compression (JPEG2000), fonts must be embedded, no encryption [apryse.com], [en.wikipedia.org]. Only basic a (accessible) or b (basic) conformance levels.
Attachment Policy: No attachments allowed [apryse.com] – the PDF/A-1 file must stand alone (external content references are forbidden). - PDF/A Version: PDF/A-2
Year / ISO Standard: 2011 (ISO 19005-2)
PDF Base: PDF 1.7 (ISO 32000-1)
Key Features & Limitations: Expanded features aligned with later PDF tech: allows transparency and layers, JPEG2000 and JBIG2 images, and digital signatures (PAdES) for authenticity [en.wikipedia.org]. Introduces ‘U’ level for Unicode text mapping [apryse.com].
Attachment Policy: Limited attachments: permitted to embed files, but only if the attachments are themselves PDF/A-compliant PDFs [apryse.com] (e.g. a PDF/A-2 file could carry another PDF/A invoice as an attachment). - PDF/A Version: PDF/A-3
Year / ISO Standard: 2012 (ISO 19005-3)
PDF Base: PDF 1.7 (ISO 32000-1)
Key Features & Limitations: Same feature set as PDF/A-2. The only change is controversial but transformative: support for embedding files of any type (XML, Excel, etc.) inside the PDF/A document [pdf-tools.com], [apryse.com]. Conformance levels a, b, u remain.
Attachment Policy: Any format attachments: allows embedded files of any format (not just PDFs). This enables “hybrid” PDFs that marry human-readable content with embedded data. Example: a PDF/A-3 invoice can carry an XML invoice file inside it.
Note: PDF/A-4 (ISO 19005-4:2020) is the latest part (based on PDF 2.0). It streamlines conformance levels (introducing levels ‘e’ and ‘f’ instead of a/b) and continues to allow embedded files (PDF/A-4f) including modern engineering data and rich media in a controlled way. However, as of now, PDF/A-3 is the prevalent format for hybrid e‑documents, since many standards (and software tools) are built around PDF 1.7 and the 2012–2025 era use cases. [apryse.com]
Why PDF/A-3? From Archival Documents to Data-Enabled Documents
The primary motivation for PDF/A-3 was real-world business needs that weren’t met by earlier PDF/A versions. Companies and institutions wanted to keep original electronic data (structured datasets, spreadsheets, XML records) bundled with the final document. For example, a financial report might be prepared in Excel – having the original .xlsx embedded within the archived PDF allows future users to access not just the report’s pages, but also the underlying calculations in the spreadsheet. In regulated industries (finance, pharmaceuticals, engineering, etc.), preserving this source data alongside the human-readable document is valuable for audits, compliance, or simply convenience. PDF/A-2 only allowed attaching PDF/A files, but many source files aren’t PDF – thus PDF/A-3 was born to “embed file formats that do not comply with PDF/A” as attachments. [pdf-tools.com], [pdf-tools.com]
However, this flexibility raised concerns among archiving purists. Allowing arbitrary attachments could violate the self-contained ideal of PDF/A, since a non-PDF attachment might not be readable decades later (the PDF/A standard only guarantees the PDF content is viewable). For instance, embedding a proprietary binary format or an executable is risky – future software might not support it, or it could introduce security issues. Best practice, therefore, is to embed open or standardized formats (like XML, CSV, plain text, or image formats) that are likely to remain accessible. In business workflows, this is exactly what’s done: PDF/A-3 files typically embed structured text-based data (XML or JSON), not arbitrary binaries. Additionally, PDF/A-3’s rules require that viewers at least allow extraction of embedded files, even if they can’t render them. In short, PDF/A-3 trades a small increase in future format uncertainty for a large gain in current usability. When implemented carefully, it “preserves the complete digital chain” of a document – future archivists can still open the PDF visually and retrieve the original data for verification or reuse. [apryse.com]
How embedding works: Technically, PDF/A-3 uses the standard PDF embedded file mechanism, extended for archival use. Each attached file is stored within the PDF structure (in a section often called the File Specification dictionary), and an “Associated Files” entry (the AF array) links it either at the document level or to a specific page. The attachment can also be tagged with a relationship (e.g. “Data” or “Alternative”) to indicate how it relates to the main document. In hybrid invoices, the XML is typically marked as an alternative representation of the entire document. The PDF/A-3 standard mandates that any embedded file must be completely included (no external references) and that its presence is indicated in the PDF’s metadata for visibility. Users opening a PDF/A-3 in a compliant reader will see an “Attachments” panel showing the embedded files. For example, a Factur-X invoice PDF will contain an attachment named factur-x.xml (or xrechnung.xml for the XRechnung profile) which holds the invoice data. The embedded XML and the PDF content are meant to be two sides of the same coin: one for machines, one for humans. [0_FACTUR-X…5_12_04_EN | PDF], [0_FACTUR-X…5_12_04_EN | PDF] [0_FACTUR-X…5_12_04_EN | PDF]
PDF/A-3 for E-Invoicing: Marrying Human-Readable and Machine-Readable Invoices
One of the most significant applications of PDF/A-3 is in electronic invoicing. Invoices have legal and business requirements to be readable documents (for buyers, auditors, or tax officials) and increasingly need to be structured data for automated processing (for accounting systems or real-time reporting to tax authorities). PDF/A-3 enables a single file to satisfy both needs – it’s essentially the enabler of the “hybrid invoice” concept. A hybrid e-invoice is “the method of embedding structured electronic invoice data inside a PDF using the PDF/A-3 format”, so that the invoice is at once an image/PDF and a data file.
ZUGFeRD and Factur-X: Pioneering Hybrid Invoice Standards
Shortly after PDF/A-3’s release, European industry groups and government bodies embraced it for e-invoicing. Germany’s ZUGFeRD format (released 2014) and France’s Factur-X (a joint Franco-German standard, essentially ZUGFeRD 2.0) are prime examples. A ZUGFeRD/Factur-X invoice is a PDF/A-3 file that “combines machine- and human-readable data in the same document”. The PDF part is a complete visual invoice (often meeting all legal layout requirements), and an XML file following the standard invoice schema is embedded inside. That XML conforms to the European Norm EN 16931 (the EU’s semantic data model for invoices) – meaning it contains all the required invoice fields (supplier, buyer, line items, tax details, totals, etc.) in a structured format. In fact, Factur-X is explicitly defined as “a file in PDF/A-3 format … both the directly readable representation of the invoice and the envelope for the structured data file”, with the XML providing the data for automation. [pdf-tools.com] [External s…glish v2.1 | PDF], [External s…glish v2.1 | PDF]
Because the data schema is standardized EU-wide, a single hybrid PDF/A-3 invoice can satisfy multiple jurisdictions. For example, XRechnung, the German government’s official XML invoice schema for B2G transactions, can be embedded in a PDF/A-3 — German agencies have accepted PDF/A-3 invoices as long as the XML inside is XRechnung compliant (some regional governments allowed PDF invoices with XRechnung XML attachments as an alternative to pure XML submissions). Similarly, Factur-X in France uses the same approach (it’s literally the same standard under a different name). Many EU countries plan to adopt mandates for B2B e-invoicing in the coming years (as part of the “VAT in the Digital Age” initiative), and hybrid PDF/A-3 invoices compliant with EN 16931 are one way to meet those requirements in a user-friendly manner. They are especially useful during transition periods: a supplier can send a PDF/A-3 hybrid invoice via email, and a customer who isn’t yet automated can open and read the PDF, whereas a customer with an e-invoicing system can ingest the XML automatically. This flexibility – sometimes called “transition-friendliness” – is a big reason these formats are favored. [0_FACTUR-X…5_12_04_EN | PDF] [apryse.com], [pdf-tools.com] [invoicenavigator.eu]
Global usage: Outside the EU, PDF/A-3 is also recognized. For instance, in Latin America the prevalent model is pure XML invoices plus optional human-readable PDFs. While PDF/A-3 isn’t mandated there, some businesses use PDF/A-3 to attach the mandated XML to a PDF for internal archiving or customer convenience. In Saudi Arabia, e-invoicing regulations require a human-readable invoice with a compliant XML; some solutions generate a PDF/A-3 with the XML embedded to satisfy this (the embedded XML can be validated against the Saudi tax authority’s requirements). U.S. and others: The U.S. hasn’t standardized on PDF/A-3 for invoices, but sectors like pharmaceuticals and government archives (e.g. U.S. National Archives NARA) have guidelines for using PDF/A-3 for forms and records management. In short, PDF/A-3 is internationally seen as a way to consolidate documents and data. [zatca.wafeq.com]
Ensuring Compliance and Auditability
When using PDF/A-3 for invoices, it’s critical to maintain consistency between the PDF content and the embedded data. Tax authorities and auditors expect that the human-readable invoice matches the XML exactly. European guidelines for hybrid invoices explicitly state that all invoicing data in the XML must also appear in the PDF – the XML should not introduce extra figures that aren’t visible on the invoice document. This ensures the PDF can serve as a legally valid original invoice copy, while the XML can be used for automated checks. In practice, the PDF is usually generated from the same data source as the XML, or even rendered from the XML itself, to guarantee that totals, tax amounts, dates, and line items align perfectly. Audit trail: By bundling them, a PDF/A-3 invoice creates a robust audit trail – a tax auditor can open the PDF to see the invoice exactly as issued, and also extract the XML to run it through validation tools or cross-check calculations. The embedded data often includes a detailed breakdown (per legal requirements like EN 16931) which aids in automated validation (e.g. checking that tax amounts are correctly calculated from line item details). This dual assurance builds trust that the electronic invoice is complete and untampered. [External s…glish v2.1 | PDF]
Furthermore, PDF/A-3’s support for digital signatures means a company can digitally sign the PDF/A-3 invoice once, and that signature covers the entire file including attachments. The signature (conforming to ETSI PAdES standards) will detect any post-fact modifications to either the visible PDF or the XML data, thus providing authenticity and integrity guarantees accepted by law (in the EU and many jurisdictions). Some e-invoicing implementations use this feature to sign the hybrid invoice, so that the recipient and tax authority can verify the signature. (Note: In certain centralized clearance systems – e.g., Italy or India – the government platform itself might apply a digital stamp or signature to the invoice data; even in such cases, businesses sometimes locally sign their PDF/A-3 files for internal integrity.) [en.wikipedia.org], [pdf-tools.com]
EU Regulatory Context
The European Union has been a driving force in adopting hybrid e-invoices. Directive 2014/55/EU mandated that all public administrations in the EU must accept invoices that comply with the EN 16931 standard. EN 16931 defines the data model for a compliant invoice and two allowed syntaxes (UBL XML and UN/CEFACT CII XML). Factur-X/ZUGFeRD demonstrates a third way: an EN 16931 XML embedded in a PDF/A-3. This wasn’t explicitly listed in the directive as a “syntax,” but it is effectively an envelope around an accepted syntax (CII XML) – so if the XML inside is compliant, the hybrid invoice is compliant. Germany’s federal ministries and France’s Chorus Pro platform both accept hybrid PDF invoices that meet the standard. Looking forward, the EU’s 2022 “VAT in the Digital Age (ViDA)” proposal will, from 2024, redefine e-invoice to mean “issued in a structured electronic format that allows for automatic processing” – effectively disqualifying traditional PDFs or scans. This signals that pure unstructured invoices (even if PDF/A) will no longer count as “electronic invoices” for VAT purposes. However, PDF/A-3 hybrid invoices do meet the “structured format” test because the embedded XML is fully machine-readable. Thus, PDF/A-3 provides a compliance path that combines structure with the familiar PDF format. It’s expected that many companies, especially SMEs, will use hybrid PDF invoices as an on-ramp to full e-invoicing compliance, since they can still email or visually handle PDFs while meeting the new data reporting mandates. [invoicenavigator.eu], [pdf-tools.com]
Business Benefits of PDF/A-3 in Practice
From a business perspective, PDF/A-3 offers a number of compelling advantages for document workflows, especially invoices and financial documents:
- Single-File Convenience: By keeping the human-readable and machine-readable versions together, there’s no risk of losing or mismatching the pieces. For example, instead of storing “invoice_1234.pdf” separately from “invoice_1234.xml,” a PDF/A-3 file contains both. This eliminates errors where one file is updated without the other or one gets misplaced – the data “travels with” the document as an atomic unit.
- Human & Machine Synergy: Business users (e.g. accountants, buyers) can open the PDF and view it like any normal invoice on-screen or paper. Meanwhile, back-end systems can parse the attached XML for automatic booking, validation, or reporting. There’s no need for OCR or manual re-keying of data, which reduces errors and processing time. This “best of both worlds” approach eases user adoption of e-invoicing – stakeholders don’t have to give up the familiar PDF format, even as they gain the efficiencies of structured data.
- Regulatory Compliance: PDF/A-3 helps meet archiving and e-invoice regulations simultaneously. Many countries require invoices to be stored for years in the exact form they were issued; a PDF/A-3, being an ISO-standard archival PDF, meets stringent record-keeping rules (all content is embedded and unalterable). At the same time, tax authorities increasingly demand standardized data for reporting – which the embedded XML provides. For instance, a French company using Factur-X can satisfy France’s upcoming mandate for e-invoice transmission in structured form and its requirement to provide a readable invoice copy to customers in one go. [en.wikipedia.org] [invoicenavigator.eu], [invoicenavigator.eu]
- Long-Term Preservation & Future-Proofing: PDF/A standards are specifically designed so that documents can be opened decades in the future with consistent appearance. By using PDF/A-3 for critical documents, businesses ensure that even in, say, 2040, the invoice will still display correctly. Meanwhile, if the embedded XML is based on a standard like EN 16931 (which is expected to be supported long-term), that data too will likely remain interpretable. Organizations like libraries, archives, and large enterprises favor PDF/A-3 for digital preservation of records that contain both documents and supporting data (for example, a research report with accompanying datasets, or a contract with associated metadata files). [en.wikipedia.org], [pdf-tools.com]
- Enhanced Workflows and Integration: A PDF/A-3 invoice can streamline workflows. For example, an accounts payable system might automatically extract the XML from incoming PDF/A-3 invoices and feed it into an ERP system, while storing the PDF for human reference. This reduces manual typing and errors. It also facilitates cross-border transactions – since the XML can be multilingual and standardized, an Italian company could automatically process a French Factur-X invoice, for instance, even if the PDF content is in French. The embedded data can also carry identifiers (like order numbers, tax IDs) that backend software uses to link the invoice with other systems (procurement, tax reporting, etc.).
- Packing Additional Related Documents: PDF/A-3 allows multiple files to be embedded, not just one. In practice the primary use is an invoice XML, but companies might also embed other relevant files – e.g. a purchase order, contract, or supporting schedule – to keep a complete record set together. All such attachments must be optional supplements (the PDF remains the primary content), but it’s a convenient way to deliver a bundle of documents in one PDF package. [0_FACTUR-X…5_12_04_EN | PDF], [0_FACTUR-X…5_12_04_EN | PDF]
These benefits empower businesses to increase automation without alienating users or violating record-keeping laws. A slide from an Avalara tax presentation captures this evolution: originally invoices were either fully unstructured (paper, image or PDF) or fully structured (EDI, pure XML), but the “hybrid” PDF/A-3 + XML approach bridges the gap, enabling digital invoices that are “Visible and readable by both machines and people.” This has positioned PDF/A-3 as a key component in modern tax technology and compliance strategies. [03_Avalara…-Invoicing | PDF], [03_Avalara…-Invoicing | PDF]
Risks and Considerations
While PDF/A-3 brings clear advantages, organizations should be mindful of a few challenges and best practices:
- Format Obsolescence of Attachments: The long-term readability of embedded files is not guaranteed by the PDF/A standard. If you embed a file in a proprietary or obscure format, there’s a chance that in 10+ years there may be no software to open it. The PDF/A-3 file itself will still display the main PDF content, but the attachment could become a dead weight. To minimize this risk, use widely adopted, standard formats (XML, CSV, plain text, etc.) for embedded data. For example, XML, being text-based and standardized, is far more likely to be readable in the future than a binary spreadsheet from a legacy application. [apryse.com]
- Validation and Standards Compliance: When creating PDF/A-3 documents, you must ensure they truly conform to the standard. It’s easy to accidentally produce a PDF that looks fine but isn’t technically PDF/A-3 compliant. Common pitfalls include leaving in features like transparency or external hyperlinks, or not embedding certain fonts or color profiles. Using a PDF/A validator tool is highly recommended to catch issues (the PDF Tools AG 3-Heights® PDF Validator or open-source veraPDF are examples). For hybrid invoices, additionally ensure the XML meets the required schema (e.g. EN 16931 validation) and that the PDF contains all mandated information in human-readable form. Many e-invoicing solution providers include validation steps to guarantee that the PDF/A-3 and XML are synchronized and compliant. [External s…glish v2.1 | PDF]
- File Size Consideration: Attaching a file will increase the PDF’s size. In most cases (XML data is usually only a few kilobytes or a couple of MB), this is negligible. But if an organization considers embedding very large files (e.g. high-resolution images or big spreadsheets), it could make the PDF quite large, which might be impractical to email or store en masse. A workaround, if large data must be archived, could be to compress or reduce the data, or use PDF/A-3’s capability to link to external files (though that breaks self-containment and is rarely used). In general, for invoices this is not a big issue since XML invoices are compact.
- Tooling and Training: Consuming PDF/A-3 with attachments may require specific software or configurations. Not all PDF viewers show attachments prominently. Adobe Acrobat and many others do support attachments; some lighter or older PDF readers might ignore them. Businesses should ensure that their staff knows how to access embedded files (e.g. using the attachments panel of their PDF software) if they need to extract or view the XML. Likewise, internal IT systems should be set up to detect and handle PDF/A-3 attachments. This might involve slight adaptations – for instance, an AP automation system might need an update to pull invoice XML from PDFs it receives. These are typically one-time setup costs. As hybrid invoices become more widespread, tool support is becoming standard.
- Security Considerations: Any time you allow file attachments, there’s a theoretical security risk – an embedded file could contain malware or harmful content. PDF/A-3 doesn’t allow launching embedded executables from within the PDF (that’s forbidden), and in a compliance scenario the attachments are usually just data files (which are low-risk). Still, organizations should treat attachments with the same caution as any external file: use antivirus scanning on extracted attachments and restrict who can add attachments to outgoing PDFs. [en.wikipedia.org]
- Future Trends – PDF/A-4 and Alternatives: As noted, PDF/A-4 (the 2020 standard) also allows embedded files (and even introduces an ‘e’ conformance for engineering use with 3D models, etc.). Over time, PDF/A-4 may supersede PDF/A-3 for new applications. However, PDF/A-4 has some differences (it’s based on PDF 2.0, drops the a/b/u levels in favor of new ones, etc.) and as of 2026 it’s still emerging in software support. Companies heavily invested in PDF/A-3 (and standards like Factur-X) are likely to continue with it until PDF/A-4 proves itself. Meanwhile, some organizations might choose to skip “documents” entirely and use pure data streams for e-invoicing (especially in controlled platform environments). The hybrid approach remains valuable, though, wherever a human-readable document is desired – it’s user-friendly and backward-compatible. We can expect PDF/A-3 and similar hybrid formats to play a key role in compliance for the next several years, particularly as countries implement new e-invoicing mandates and need a bridge from paper/pdf to full digital. [apryse.com]
Conclusion
PDF/A-3 represents a powerful fusion of document readability and data accessibility. By permitting embedded non-PDF files within an archival PDF, it allows the creation of rich “document packages” – a concept that has proven especially useful for e-invoices. Businesses leveraging PDF/A-3 can issue invoices that meet tax authorities’ data mandates without sacrificing the familiarity and legal assurance of a PDF document. In the European VAT compliance landscape and beyond, PDF/A-3 has become the de facto standard for hybrid invoices like ZUGFeRD and Factur-X. When implementing PDF/A-3, organizations should adhere to best practices: use standard data formats, maintain consistency between PDF and XML content, and validate compliance with relevant standards. When used correctly, PDF/A-3 is not just an archival format, but a bridge between human and machine-readable information – ensuring that digital documents remain both understandable for people and processable by software in our increasingly automated, compliance-driven world.
Sources: Standards and authoritative guides on PDF/A; industry articles and knowledge bases; European e-invoicing specifications and enterprise guidance on Factur-X/ZUGFeRD hybrid invoices; VAT compliance presentations and EU regulatory documentation. All information is drawn from vetted sources to ensure accuracy and relevance to both the technical and business dimensions of PDF/A-3 and its role in e-invoicing. [en.wikipedia.org], [pdf-tools.com] [03_Avalara…-Invoicing | PDF], [invoicenavigator.eu]
- See also
- Join the Linkedin Group on Global E-Invoicing/E-Reporting/SAF-T Developments, click HERE
- Join the LinkedIn Group on ”VAT in the Digital Age” (VIDA), click HERE
Latest Posts in "World"
- VATupdate Newsletter Week 21 2026
- Understanding EU E-Invoicing: EN 16931, UBL, CII, and National Syntaxes
- E‑Invoicing & E‑Reporting Explained: From Invoice to Intelligence (WIP)
- E‑Invoicing Explained – Syntax Reality: UBL vs CII and the Impact on Mapping and Validation
- The Revo Wind‑Down – Week ending May 24: The Countdown to 2030: Digital Tax Mandates Accelerate














