Bug 135192 - Accessibility of PDF export: "Export as > Tagged PDF" does not export correct tags for tables
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Impress
(earliest affected) release
Hardware: All All
Importance: medium normal
Assignee: Not Assigned
Keywords: accessibility
Depends on:
Blocks: PDF-Export PDF-Accessibility
Reported: 2020-07-27 12:01 UTC by zainab.ali
Modified: 2020-12-17 18:00 UTC

See Also:
Crash report or crash signature:

The LibreOffice file containing a table (13.41 KB, application/vnd.oasis.opendocument.presentation)
2020-07-27 12:03 UTC, zainab.ali
The tagged PDF export (12.74 KB, application/pdf)
2020-07-27 12:04 UTC, zainab.ali

Description zainab.ali 2020-07-27 12:01:49 UTC
The Tagged PDF export for LibreOffice Impress does not contain marked up tables.

Steps to Reproduce:
1. Open the attached presentation (ODP). There is a table on the first slide.
2. Export as a Tagged PDF using File > Export as ... > Tagged PDF
3. Open the exported PDF in a tool that shows the tag tree, such as Adobe Acrobat. The table contents are tagged as 'P' (paragraph) elements.

Actual Results:
The table contents are tagged as 'P' (paragraph) elements.

Expected Results:
The table is tagged as a 'Table' element.

Reproducible: Always

User Profile Reset: Yes

Additional Info:
This is an accessibility concern. Screen reader users cannot identify tables in the tagged PDF export.
Comment 1 zainab.ali 2020-07-27 12:03:18 UTC
Created attachment 163618
The LibreOffice file containing a table
Comment 2 zainab.ali 2020-07-27 12:04:52 UTC
Created attachment 163619
The tagged PDF export
Comment 3 V Stuart Foote 2020-07-27 18:26:56 UTC
For bug 45636, the project has implemented support for PDF/UA (ISO 14289). It is available in the 7.0.0rc2 release and in current master/7.1.0 daily builds.

Please retest.
Comment 4 Timur 2020-07-27 19:49:55 UTC
Please explain whether these tags can be viewed in an open source or free tool.
Comment 5 zainab.ali 2020-07-29 08:53:36 UTC
Thank you for following up.

They can be seen using the Apache PDFBox Debugger:

This is a Java library and must be set up as part of a Java / JVM project.  Setup instructions can be found on the following page:

Once opened, you can examine the PDF structure tree to see the accessibility metadata.
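
For a quick check without a GUI, here is a minimal Python sketch (standard library only) that tallies structure types by scanning the raw PDF bytes. It is not a PDF parser and has nothing to do with the PDFBox API: the function name `count_structure_types` and the regular expressions are my own assumptions. It only sees structure elements stored as plain, uncompressed indirect objects, so an empty result does not prove that tags are missing.

```python
import re
from collections import Counter

# One indirect object, e.g. "12 0 obj ... endobj".
OBJ = re.compile(rb"\d+\s+\d+\s+obj\b(.*?)\bendobj", re.DOTALL)
# Marker that a dictionary is a structure element.
STRUCT_ELEM = re.compile(rb"/Type\s*/StructElem\b")
# The structure type entry, e.g. "/S/P" or "/S /Table".
STRUCT_TYPE = re.compile(rb"/S\s*/([A-Za-z0-9]+)")


def count_structure_types(pdf_bytes: bytes) -> Counter:
    """Tally structure types (P, Table, ...) found in uncompressed objects.

    Hypothetical helper: a rough byte scan, assuming structure elements
    are not stored inside compressed object streams.
    """
    counts = Counter()
    for match in OBJ.finditer(pdf_bytes):
        body = match.group(1)
        if not STRUCT_ELEM.search(body):
            continue  # not a structure element dictionary
        tag = STRUCT_TYPE.search(body)
        if tag:
            counts[tag.group(1).decode("ascii")] += 1
    return counts


# Example usage (the file name is hypothetical):
# with open("export.pdf", "rb") as f:
#     print(count_structure_types(f.read()))
```

If the result lists only 'P' and no 'Table', that matches what is described in this report. Anything beyond a rough count should be checked in a real structure tree viewer.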
Comment 6 V Stuart Foote 2020-07-29 12:28:47 UTC
@Christophe S. - could you comment on both the Tagged PDF and the PDF/UA handling? Specifically, what, if anything, is missing from the PDF/UA filter exports?