Bug 163240 - Accessibility issue: docx to pdf conversion creates duplicate <Link> tags when hyperlinks extend over multiple lines
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: LibreOffice
Version: 24.8.2.1 release (earliest affected)
Hardware: All All
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords: accessibility
Depends on:
Blocks: PDF-Export PDF-Accessibility
Reported: 2024-10-01 20:45 UTC by ekressmiller
Modified: 2024-11-05 09:03 UTC
CC List: 3 users

See Also:
Crash report or crash signature:


Attachments

Description ekressmiller 2024-10-01 20:45:34 UTC
Description:
When converting from a docx file to a tagged PDF, hyperlinked phrases that extend over multiple lines create duplicate <Link> tags in the resulting PDF. This is an accessibility issue because it causes repetition and makes navigation harder for screen reader users.

Steps to Reproduce:
1. Create a docx document with a multi-word hyperlink that starts on one line and continues onto the next line.
2. Convert it to a tagged PDF using LibreOffice. In my case, the conversion was done by Docassemble, which drives LibreOffice through Unoconv (https://github.com/unoconv/unoconv).
3. View PDF Accessibility tags or use a screen reader to confirm existence of duplicate links.
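
For anyone who wants to run step 2 without Docassemble or Unoconv, here is a minimal sketch that calls LibreOffice directly in headless mode. It assumes a `soffice` binary on the PATH and a LibreOffice version that accepts JSON filter options on `--convert-to` (7.4 or later, as far as I know). The function names `build_convert_command` and `convert` are only illustrative, and this is not necessarily the exact invocation Unoconv uses.

```python
import json
import subprocess
from pathlib import Path


def build_convert_command(docx_path, out_dir, soffice="soffice"):
    """Build a headless LibreOffice command line that exports a tagged PDF."""
    # UseTaggedPDF is the PDF export option that enables structure tags.
    # Assumption: the installed LibreOffice accepts JSON filter options here.
    filter_options = json.dumps(
        {"UseTaggedPDF": {"type": "boolean", "value": "true"}},
        separators=(",", ":"),
    )
    return [
        soffice,
        "--headless",
        "--convert-to",
        f"pdf:writer_pdf_Export:{filter_options}",
        "--outdir",
        str(out_dir),
        str(docx_path),
    ]


def convert(docx_path, out_dir):
    """Run the conversion; this needs LibreOffice to be installed."""
    subprocess.run(build_convert_command(docx_path, out_dir), check=True)
    # LibreOffice names the output after the input file's stem.
    return Path(out_dir) / (Path(docx_path).stem + ".pdf")
```

Calling `convert("link.docx", "out")` should write `out/link.pdf`, which can then be inspected as described in step 3.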

Actual Results:
The hyperlink gets a separate <Link> accessibility tag for the words on each line, so a single hyperlink ends up with duplicate tags. A screen reader reads the hyperlink twice, and the hyperlink is also listed twice in the screen reader's list of links on the page.

Expected Results:
There should be only a single <Link> tag for a single hyperlink, even if it extends over multiple lines. A screen reader should only read the link once.
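
The expected behavior can be expressed as a simple check. This is a rough sketch only: it assumes the PDF's structure tags have already been extracted, by some other tool, into a list of `(tag, target)` pairs in reading order. That input format and the name `find_split_links` are hypothetical, not something LibreOffice or a PDF library produces.

```python
def find_split_links(elements):
    """Return (index, uri) for each Link tag that repeats the previous Link's target."""
    duplicates = []
    previous = None
    for index, (tag, uri) in enumerate(elements):
        # Two adjacent Link tags with the same target suggest one hyperlink
        # that was split at a line break.
        if tag == "Link" and previous == ("Link", uri):
            duplicates.append((index, uri))
        previous = (tag, uri)
    return duplicates


# Hypothetical tag list for a page where one hyperlink wraps onto a second line.
tags = [
    ("P", None),
    ("Link", "https://example.org/guide"),
    ("Link", "https://example.org/guide"),
    ("P", None),
    ("Link", "https://example.org/other"),
]

print(find_split_links(tags))
```

With the fix in place, the list returned for a document like the one in the steps above should be empty. Note that this heuristic would also flag two genuinely separate, adjacent hyperlinks that point to the same target.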


Reproducible: Always


User Profile Reset: No

Additional Info:
N/A. Using LibreOffice via Docassemble and Unoconv.
Comment 1 Chika 2024-11-01 18:37:07 UTC
Hello,

Thank you for reporting the bug. I can confirm that the bug is present in both the master and dev builds.

Master Version: 24.8.2.1 (X86_64) / LibreOffice Community
Build ID: 0f794b6e29741098670a3b95d60478a65d05ef13
CPU threads: 8; OS: macOS 13.6.3; UI render: Skia/Metal; VCL: osx
Locale: en-US (en_US.UTF-8); UI: en-US
Calc: threaded

Dev Version: 25.2.0.0.alpha0+ (X86_64) / LibreOffice Community
Build ID: 2d65d52bd208acde60e77ec49b995958985babe7
CPU threads: 8; OS: macOS 13.6.3; UI render: Skia/Metal; VCL: osx
Locale: en-US (en_US.UTF-8); UI: en-US
Calc: threaded

Moving the status to NEW.

Sincerely,

Chika (from CSUMB tester)