Bug 119782 - Search engine-friendly and accessible HTML export: output slide text
Summary: Search engine-friendly and accessible HTML export: output slide text
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Impress (show other bugs)
Version:
(earliest affected)
6.0.6.2 release
Hardware: x86-64 (AMD64) Linux (All)
: medium enhancement
Assignee: Not Assigned
URL:
Whiteboard:
Keywords: accessibility
Depends on:
Blocks: a11y, Accessibility (X)HTML-Export
  Show dependency treegraph
 
Reported: 2018-09-10 12:34 UTC by Johannes Buchner
Modified: 2023-07-06 06:05 UTC (History)
3 users (show)

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Johannes Buchner 2018-09-10 12:34:01 UTC
Description:
I have presentations which I would like to put on the web. The goal is to have it indexed by content and discoverable by google searches.




Steps to Reproduce:
I have tried "export > HTML Impress", which produces pages with images. It looks nice and is navigable with buttons.
I have tried "export > XHTML", which produces text, but the formatting is quite different.
I have tried "export > SVG", which produces a single page in SVG. It looks nice, and contains the text as text.

Actual Results:
In none of the cases, I have the full presentation with text content (alt is empty).

Expected Results:
A nice solution could be 
a) "export > HTML Impress" fills the <img alt=""> attribute with text fields found on the slide.
b) To allow "export > HTML Impress" to use the "export > SVG" functionality to write out SVG in place of PNG output on each slide.

In either case, I don't think the UI has to be changed substantially.




Reproducible: Always


User Profile Reset: No



Additional Info:
Comment 1 Heiko Tietze 2018-10-05 11:35:09 UTC
(In reply to Johannes Buchner from comment #0)
> Description:
> I have presentations which I would like to put on the web. The goal is to
> have it indexed by content and discoverable by google searches.

Basically you shouldn't expect an office suite to be a perfect HTML editor. There are better suited tools. But going into details...
 
> A nice solution could be 
> a) "export > HTML Impress" fills the <img alt=""> attribute with text fields
> found on the slide.

Sounds like a simple and useful enhancement. Though "text fields on the slide" is not precise enough. How about the title?

> b) To allow "export > HTML Impress" to use the "export > SVG" functionality
> to write out SVG in place of PNG output on each slide.

SVG export is what we recommend to prefer over HTML. If you select all slides and export you get the full presentation (the "Selection Only" checkbox is not working, cant find the respective ticket).


So putting all together I'd say yes to the enhancement of <img alt=""> and change the summary accordingly. Up to QA now.
Comment 2 Stéphane Guillou (stragu) 2021-07-06 23:41:40 UTC
Adding bug 105303 to "see also" as the plan is to remove HTML export. However, I think this should stay open to discuss the topic of accessibility and indexability.