Bug 113620 - Conversion of Draw documents to html fails with command line.
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Draw
Version (earliest affected): 4.3.0.4 release
Hardware: x86-64 (AMD64) All
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords: bibisected, bisected, regression
Depends on:
Blocks: Commandline
Reported: 2017-11-03 10:27 UTC by Julian Jung
Modified: 2024-10-25 07:54 UTC
CC List: 7 users

See Also:
Crash report or crash signature:


Attachments
The necessary files to reproduce the bug. (8.50 KB, application/x-zip-compressed)
2017-11-03 10:28 UTC, Julian Jung

Description Julian Jung 2017-11-03 10:27:14 UTC
Description:
Converting a Draw document to HTML from the command line does not work. The command

soffice --headless --convert-to html:draw_html_Export testFile.odg 

produces an empty HTML file with LibreOffice 5.4.1.2 (the file is valid HTML with a body, but the body has no content).

Running the same command with LibreOffice 4 produces an HTML file that shows the content of the Draw document.

The behavior can be reproduced with the attached files (the HTML files produced by LibreOffice 4 and 5 are included as well).


Steps to Reproduce:
1. Run: soffice --headless --convert-to html:draw_html_Export testFile.odg
2. Open the resulting HTML file. It is empty.

Actual Results:
An empty HTML file.

Expected Results:
A valid HTML file showing the content, as in LibreOffice 4.
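
For anyone re-testing this against other builds, the check above can be scripted. The following is only a minimal sketch, not part of the original report: it assumes `soffice` is on the PATH, that the output file is named after the input with an `.html` extension, and that "empty" means the body has no child elements and no non-whitespace text. The names `BodyContentProbe`, `body_is_empty` and `convert_and_check` are made up for illustration.

```python
import subprocess
from html.parser import HTMLParser
from pathlib import Path


class BodyContentProbe(HTMLParser):
    """Records whether <body> holds any element or non-whitespace text."""

    def __init__(self):
        super().__init__()
        self.in_body = False
        self.has_content = False

    def handle_starttag(self, tag, attrs):
        if tag == "body":
            self.in_body = True
        elif self.in_body:
            # Any element inside the body counts as content (assumption).
            self.has_content = True

    def handle_endtag(self, tag):
        if tag == "body":
            self.in_body = False

    def handle_data(self, data):
        if self.in_body and data.strip():
            self.has_content = True


def body_is_empty(html_text):
    """Return True if the HTML body contains no elements and no visible text."""
    probe = BodyContentProbe()
    probe.feed(html_text)
    probe.close()
    return not probe.has_content


def convert_and_check(source, outdir="."):
    """Run the conversion from this report; return True if the result is empty."""
    subprocess.run(
        ["soffice", "--headless", "--convert-to", "html:draw_html_Export",
         "--outdir", str(outdir), str(source)],
        check=True,
    )
    # Assumed output name: input stem plus ".html" inside outdir.
    html_path = Path(outdir) / (Path(source).stem + ".html")
    return body_is_empty(html_path.read_text(encoding="utf-8", errors="replace"))
```

Calling `convert_and_check("testFile.odg")` with an affected build would be expected to return True, going by the description above. The definition of "empty" may need adjusting if the exporter writes placeholder markup into the body.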


Reproducible: Always


User Profile Reset: Yes



Additional Info:


User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Firefox/52.0
Comment 1 Julian Jung 2017-11-03 10:28:24 UTC
Created attachment 137498
The necessary files to reproduce the bug.
Comment 2 Xisco Faulí 2017-11-03 17:01:59 UTC
> Expected Results:
> A valid HTML file showing the content like in LibO 4

Which version was it?
I've just tried in

- Version: 4.3.0.0.alpha1+
Build ID: c15927f20d4727c3b8de68497b6949e72f9e6e9e

- Version 4.1.0.0.alpha0+ (Build ID: efca6f15609322f62a35619619a6d5fe5c9bd5a)

and the resulting HTML file is empty in both.
Comment 3 Julian Jung 2017-11-06 06:51:30 UTC
It was version 4.0.3.3+ (Build ID: 444507c47867f262c8b880ccdc998154f02fca8)
Comment 4 Buovjaga 2017-11-13 17:36:16 UTC
Repro.

Arch Linux 64-bit, KDE Plasma 5
Version: 6.0.0.0.alpha1+
Build ID: d73225119476de1826f648acca9e93bf6797e813
CPU threads: 8; OS: Linux 4.13; UI render: default; VCL: kde4; 
Locale: fi-FI (fi_FI.UTF-8); Calc: group
Built on November 12th 2017
Comment 5 Buovjaga 2018-07-05 12:24:37 UTC Comment hidden (obsolete)
Comment 6 Buovjaga 2018-07-06 18:27:00 UTC
Bisected with 43max on Linux to
commit 16d2798254a62975fb561ac2dfe3acf05c144a99
Author: Matthew Francis <mjay.francis@gmail.com>
Date:   Thu May 28 20:58:02 2015 +0800

    source-hash-69fe10662845288c09a05716a93876e2ca95c4bd
    
    commit 69fe10662845288c09a05716a93876e2ca95c4bd
    Author:     Tomaž Vajngerl <tomaz.vajngerl@collabora.com>
    AuthorDate: Fri Apr 11 14:14:49 2014 +0200
    Commit:     Tomaž Vajngerl <tomaz.vajngerl@collabora.com>
    CommitDate: Sat Apr 12 12:58:49 2014 +0200
    
        sd html export: "single document" publish mode & support tables
    
        Change-Id: I0c9f45011091f962d5304c9efcaebbed7e6c3d9d

Adding Cc: to Tomaž Vajngerl
Comment 7 QA Administrators 2019-07-30 03:15:19 UTC Comment hidden (obsolete)
Comment 8 Julian Jung 2019-07-30 06:52:50 UTC
The bug is still reproducible with:

Version: 6.2.0.0.alpha1+ (x64)
Build-ID: 993948678e7a78c7c91e2c2728a9b7182cb2bc79
CPU-Threads: 12; BS: Windows 10.0; UI-Render: Standard; VCL: win;
Comment 9 Timur 2023-11-01 11:08:41 UTC
With 24.2, export to HTML from the GUI also does not export the rectangle.
Comment 10 Noel Grandin 2024-10-25 07:54:32 UTC
So this is more complicated than it appears.

We used to have multiple HTML exporters.

Then with
    commit 608c35665bee5990bd7e2799854e233d1454b6a4
    Author: Xisco Fauli <xiscofauli@libreoffice.org>
    Date:   Wed Sep 27 10:58:34 2023 +0200
    tdf#105303: re-introduce single-document html export filter

we dropped the old "PUBLISH_HTML" exporter in favour of the newer "single-document" exporter.

The old exporter exported each page as an image, which means that text on the page is not selectable and copyable.

The new exporter tries to export text as HTML text, but largely ignores shapes.

The two approaches are completely different and yield different results with different benefits.

There are several options here:

(*) Just live with the current situation.
(*) Rename the current filter to "Only export text to HTML".
(*) Restore the old PUBLISH_HTML filter and expose it as an option.
(*) Tell people to use the SVG export filter instead (which exports both text and shapes in a way that keeps the information selectable).
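
For the last option, a command-line user would only need to change the `--convert-to` target. The helper below is a rough sketch for building either command line. It assumes the installed build accepts plain `svg` as a conversion target for Draw documents, which has not been verified in this report, and `build_convert_command` is a made-up name.

```python
import shlex


def build_convert_command(source, target="svg", export_filter=None, outdir=None):
    """Return the soffice argument list for a headless conversion."""
    # --convert-to takes either "ext" or "ext:FilterName".
    convert_to = f"{target}:{export_filter}" if export_filter else target
    command = ["soffice", "--headless", "--convert-to", convert_to]
    if outdir is not None:
        command += ["--outdir", str(outdir)]
    command.append(str(source))
    return command


if __name__ == "__main__":
    # SVG workaround (assumed target name) versus the HTML call from this report.
    print(shlex.join(build_convert_command("testFile.odg")))
    print(shlex.join(build_convert_command("testFile.odg", "html", "draw_html_Export")))
```

The returned list can be passed directly to `subprocess.run`, so file names with spaces need no manual quoting.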