Bug 83890 - Save as HTML numbering confusion
Summary: Save as HTML numbering confusion
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: LibreOffice
Version (earliest affected): 4.2.6.2 release
Hardware: Other All
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard: target:5.2.0
Keywords:
Depends on:
Blocks: (X)HTML-Export
Reported: 2014-09-15 13:59 UTC by Marc Bessières
Modified: 2022-08-26 20:08 UTC

See Also:
Crash report or crash signature:


Attachments
tar file containing odt file, converted html, and gif showing diff (70.00 KB, application/tar)
2014-09-15 13:59 UTC, Marc Bessières
Screen shot of converted html (61.97 KB, image/png)
2016-01-12 05:58 UTC, Luke Picciau

Description Marc Bessières 2014-09-15 13:59:55 UTC
Created attachment 106320
tar file containing odt file, converted html, and gif showing diff

I checked whether this had already been reported but couldn't find anything. Sorry if I missed it.

At work, when we convert MS Word documents to HTML, we sometimes run into numbering issues.

We managed to make a small reproducer for one of these issues.
The small.odt file originally comes from a large MS Word document.

The attachment contains small.odt and small.html (the result of converting the former to HTML).
It also contains a GIF file that shows the differences I see:
- Title C in the HTML should be numbered 3 rather than 1
- Under Title B, the paragraph "text" and the one following it should be numbered 2 and 3 rather than 1 and 2, respectively
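
For illustration only, here is a minimal Python sketch of how this kind of restart can arise. It assumes (this is not confirmed by the attached small.html) that each numbered heading ends up in its own <ol> element; a browser then numbers every such list from 1 unless the element carries a start attribute. The names OlNumbering and displayed_numbers and both markup strings are hypothetical, and the sketch ignores details such as the value attribute on <li>.

```python
from html.parser import HTMLParser


class OlNumbering(HTMLParser):
    """Collect the number a browser would display for each <li> in an <ol>."""

    def __init__(self):
        super().__init__()
        self.counters = []  # one running counter per currently open <ol>
        self.in_li = False
        self.items = []     # [displayed number, item text] pairs

    def handle_starttag(self, tag, attrs):
        if tag == "ol":
            # A list starts at 1 unless a start attribute says otherwise.
            start = dict(attrs).get("start")
            self.counters.append(int(start) if start else 1)
        elif tag == "li" and self.counters:
            self.in_li = True
            self.items.append([self.counters[-1], ""])
            self.counters[-1] += 1

    def handle_endtag(self, tag):
        if tag == "ol" and self.counters:
            self.counters.pop()
        elif tag == "li":
            self.in_li = False

    def handle_data(self, data):
        if self.in_li and self.items:
            self.items[-1][1] += data.strip()


def displayed_numbers(html):
    """Return (number, text) for every ordered-list item in the markup."""
    parser = OlNumbering()
    parser.feed(html)
    parser.close()
    return [(number, text) for number, text in parser.items]


# Hypothetical markup: every heading sits in its own list.
WITHOUT_START = (
    "<ol><li>Title A</li></ol>"
    "<ol><li>Title B</li></ol>"
    "<ol><li>Title C</li></ol>"
)
WITH_START = (
    "<ol><li>Title A</li></ol>"
    '<ol start="2"><li>Title B</li></ol>'
    '<ol start="3"><li>Title C</li></ol>'
)

print(displayed_numbers(WITHOUT_START))  # every title is numbered 1
print(displayed_numbers(WITH_START))     # titles are numbered 1, 2, 3
```

Running displayed_numbers on the real small.html would be one way to list the numbers a browser shows and compare them with the numbering in small.odt.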

This was run on Linux with
Version: 4.2.6.2
Build ID: 185f2ce4dcc34af9bd97dec29e6d42c39557298f
Comment 1 Tomaz Vajngerl 2014-09-15 14:42:29 UTC
Looks wrong indeed.
Comment 2 QA Administrators 2015-10-14 19:58:14 UTC Comment hidden (obsolete)
Comment 3 Luke Picciau 2016-01-12 05:58:39 UTC
Created attachment 121869
Screen shot of converted html

Tested on Ubuntu 15.10 with LibreOffice 5.0.3.2. The file converted mostly fine; however, a "#" appeared on the line after each of Title A, Title B, and Title C, and Title C was numbered 2. rather than the 1. seen in the attached HTML.
Comment 4 Commit Notification 2016-02-08 10:04:23 UTC
Marc Bessières committed a patch related to this issue.
It has been pushed to "master":

http://cgit.freedesktop.org/libreoffice/core/commit/?id=c24aa380eaeb792d6b7125e54eab833d8b65dbc7

tdf#83890 sw html export: fix missing start numbering attribute

It will be available in 5.2.0.

The patch should be included in the daily builds available at
http://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More
information about daily builds can be found at:
http://wiki.documentfoundation.org/Testing_Daily_Builds

Affected users are encouraged to test the fix and report feedback.
Comment 5 Buovjaga 2016-02-16 11:21:01 UTC
Not export, but Save as.

Title C is still numbered 1 instead of 3.

The problem with Title B is fixed.

Win 7 Pro 64-bit Version: 5.2.0.0.alpha0+
Build ID: a6f876d45bd4e41a7143594a6cb11b6893a0f620
CPU Threads: 4; OS Version: Windows 6.1; UI Render: default; 
TinderBox: Win-x86@39, Branch:master, Time: 2016-02-11_00:07:38
Locale: fi-FI (fi_FI)
Comment 6 QA Administrators 2017-10-26 08:10:16 UTC Comment hidden (obsolete)
Comment 7 Thomas Lendo 2017-11-08 22:00:10 UTC
Same as in comment 5.

Version: 6.0.0.0.alpha1+
Build ID: 5e0022c90c4125a1590b3688dfec73c271b7aedd
CPU threads: 4; OS: Linux 4.10; UI render: default; VCL: gtk3; 
Locale: de-DE (de_DE.UTF-8); Calc: group
today's build
Comment 8 QA Administrators 2018-11-09 03:59:24 UTC Comment hidden (obsolete)
Comment 9 QA Administrators 2020-11-09 04:17:30 UTC Comment hidden (obsolete)
Comment 10 Stéphane Guillou (stragu) 2022-08-26 20:08:13 UTC
Reproduced as in comment 5 with:

Version: 7.5.0.0.alpha0+ / LibreOffice Community
Build ID: 24087697d5cf78aac346d4dcea0596373e15a95c
CPU threads: 8; OS: Linux 5.15; UI render: default; VCL: gtk3
Locale: en-AU (en_AU.UTF-8); UI: en-US
Calc: threaded