Description: https://gofile.io/?c=jwgcwg The attached file is converted to HTML appropriately on Mac OSX 6.2.5.1, but on the same Linux distribution (6.2.5.1) the HTML is different and some extra bullets are displayed. Steps to Reproduce: 1. Download .DOCX file from https://gofile.io/?c=jwgcwg 2. Convert the .DOCX file using Linux distribution of LibreOffice into HTML 3. Convert the .DOCX file using Mac distribution of LibreOffice into HTML Actual Results: Compare the two HTML files. The linux version has bullets in front of the text (incorrect) and the Mac version does not (correct). Expected Results: Both resulting HTML files should NOT have any bullets in front of the text. It should look just like when the .DOCX file is opened. Reproducible: Always User Profile Reset: Yes Additional Info:
Created attachment 153150 [details] Sample .DOCX file to show bug with conversion to HTML
Upon further inspection: Converting the attached file from DOCX to HTML using command line (e.g. soffice --headless --convert-to html problem.docx) results in a different HTML than exporting the file from the application GUI.
It appears the application export function is more accurate than the command line conversion function. In the case of the attachment in this bug, the application export is correct and accurate while the conversion function is incorrect.
Same issue: https://ask.libreoffice.org/en/question/179698/headings-are-numbered-when-converting-docx-to-html/
A new major release of LibreOffice is available since this bug was reported. Could you please try to reproduce it with the latest version of LibreOffice from https://www.libreoffice.org/download/libreoffice-fresh/ ? I have set the bug's status to 'NEEDINFO'. Please change it back to 'UNCONFIRMED' if the bug is still present in the latest version.
Bug still existing in new version
What is the origin of this DOCX file? MS Office? Version what?
Any docx file
Yes MS office that uses docx
Ok, so here's the deal: If you unzip the docx file and look at word/document.xml, the paragraphs are defined as a numbered list, using numPr as documented here: https://c-rex.net/projects/samples/ooxml/e1/Part4/OOXML_P4_DOCX_numPr_topic_ID0EBBCM.html As the document does not show the numbering in LibreOffice, the result from "Export as xhtml" is more correct: it *does* create an ordered list <ol>, but it has the rule list-style: none; for <li> elements. Also, the numbering in the xhtml export starts from 1. while the numbering in the document produced by "Save as html" start from 0. for some reason. I know it is silly that we have two ways to save html...
Thanks. I'm trying to verify your suggestion. I tried both commands below: soffice --headless --convert-to html problem.docx soffice --headless --convert-to xhtml problem.docx The resulting HTML and XHTML files are identical. Is there a particular way to convert to the XHTML as you suggested?
Seems like related to this bug: https://bugs.documentfoundation.org/show_bug.cgi?id=67035
(In reply to echan00 from comment #11) > Thanks. I'm trying to verify your suggestion. I tried both commands below: > > soffice --headless --convert-to html problem.docx > > soffice --headless --convert-to xhtml problem.docx > > The resulting HTML and XHTML files are identical. Is there a particular way > to convert to the XHTML as you suggested? File - Export and pick XHTML
Similar problem with font in HTML in numbered list wrong. See new ticket Bug 140146 for precise complaint
Reproduced with LO 7.3 alpha0, 7.2 beta1 and 7.0.6 on Ubuntu 18.04, using "File > Save as... > HTML", and opening the resulting .html file in Firefox 89.0.1 or Chromium 91.0.4472.101. Two lines are numbered, starting at 0. "File > Export > XHTML" results in HTML files without numbering, as expected. Version: 7.3.0.0.alpha0+ / LibreOffice Community Build ID: f446a203fa2897bab8ae7686c948a8bf060675c6 CPU threads: 8; OS: Linux 4.15; UI render: default; VCL: gtk3 Locale: en-AU (en_AU.UTF-8); UI: en-US TinderBox: Linux-rpm_deb-x86_64@86-TDF, Branch:master, Time: 2021-06-24_15:16:38 Calc: threaded Version: 7.2.0.0.beta1 / LibreOffice Community Build ID: c6974f7afec4cd5195617ae48c6ef9aacfe85ddd CPU threads: 8; OS: Linux 4.15; UI render: default; VCL: gtk3 Locale: en-AU (en_AU.UTF-8); UI: en-US Calc: threaded Version: 7.0.6.2 Build ID: 144abb84a525d8e30c9dbbefa69cbbf2d8d4ae3b CPU threads: 8; OS: Linux 4.15; UI render: default; VCL: gtk3 Locale: en-AU (en_AU.UTF-8); UI: en-US Calc: threaded
Dear echan00, To make sure we're focusing on the bugs that affect our users today, LibreOffice QA is asking bug reporters and confirmers to retest open, confirmed bugs which have not been touched for over a year. There have been thousands of bug fixes and commits since anyone checked on this bug report. During that time, it's possible that the bug has been fixed, or the details of the problem have changed. We'd really appreciate your help in getting confirmation that the bug is still present. If you have time, please do the following: Test to see if the bug is still present with the latest version of LibreOffice from https://www.libreoffice.org/download/ If the bug is present, please leave a comment that includes the information from Help - About LibreOffice. If the bug is NOT present, please set the bug's Status field to RESOLVED-WORKSFORME and leave a comment that includes the information from Help - About LibreOffice. Please DO NOT Update the version field Reply via email (please reply directly on the bug tracker) Set the bug's Status field to RESOLVED - FIXED (this status has a particular meaning that is not appropriate in this case) If you want to do more to help you can test to see if your issue is a REGRESSION. To do so: 1. Download and install oldest version of LibreOffice (usually 3.3 unless your bug pertains to a feature added after 3.3) from https://downloadarchive.documentfoundation.org/libreoffice/old/ 2. Test your bug 3. Leave a comment with your results. 4a. If the bug was present with 3.3 - set version to 'inherited from OOo'; 4b. If the bug was not present in 3.3 - add 'regression' to keyword Feel free to come ask questions or to say hello in our QA chat: https://web.libera.chat/?settings=#libreoffice-qa Thank you for helping us make LibreOffice even better for everyone! Warm Regards, QA Team MassPing-UntouchedBug