Bug 145133 - DOCX to HTML conversion with text boxes present: incorrect layout
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: filters and storage
Version: 6.0.7.3 release (earliest affected)
Hardware: All All
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: (X)HTML-Export
Reported: 2021-10-14 15:12 UTC by altunajulian
Modified: 2024-12-21 17:45 UTC (History)
CC List: 3 users

See Also:
Crash report or crash signature:


Attachments
Test file to test the inconsistencies (5.08 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2021-10-15 15:55 UTC, altunajulian
Description altunajulian 2021-10-14 15:12:02 UTC
This issue occurs when converting a DOCX file to HTML from the command line.
The command used is 'soffice --headless --convert-to "html:HTML:EmbedImages" --outdir ./dir file.docx'.

If the file contains rectangles with text boxes inside them, the output is inconsistent. A span is created for each rectangle. In some cases the rectangle's text is placed inside this span; in others it is placed outside, leaving an empty span followed by the text.
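
To make the symptom easier to check in the exported file, here is a minimal Python sketch that lists every `<span>` containing no text. It is not part of the original report. The names `EmptySpanFinder` and `find_empty_spans` are hypothetical, and the sample markup in the usage note is invented for illustration; the markup LibreOffice actually emits may differ. A span that holds only an image is also reported, since only text is counted.

```python
from html.parser import HTMLParser


class EmptySpanFinder(HTMLParser):
    """Collect <span> elements that contain no visible text (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self._stack = []        # one [attrs, has_text] entry per open <span>
        self.empty_spans = []   # attribute dict of each span that closed without text

    def handle_starttag(self, tag, attrs):
        if tag == "span":
            self._stack.append([dict(attrs), False])

    def handle_data(self, data):
        if data.strip():
            # Non-whitespace text counts for every span that is currently open.
            for entry in self._stack:
                entry[1] = True

    def handle_endtag(self, tag):
        if tag == "span" and self._stack:
            attrs, has_text = self._stack.pop()
            if not has_text:
                self.empty_spans.append(attrs)


def find_empty_spans(html):
    """Return the attributes of every <span> in `html` that holds no text."""
    finder = EmptySpanFinder()
    finder.feed(html)
    finder.close()
    return finder.empty_spans
```

For example, `find_empty_spans('<p><span>Inside</span></p><p><span style="float: left"></span>Outside</p>')` reports only the second span, whose text ended up after it rather than inside it.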

In some cases I have also seen absolute positioning added to these spans, which makes the text overlap, but I have not been able to create a minimal working example (MWE) that reproduces this.

You can contact me for a file that reproduces this inconsistency if needed, but it can also be reproduced by creating a file and adding multiple rectangle shapes with text in them. Save the file as DOCX and then convert it using the command above.
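
For repeated testing, the conversion step can be scripted. The following Python sketch wraps the command quoted in this report; it is not from the original reporter. The helper names `build_convert_command` and `convert` are hypothetical, `soffice` is assumed to be on the PATH, and the returned output path assumes the converter names the result after the input file with an `.html` extension.

```python
import subprocess
from pathlib import Path


def build_convert_command(docx_path, outdir, soffice="soffice"):
    """Build the argument list for the conversion command quoted in the report."""
    return [
        soffice,
        "--headless",
        "--convert-to", "html:HTML:EmbedImages",
        "--outdir", str(outdir),
        str(docx_path),
    ]


def convert(docx_path, outdir, soffice="soffice"):
    """Run the conversion and return the expected output path.

    Assumption: the output file is <input stem>.html inside `outdir`.
    """
    subprocess.run(build_convert_command(docx_path, outdir, soffice), check=True)
    return Path(outdir) / (Path(docx_path).stem + ".html")
```

Calling `convert("file.docx", "./dir")` runs the same command as above and raises `subprocess.CalledProcessError` if `soffice` exits with a non-zero status.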
Comment 1 Xisco Faulí 2021-10-14 18:47:40 UTC
Thank you for reporting the bug. Please attach a sample document, as this makes it easier for us to verify the bug. 
I have set the bug's status to 'NEEDINFO'. Please change it back to 'UNCONFIRMED' once the requested document is provided.
(Please note that the attachment will be public; remove any sensitive information before attaching it.
See https://wiki.documentfoundation.org/QA/FAQ#How_can_I_eliminate_confidential_data_from_a_sample_document.3F for help on how to do so.)
Comment 2 altunajulian 2021-10-15 15:55:11 UTC
Created attachment 175759
Test file to test the inconsistencies
Comment 3 Buovjaga 2022-10-19 10:43:33 UTC
Reproduced with the attached file.

Arch Linux 64-bit
Version: 7.5.0.0.alpha0+ / LibreOffice Community
Build ID: ffc23650d988051bf9fe43edeb4e16096907b080
CPU threads: 8; OS: Linux 6.0; UI render: default; VCL: kf5 (cairo+xcb)
Locale: fi-FI (fi_FI.UTF-8); UI: en-US
Calc: threaded
Built on 19 October 2022
Comment 4 QA Administrators 2024-10-19 03:18:11 UTC
Dear altunajulian,

To make sure we're focusing on the bugs that affect our users today, LibreOffice QA is asking bug reporters and confirmers to retest open, confirmed bugs which have not been touched for over a year.

There have been thousands of bug fixes and commits since anyone checked on this bug report. During that time, it's possible that the bug has been fixed, or the details of the problem have changed. We'd really appreciate your help in getting confirmation that the bug is still present.

If you have time, please do the following:

Test to see if the bug is still present with the latest version of LibreOffice from https://www.libreoffice.org/download/

If the bug is present, please leave a comment that includes the information from Help - About LibreOffice.
 
If the bug is NOT present, please set the bug's Status field to RESOLVED-WORKSFORME and leave a comment that includes the information from Help - About LibreOffice.

Please DO NOT

Update the version field
Reply via email (please reply directly on the bug tracker)
Set the bug's Status field to RESOLVED - FIXED (this status has a particular meaning that is not appropriate in this case)


If you want to do more to help you can test to see if your issue is a REGRESSION. To do so:
1. Download and install the oldest version of LibreOffice (usually 3.3 unless your bug pertains to a feature added after 3.3) from https://downloadarchive.documentfoundation.org/libreoffice/old/

2. Test your bug
3. Leave a comment with your results.
4a. If the bug was present in 3.3, set the version to 'inherited from OOo';
4b. If the bug was not present in 3.3, add 'regression' to the Keywords field.


Feel free to come ask questions or to say hello in our QA chat: https://web.libera.chat/?settings=#libreoffice-qa

Thank you for helping us make LibreOffice even better for everyone!

Warm Regards,
QA Team

MassPing-UntouchedBug