Bug Hunting Session
Bug 119142 - FILEOPEN DOCX: MS text boxes imported as frames when solo, and as LO text boxes when grouped
Summary: FILEOPEN DOCX: MS text boxes imported as frames when solo, and as LO text box...
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer (show other bugs)
(earliest affected) release
Hardware: x86-64 (AMD64) All
: medium normal
Assignee: Not Assigned
Depends on:
Blocks: DOCX-Styles
  Show dependency treegraph
Reported: 2018-08-07 12:58 UTC by Costas
Modified: 2019-09-10 14:47 UTC (History)
4 users (show)

See Also:
Crash report or crash signature:

examples of lost style formatting between word and writer (22.59 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2018-08-07 12:59 UTC, Costas
Text boxes in Word, grouped and ungrouped (278.14 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2019-09-10 14:17 UTC, Costas
File saved as old DOC file (295.50 KB, application/msword)
2019-09-10 14:43 UTC, Costas

Note You need to log in before you can comment on or make changes to this bug.
Description Costas 2018-08-07 12:58:18 UTC
There seems to be a bug in MS word that breaks LO when opening docx files.

MS Text boxes are imported as LO frames, which is good because styles are preserved.

If the MS text boxes are grouped, then LO imports them as LO text boxes, with no style preservation.

It must relate to an MS bug because if in MS Word I copy and paste multiple ungrouped MS text boxes, the style is not preserved!

Copy paste already grouped MS text boxes preserves the styles.

I am attaching a docx file with examples of text boxes, solo and grouped.

ps. dont tell me to go and tell MS to fix their software. I would rather have LO give an option to convert all textboxes to frames even if they are grouped.

Steps to Reproduce:
1.with word create 3 text boxes with styles ike title and heading
2.group 2 of the boxes together and leave the other alone
3.open the file in LO

Actual Results:
no formatting in the text boxes that were grouped in MS Word

Expected Results:
Styles to be preserved even of text boxes are grouped.

Reproducible: Always

User Profile Reset: No

OpenGL enabled: Yes

Additional Info:
Comment 1 Costas 2018-08-07 12:59:37 UTC
Created attachment 144010 [details]
examples of lost style formatting between word and writer
Comment 2 Costas 2018-08-12 19:30:00 UTC
Just to clarify that opening doxc files with grouped text boxes does NOT break LO, it just doesn't work as expected. I guess I was a bit frustrated when I wrote the report :)

The same behaviour exists in 

Build ID: 1:6.0.3-0ubuntu1
Comment 3 Buovjaga 2018-09-07 13:07:16 UTC

In old versions of LibO the grouped text boxes are not seen, just paragraphs.

Arch Linux 64-bit
Build ID: 033a68c49fe2b8aa397832d92d400eb0259ea809
CPU threads: 8; OS: Linux 4.18; UI render: default; VCL: gtk3_kde5; 
Locale: fi-FI (fi_FI.UTF-8); Calc: threaded
Built on September 5th 2018
Comment 4 QA Administrators 2019-09-09 05:30:26 UTC Comment hidden (obsolete)
Comment 5 Costas 2019-09-10 14:14:22 UTC
I confirm that the bug is still present in 

LO Version: (x64)
Build ID: b79626edf0065ac373bd1df5c28bd630b4424273
CPU threads: 4; OS: Windows 10.0; UI render: default; VCL: win; 
Locale: en-GB (en_GB); UI-Language: en-GB
Calc: threaded

I uploaded a new file for testing
Comment 6 Costas 2019-09-10 14:17:49 UTC
Created attachment 154080 [details]
Text boxes in Word, grouped and ungrouped

New file for testing, the old one had issues with placement of the text boxes. I have included photos of how it looks on my computer, in case there are differences in yours.
Comment 7 Costas 2019-09-10 14:43:58 UTC
Created attachment 154082 [details]
File saved as old DOC file

LO 3.3 does opens the file properly and styles are preserved.
LO 6.5 opens file but complains it has macros, styles are preserved
Comment 8 Costas 2019-09-10 14:47:17 UTC
In summary:

DOCX (latest windows 365 build)

LO 3.3 opens the docx file and the boxes are empty and all over the place.
LO 6.5 opens the docx file and the formatting is not preserved in the grouped boxes.

97-2003 DOC
LO 3.3 opens the doc file and formatting is preserved in the text boxes
LO 6.5 opens the doc file complaining there is a macro, and formatting is preserved in the text boxes

I hope someone can make sense of all the confounding factors.