Bug 75208 - Fileopen DOCX: text boxes imported in frame with wrong spacing or empty (sample comment 7)
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer
Version (earliest affected): unspecified
Hardware: Other All
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords: filter:docx
Depends on:
Blocks: DOCX
Reported: 2014-02-19 14:17 UTC by Gorka Navarrete
Modified: 2020-01-09 16:55 UTC (History)
CC List: 3 users

See Also:
Crash report or crash signature:


Attachments
complex .docx document with lots of formatting issues when opened in LO (770.76 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2014-02-19 14:17 UTC, Gorka Navarrete
.DOCX resaved in MSO (554.11 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2020-01-09 15:39 UTC, Timur
Minimal 3 pages .docx saved in MSO (71.45 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2020-01-09 16:12 UTC, Timur
Minimal 3 pages .docx saved in MSO as PDF (454.64 KB, application/pdf)
2020-01-09 16:13 UTC, Timur

Description Gorka Navarrete 2014-02-19 14:17:07 UTC
Created attachment 94362
complex .docx document with lots of formatting issues when opened in LO

With every new version of OO, and later LO, I have tested how my 2009 PhD thesis (a .docx file) looks. Back in the day the formatting problems were horrendous. Now it is more bearable, but still a long way from perfect interoperability.

I am not sure whether this will help, but I decided to attach the document as an example of a real-world, complex .docx file that makes it impossible to switch back and forth between MS Office and LO, something necessary in collaborative environments such as academia.

Hopefully it can be used as a test case to help nail down the remaining formatting bugs.

To reproduce, simply open the file in MS Office 2007 and then in LO 4.2. Most problems involve figures not showing, text frames not showing all of their content (e.g. search for "Cuadro 2"), etc.

If needed I can try to list the formatting problems thoroughly.
Comment 1 Thomas van der Meulen 2014-03-23 17:05:40 UTC
Thank you for your bug report. I can reproduce this bug running:
Version: 4.3.0.0.alpha0+
Build ID: 1a67b7cc3d5dc3dcd0de0e247f638c33d57dea1b
TinderBox: MacOSX-x86@49-TDF, Branch:master, Time: 2014-03-23_05:59:09
OS: Mac OS X 10.9.2


I have compared it with Microsoft Word 2007 on Windows 7.

Because of the problems, the page numbers below refer to the pages as rendered in LibreOffice.
Problems that I have found:
- no smart-art on page 33, 80, 81, 99, 100, 113, 124, 142
- Text box is wrong on page 38, 46, 48, 75, 80, 81, 83, 86, 101, 105, 109, 120, 124, 145, 147
- on page 56, 62, 67, 88, 95/96 the white/not filled dots are placed wrong
- page counter in footer is missing
Comment 2 A (Andy) 2015-12-27 20:05:19 UTC
Reproducible with LO 5.1.0.1, Win 8.1

Missing page counter in the footer, missing smart-arts (e.g. page 33), different length of the document (164 vs 174 pages), wrong text boxes (e.g. page 76).
Comment 3 QA Administrators 2018-05-31 02:52:36 UTC Comment hidden (obsolete)
Comment 4 Roman Kuznetsov 2020-01-09 14:57:39 UTC
Still reproducible in:

Version: 6.5.0.0.alpha0+ (x64)
Build ID: 2d736e1a0a2bbd41fe7793d52bbcc7bfc89c7da3
CPU threads: 4; OS: Windows 10.0 Build 18362; UI render: default; VCL: win; 
Locale: ru-RU (ru_RU); UI-Language: en-US
Calc: threaded
Comment 5 Timur 2020-01-09 15:39:42 UTC
Created attachment 157040
.DOCX resaved in MSO

The sample .docx is in the 2007 format. Here is the .docx resaved in Word.

This was a wrong bug report, because the rule is one issue per bug, and even that only after searching for duplicates.
Comment 6 Timur 2020-01-09 16:02:16 UTC
(In reply to Thomas van der Meulen from comment #1)
> Problems that I have found:
> -no smart-art on page 33, 80, 81, 99, 100, 113, 124,142
Smart-art on page 33 is present in the new DOCX, with some distortion. It is an already known issue that it is not read from the 2007 format. Needs a search for the distortion issue.
Pages 80 and 81 have no smart-art in Word; that is page 83, same as the previous one. Etc.


> -Text box is wrong on page 38, 46, 48, 75, 80, 81, 83, 86, 101, 105, 109,
> 120, 124, 145, 147
Page 38: nothing. Page 39: seen in the new DOCX, with wrong spacing. Needs a search.
From page 83: seen in the new DOCX. The 1st has wrong spacing, the 2nd is empty. Needs a search.


> -on page 56,62, 67, 88, 95/96 the white/not filled dots are placed wrong
Not clear.


> - page counter in footer is missing
Not in new DOCX.


So the main issues are empty text boxes, wrong spacing in some of them, and smaller issues with smart-art.
If an issue is not found in a search, the report needs a minimal sample DOCX created in MSO.


"different length of the document (164 vs 174 pages)": no need to report this. It cannot be fixed as such, because it comes from the text engine and rendering.
Comment 7 Timur 2020-01-09 16:12:27 UTC
Created attachment 157041
Minimal 3 pages .docx saved in MSO

Here is a minimal 3-page .docx (around original page 83) saved in MSO for the text box issue.
And here things get strange.
Text boxes that were empty when opened from the full new DOCX saved in MSO are not empty in this minimal .docx.
Only the spacing issue remains.
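
To tell whether a text box is empty in the file itself or only after import, the text box content can be read straight from the DOCX package. This is a minimal sketch in Python, assuming the text sits in `w:txbxContent` elements of `word/document.xml`. The function name `text_box_contents` and the file names in the usage comment are hypothetical.

```python
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used by w:txbxContent and w:t.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def text_box_contents(docx_file):
    """Return the plain text of every w:txbxContent in word/document.xml.

    docx_file may be a path or a binary file-like object.
    An empty string in the result means the text box holds no text runs.
    """
    with zipfile.ZipFile(docx_file) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))

    boxes = []
    for box in root.iter(f"{{{W_NS}}}txbxContent"):
        # Concatenate all text runs found inside this text box.
        text = "".join(t.text or "" for t in box.iter(f"{{{W_NS}}}t"))
        boxes.append(text)
    return boxes


# Hypothetical usage: compare the full resaved file with the minimal one.
# for name in ("full_resaved.docx", "minimal_3_pages.docx"):
#     print(name, text_box_contents(name))
```

If a box has text here but shows up empty in Writer, the content is in the file and the problem is on the import side. Files saved by newer Word versions may list the same text box twice, once for the drawing shape and once for its VML fallback, so counts from the two files should be compared with that in mind.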
Comment 8 Timur 2020-01-09 16:13:02 UTC
Created attachment 157042
Minimal 3 pages .docx saved in MSO as PDF