Bug 91799 - FILEOPEN LibO doesn't open .doc file with table and page break correctly
Status: RESOLVED FIXED
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer
Version: 3.5.0 release (earliest affected)
Hardware: Other All
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords: filter:doc
Depends on:
Blocks: DOC-Tables DOC-Page
Reported: 2015-06-01 17:16 UTC by m_josenhans
Modified: 2020-06-03 11:23 UTC

See Also:
Crash report or crash signature:


Attachments
Doc version of the file (70.50 KB, application/msword)
2015-06-02 16:03 UTC, m_josenhans
Docx version of the file, created with MS Win Word. (16.73 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2015-06-02 16:05 UTC, m_josenhans

Description m_josenhans 2015-06-01 17:16:18 UTC
LibreOffice does not correctly open a *.doc file containing a table and page breaks. The same file, saved as *.docx with MS Word on Windows, does not open correctly either.

As I cannot completely ensure that all private data has been replaced, please contact me to obtain the test files.

Tested on Windows and Linux.
Comment 1 tommy27 2015-06-02 07:10:10 UTC
Please upload a test file.
Comment 2 m_josenhans 2015-06-02 16:03:53 UTC
Created attachment 116246
Doc version of the file

The original file with content replaced.
Comment 3 m_josenhans 2015-06-02 16:05:36 UTC
Created attachment 116247
Docx version of the file, created with MS Win Word.

This file is even more broken when viewed with LibreOffice.
Comment 4 Buovjaga 2015-06-09 12:05:03 UTC
(In reply to m_josenhans from comment #2)
> Created attachment 116246
> Doc version of the file
> 
> The original file with content replaced.

Yes, it is broken when compared with Word viewer.

LibO shows an empty first page, and the row heights in the table on the 2nd page are off.

Win 7 Pro 64-bit Version: 5.1.0.0.alpha1+
Build ID: d28102b1ed0c31500bbc68453a7b7613bd2bfa06
TinderBox: Win-x86@39, Branch:master, Time: 2015-06-09_01:06:39
Locale: fi-FI (fi_FI)
Comment 5 QA Administrators 2016-09-20 10:02:03 UTC Comment hidden (obsolete)
Comment 6 m_josenhans 2016-09-24 17:22:53 UTC
The table in the attached *.doc version is broken. The table in the attached *.docx version does not seem to be broken; however, it does not show any text content.

Could anyone with access to MS Word / MS Word Viewer please check the text content of the *.docx attachment and compare it to the text content shown in LibreOffice?
Comment 7 m_josenhans 2016-09-24 17:25:44 UTC
Addition to above comment: 
tested with:
Version: 5.1.4.2
Build ID: 1:5.1.4-0ubuntu1
CPU Threads: 2; OS Version: Linux 4.4; UI Render: default; 
Locale: de-DE (en_US.UTF-8)
Comment 8 Buovjaga 2016-10-06 11:35:01 UTC
DOC still rendered like in comment 4.
DOCX shows empty table in LibO.
DOCX table shows content in A1-A3 in Word Viewer.

DOCX file should get its own bug report, if you have the time and energy.

Win 7 Pro 64-bit Version: 5.3.0.0.alpha0+
Build ID: 7da2f3ce9f7b049c177a735a146dae84a764d3f7
CPU Threads: 4; OS Version: Windows 6.1; UI Render: default; 
TinderBox: Win-x86@42, Branch:master, Time: 2016-10-04_03:49:06
Locale: fi-FI (fi_FI); Calc: CL

LibreOffice 3.5.0rc3 
Build ID: 7e68ba2-a744ebf-1f241b7-c506db1-7d53735
Comment 9 m_josenhans 2016-11-11 15:53:57 UTC
As requested, the docx version has been put into a separate bug report:

See bug #103869.
Comment 10 Xisco Faulí 2017-11-07 23:30:23 UTC
Bug no longer reproducible in

Version: 6.0.0.0.alpha1+
Build ID: 38a8157d9447134913d88c6c373d99a7aa472857
CPU threads: 4; OS: Linux 4.10; UI render: default; VCL: gtk3; 
Locale: ca-ES (ca_ES.UTF-8); Calc: group

Closing as RESOLVED WORKSFORME

[Improvement found by office-interoperability-tools]
Comment 11 Xisco Faulí 2017-11-07 23:31:38 UTC
It would be interesting to bisect this issue to find the point at which it was fixed, to see if that helps to fix bug 103869 as well...
Comment 12 Justin L 2020-06-03 10:47:25 UTC
fixed in LO 5.4 by commit 6aba29576df7a2a40e54040d4dd09d94d6594741
Author: Miklos Vajna on Thu May 11 17:38:40 2017 +0200
    tdf#107773 DOC import: consider page breaks for multi-page floattables
    
    See commit 8fe8bd6c3b5b1a539b7370f8c457fa69c061d2de (Related: fdo#61594
    SwWW8ImplReader::StartApo: don't always start a frame, 2013-05-13) for
    the context, this is one more case where having a multi-page floattable
    as a multi-page one is a better option.
    
    The additional condition is that in case there is a break right after
    the table, then always import the table as a non-floating one, as there
    won't be any wrapping anyway (due to the break).
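
The additional condition described in the commit message above can be illustrated with a minimal Python sketch. This is not the actual LibreOffice code (the DOC importer is written in C++); `FloatTable`, `followed_by_break`, and `import_as_floating` are hypothetical names, and the fallback for tables without a trailing break is an assumption made only for illustration.

```python
from dataclasses import dataclass


@dataclass
class FloatTable:
    """Hypothetical stand-in for a floating table found in a DOC file."""
    followed_by_break: bool  # True if a break comes right after the table


def import_as_floating(table: FloatTable) -> bool:
    """Return True to keep the table floating, False to import it inline."""
    if table.followed_by_break:
        # Rule from the commit message: the break prevents any text
        # wrapping, so a floating frame would gain nothing.
        return False
    # Assumption: every other case stays floating. The real importer
    # applies further checks that are not modeled here.
    return True


print(import_as_floating(FloatTable(followed_by_break=True)))   # False
print(import_as_floating(FloatTable(followed_by_break=False)))  # True
```

In the reported file the table is followed by a page break, so under this rule it would be imported as a regular (non-floating) table that can flow across pages.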