Bug 93456 - Empty *.docx text-document after saving and reopening - XML Contains Data
Summary: Empty *.docx text-document after saving and reopening - XML Contains Data
Status: RESOLVED INVALID
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: LibreOffice (show other bugs)
Version:
(earliest affected)
Inherited From OOo
Hardware: All All
: highest major
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
: 93455 (view as bug list)
Depends on:
Blocks:
 
Reported: 2015-08-15 18:19 UTC by johannes.weisser
Modified: 2017-08-09 10:00 UTC (History)
4 users (show)

See Also:
Crash report or crash signature:


Attachments
Original File (20.88 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2015-08-15 18:24 UTC, Joel Madero
Details
PDF Export of Original File (30.67 KB, application/pdf)
2015-08-15 18:24 UTC, Joel Madero
Details
MS OFFICE 2010 PDF (236.46 KB, application/pdf)
2015-08-15 18:30 UTC, Joel Madero
Details

Note You need to log in before you can comment on or make changes to this bug.
Description johannes.weisser 2015-08-15 18:19:15 UTC
*.docx document was saved on Windows 7.

After reopening all text was lost. Document contained only 6 of the 12 original pages. Only thing that survived where the title page and the index.

the document opens almost fine in microsoft word. so it is most likly the import filter that is not working properly.
Comment 1 Joel Madero 2015-08-15 18:23:45 UTC
Just to clarify:

Steps to reproduce:

1. Open the attachment;

Observed:
The file appears to be several pages long but seems empty.

2. If you export to pdf you get empty pages as well

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Verified:
Ubuntu 15.04 x64
LibreOffice (3.3 - 5.0)

Note: This file was created with LibreOffice 3.3 - not with Microsoft Office.


Setting as:
New: Confirmed;
Major - the file appears to be completely empty despite the xml having all the required data - plus it opens in Word (2010) and was created with LibreOffice;
Highest - to me this seems serious enough to push to most annoying status, as it was created with LibreOffice and the result is a multi-page document that, despite having the needed info, comes up empty within LibreOffice
Comment 2 Joel Madero 2015-08-15 18:24:01 UTC
Created attachment 117937 [details]
Original File
Comment 3 Joel Madero 2015-08-15 18:24:21 UTC
Created attachment 117938 [details]
PDF Export of Original File
Comment 4 Joel Madero 2015-08-15 18:30:38 UTC
Created attachment 117939 [details]
MS OFFICE 2010 PDF
Comment 5 V Stuart Foote 2015-08-15 21:07:32 UTC
*** Bug 93455 has been marked as a duplicate of this bug. ***
Comment 6 Julien Nabet 2015-08-16 14:35:37 UTC
On pc Debian x86-64 with master sources updated today, I could reproduce this.
Here are some kinds of console logs:
warn:writerfilter:8783:1:writerfilter/source/dmapper/DomainMapper_Impl.cxx:558: no context of type 1 available
warn:svl:8783:1:svl/source/items/style.cxx:264: StyleSheet-Follow not found
warn:writerfilter:8783:1:writerfilter/source/dmapper/DomainMapper.cxx:1021: DomainMapper::lcl_attribute: unhandled token: 92355
warn:legacy.osl:8783:1:oox/source/helper/storagebase.cxx:67: StorageBase::StorageBase - missing base input stream
warn:legacy.osl:8783:1:vcl/source/gdi/image.cxx:392: ImageAryData::Load: failed to load image 'cmd/lc_changecasetolower.png'
warn:legacy.osl:8783:1:vcl/source/gdi/image.cxx:392: ImageAryData::Load: failed to load image 'cmd/lc_changecasetoupper.png'
warn:sw:8783:1:sw/inc/swrect.hxx:295: SVRect() without Width or Height
warn:legacy.osl:8783:1:editeng/source/items/frmitems.cxx:476: unknown MemberId
W: Unknown node under /registry/extlang: deprecated
W: Unknown node under /registry/grandfathered: comments
Comment 7 Oliver Specht (CIB) 2016-02-25 11:54:12 UTC
The problem is in the document in word/footer2.xml. It opens a field twice.

<w:ftr>
 <w:p>
  <w:pPr>
   <w:pStyle w:val="style29"/><w:jc w:val="center"/>
  </w:pPr>
  <w:r>
   <w:fldChar w:fldCharType="begin"/>
  </w:r>
  <w:r>
   <w:rPr/>
   <w:fldChar w:fldCharType="begin"/>
  </w:r>
  <w:r>
   <w:instrText> PAGE </w:instrText>
  </w:r>
  <w:r>
   <w:fldChar w:fldCharType="separate"/>
  </w:r>
  <w:r>
   <w:t>13</w:t>
  </w:r>
  <w:r>
   <w:fldChar w:fldCharType="end"/>
  </w:r>
 </w:p>
</w:ftr>
Comment 8 QA Administrators 2017-03-06 15:04:44 UTC Comment hidden (obsolete)
Comment 9 Timur 2017-08-09 10:00:13 UTC
As Joel already noted "This file was created with LibreOffice 3.3" and this bug shouldn't have been confirmed like this. 

Reporter didn't specify:
- do you have the original document, with all the text, can you attach that one?
- what is the original document before saving, odt or docx, in which program?
- how was the document created, from scratch or by coping from somewhere or by saving from some other format?
etc.

Attachment 117937 [details] shows 219 errors in OOXML Productivity Tool.
Conclusion: this is most likely LO filesave bug, but we cannot confirm without original DOCX from MSO or ODT from LO that saves wrong as DOCX. 

If attachment 117937 [details] is saved in MSO, it opens with text in LO. 
So I'll close this one as Invalid.