Bug 134804 - Large Write doc corrupted on opening
Summary: Large Write doc corrupted on opening
Status: RESOLVED INSUFFICIENTDATA
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer (show other bugs)
Version:
(earliest affected)
6.3.6.2 release
Hardware: x86-64 (AMD64) Windows (All)
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2020-07-14 15:50 UTC by Alan Cummings
Modified: 2022-05-16 13:49 UTC (History)
5 users (show)

See Also:
Crash report or crash signature:


Attachments
UPDATE on Writer large file corruption error (6.26 MB, application/msword)
2020-11-16 23:14 UTC, Alan Cummings
Details
Image to support report to Xisco Faulí for bug 134804 (377.93 KB, image/jpeg)
2022-05-02 13:57 UTC, Alan Cummings
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Alan Cummings 2020-07-14 15:50:38 UTC
Description:
I worked on the word processed document - 55 or so pages of text and graphics. Everything was normal.  Reopened the document this afternoon.  Instead of opening at the last text entry as usual it opened 2-3 pages earlier.  The header margin was increased and the cursor was in the header.  I scrolled down to continue.  I entered a couple of sentences and the app hung. I could only close and reopen. I found that many images had been replaced with the same image and the positioning was altered. Some pages had developed large gaps in the text. 

Actual Results:
As it "trashed" my document, it is not possible to reproduce the error.
My laptop is an HP Pavilion 15-cs with 120Gb NVMe and 8Gb RAM. Fully updated Windows 10. HDD has 61Gb free

Expected Results:
I resaved with a new filename. I reopened the original file and found too much damage to continue use. I had a PDF of the file from 2 days back so loaded that, noted the last text. I opened the Writer document and deleted everything above that which was in the PDF. The remaining new text and images then automatically corrected the layout as entered yesterday and today. I saved and then created a new PDF.


Reproducible: Didn't try


User Profile Reset: Yes


OpenGL enabled: Yes

Additional Info:
[Information automatically included from LibreOffice]
Locale: en-GB
Module: TextDocument saved as odt format
[Information guessed from browser]
OS: Windows 10 Home Edition - fully updated.
OS is 64bit: Yes
Version: 6.3.6.2 (x64)
Build ID: 2196df99b074d8a661f4036fca8fa0cbfa33a497
CPU threads: 4; OS: Windows 10.0; UI render: GL; VCL: win; 
Locale: en-GB (en_GB); UI-Language: en-GB
Calc: threaded
gerrit.libreoffice.org / core / 2196df99b074d8a661f4036fca8fa0cbfa33a497
Comment 1 Roman Kuznetsov 2020-07-14 16:33:52 UTC
Without your source document we can't repro your problem. Please attach the document if you'll see the same problem again
Comment 2 Alan Cummings 2020-07-15 08:22:56 UTC
(In reply to Roman Kuznetsov from comment #1)
> Without your source document we can't repro your problem. Please attach the
> document if you'll see the same problem again

Hi Roman
The document is 44Mb and would not send on the report.  I could provide a link to the report in Google Drive if you wish.
However, the document is "trashed" which suggests that it was the Save process the previous evening that perhaps did the damage.
Many thanks for your reply.
Comment 3 QA Administrators 2020-07-16 03:44:52 UTC Comment hidden (obsolete)
Comment 4 Xisco Faulí 2020-07-16 09:46:50 UTC
Could you please attach a minimal sample of the document where the issue is reproduced ?
Comment 5 Alan Cummings 2020-11-16 23:08:25 UTC Comment hidden (obsolete)
Comment 6 Alan Cummings 2020-11-16 23:14:53 UTC
Created attachment 167345 [details]
UPDATE on Writer large file corruption error

UPDATE:
Version: 6.4.7.2 (x64)
Build ID: 639b8ac485750d5696d7590a72ef1b496725cfb5
CPU threads: 4; OS: Windows 10.0 Build 19041; UI render: default; VCL: win; 
Locale: en-GB (en_GB); UI-Language: en-GB
Calc: threaded

Working on a NEW Writer document today, loaded and saved exclusively in Ms Word format I have again experienced file corruption.  The document was written in Chapters saved individually.  The document that corrupted was a "page 1" with all other chapters loaded into it then saved before editing.

First I noticed that left and right margin widths had changed and were unable to be corrected on pages or for the entire highlighted document.  Then, on examination, I found that a number of the images in the document had moved or had disappeared leaving only a text frame as previously.

I have attached the entire document this time.
Comment 7 QA Administrators 2020-11-17 05:01:24 UTC Comment hidden (obsolete)
Comment 8 Buovjaga 2021-07-27 14:41:50 UTC
(In reply to Alan Cummings from comment #6)
> Created attachment 167345 [details]
> UPDATE on Writer large file corruption error

No other comments, but I noticed this is in .doc format. It is a good idea to treat non-ODF formats as export-only, so your original document is always in .odt format.
Comment 9 BogdanB 2021-07-28 05:52:39 UTC
I have read this on wikipedia:
"Because the DOC file format was a closed specification for many years, inconsistent handling of the format persists and may cause some loss of formatting information when handling the same file with multiple word processing programs."

https://en.wikipedia.org/wiki/Doc_(computing)

So, as Buovjaga have said, don't use .doc format.
Comment 10 Xisco Faulí 2022-05-02 12:17:59 UTC
A new major release of LibreOffice is available since this bug was reported.
Could you please try to reproduce it with the latest version of LibreOffice
from https://www.libreoffice.org/download/libreoffice-fresh/ ?
I have set the bug's status to 'NEEDINFO'. Please change it back to
'UNCONFIRMED' if the bug is still present in the latest version.
Comment 11 Alan Cummings 2022-05-02 13:57:04 UTC Comment hidden (obsolete)
Comment 12 Alan Cummings 2022-05-02 15:37:06 UTC
(In reply to Alan Cummings from comment #11)
> Created attachment 179890 [details]
> Image to support report to Xisco Faulí for bug 134804
> 
> Image showing page length issue following the fix of missing / corrupted
> issues in large Word / Write document using .DOCX format.

The image corruption / loss originally reported appears to be resolved (the document is now 188 pages) and the images have loaded correctly.  The document format is also correct apart from the new issue of the page length being different from the Word original.  
The issue could be a font issue as the Word version uses Arial.  I will try to download Arial to Lo and will retry the load.

Many thanks for all the help guys.
Alan
Comment 13 QA Administrators 2022-05-03 03:42:11 UTC Comment hidden (obsolete)
Comment 14 Timur 2022-05-16 13:46:48 UTC
Finally, it wasn't possible to identify a cause and this wasn't reproducible, so I set InsuficcientData.

I'm sorry for all the troubles, but basic principles are: use ODT and export as DOC/DOCX, report with minimal sample not holding personal data, treat every problem as a different bug, search in existing bugs..

Good luck.
Comment 15 Timur 2022-05-16 13:49:27 UTC
As for: 
 Instead of opening at the last text entry as usual it opened 2-3 pages earlier
it's likely bug 146988 	.