Bug 136884 - Stuck forever on opening DOC file with chinese fonts and images contents
Summary: Stuck forever on opening DOC file with chinese fonts and images contents
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer (show other bugs)
Version:
(earliest affected)
Inherited From OOo
Hardware: x86-64 (AMD64) Windows (All)
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: Calc-External-Datalink
  Show dependency treegraph
 
Reported: 2020-09-18 22:40 UTC by Meriyi
Modified: 2023-09-21 02:04 UTC (History)
4 users (show)

See Also:
Crash report or crash signature:


Attachments
doc file (6.07 MB, application/msword)
2020-09-18 22:40 UTC, Meriyi
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Meriyi 2020-09-18 22:40:17 UTC
Created attachment 165674 [details]
doc file

Check the attached MS doc file. The doc file contains chinese fonts and some images (I'm not the author of the doc file). When I tried to open it with libreoffice writer, it's stuck at "Importing document" process. I had to kill the libreoffice process with task Manager. There is no issue if I open the DOC file with WPS writer which is very fast (took about 3 seconds).

I used the online converter (https://document.online-convert.com/convert-to-doc) to convert the doc file to same doc filetype. I can open the converted DOC file in libreoffice writer now, but the importing was very slow (about 1-2 minutes to finish). Then I saved it as ODT file, the ODT file can be opened much faster (about 6 or 7 seconds). Then I tried saving the ODT file as DOC file in the writer, opening the created DOC file requires around 1 minute.

I had tried disabling the openGL setting but no change on the result. I had tried apache openoffice too, same result.

The issue exists on libreoffice v6 and the latest v7.
Comment 1 Meriyi 2020-09-20 09:00:03 UTC
Further note: using online converter isn't really a good workaround, I found out the conversion result isn't perfect, some images are missing. In libreoffice writer v6, sometimes the converted doc file can't be opened at all (stuck at "Importing document" process too like the original doc file).
Comment 2 Julien Nabet 2020-09-20 17:06:51 UTC
On pc Debian x86-64 with master sources updated today, I could reproduce this.

I noticed this log on console:
warn:filter.ms:81357:81357:filter/source/msfilter/msdffimp.cxx:6269: remaining record longer than available data, ppt or parser is wrong
Comment 3 Noel Grandin 2020-09-25 11:44:44 UTC
The problem here is that the document contains URL links to dozens of images on the internet which no longer exist, so the load process is sitting there trying to load dozens of images which do not exist.
Comment 4 QA Administrators 2022-11-14 03:31:56 UTC Comment hidden (obsolete)
Comment 5 Kira Tubo 2023-09-21 02:04:54 UTC
About 54 seconds on v.3.3 and 59 seconds on daily master build on windows. Updating earliest version from 7.0.1.2 to "Inherited from OOo"

Version: 24.2.0.0.alpha0+ (X86_64) / LibreOffice Community
Build ID: 486ae5db6987411d5e394de94b2b077099d03856
CPU threads: 6; OS: Windows 10.0 Build 22621; UI render: Skia/Raster; VCL: win
Locale: en-US (en_US); UI: en-US
Calc: CL threaded