Bug 136227 - FILEOPEN DOCX with a large 23 pages table slow (4 seconds ODT 28 seconds DOCX)
Summary: FILEOPEN DOCX with a large 23 pages table slow (4 seconds ODT 28 seconds DOCX)
Status: RESOLVED FIXED
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer (show other bugs)
Version:
(earliest affected)
6.4.0.3 release
Hardware: All All
: low minor
Assignee: Not Assigned
URL:
Whiteboard:
Keywords: bibisected, bisected, filter:docx, perf, regression
Depends on:
Blocks: DOCX-Tables DOCX-Opening
  Show dependency treegraph
 
Reported: 2020-08-28 14:04 UTC by Telesto
Modified: 2023-07-31 18:45 UTC (History)
3 users (show)

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Telesto 2020-08-28 14:04:15 UTC
Description:
FILEOPEN DOCX with a large table slow (4 seconds ODT 28 seconds DOCX)

Steps to Reproduce:
1. Open attachment 149343 [details] (bug 123517) (and monitor opening time: around 2 seconds)
3. Save as DOCX
4. Close the document
5. Open the DOCX -> 28 seconds

Actual Results:
Opening time for DOCX 28 seconds

Expected Results:
In the area of ODT


Reproducible: Always


User Profile Reset: No



Additional Info:
Found in
7.1

12 seconds with
6.0.5


11 seconds with 
4.4.7.2
Comment 1 Telesto 2020-08-28 14:05:44 UTC
12 seconds with
Version: 6.3.0.0.beta1+ (x86)
Build ID: 5cfac16dbd4af456a7fb6d52c8953c69a72ba2ba
CPU threads: 4; OS: Windows 6.3; UI render: default; VCL: win; 
Locale: nl-NL (nl_NL); UI-Language: en-US
Calc: CL
Comment 2 Dieter 2020-11-03 20:08:01 UTC
Sounds lik a duplicate of a bug, you've reported before. Please feeld free to change it backto UNCONFIRMED, if you disagree.

*** This bug has been marked as a duplicate of bug 135683 ***
Comment 3 Aron Budea 2020-11-04 01:36:31 UTC
There's a performance regression here, let's keep this ticket open on the regression, the general performance difference of ODT vs DOCX can be handled in another ticket.

The times I measured are ~15 s (6.3 and before) vs ~25s (6.4 and after).
Bibisected to the following commit using repo bibisect-linux-64-6.4. Adding CC: to Michael Stahl

https://cgit.freedesktop.org/libreoffice/core/commit/?id=5ba30f588d6e41a13d68b1461345fca7a7ca61ac
author		Michael Stahl <Michael.Stahl@cib.de>	2019-09-06 19:36:48 +0200
committer	Michael Stahl <Michael.Stahl@cib.de>	2019-09-17 10:45:40 +0200

tdf#64222 sw: better DOCX import/export of paragraph marker formatting
Comment 4 Timur 2020-11-04 07:09:46 UTC Comment hidden (obsolete)
Comment 5 Timur 2021-09-09 13:45:41 UTC
DOCX loading with 'time' is:

43all oldest/3.5
real	0m13,365s   user	0m12,395s   sys	0m0,377s
41max m
real	0m10,972s   user	0m10,128s   sys	0m0,211s
42max m
real	0m13,166s   user	0m11,501s   sys	0m0,191s
44max m
real	0m13,681s   user	0m11,537s   sys	0m0,313s
5.2 m 
real	0m24,968s   user	0m21,194s   sys	0m0,476s
6.0 m
real	0m23,981s   user	0m22,522s   sys	0m0,488s
6.3 m
real	0m27,135s   user	0m26,031s   sys	0m0,613s
6.4 m
real	0m31,946s   user	0m29,505s   sys	0m0,606s
7.3+ m
real	0m35,849s   user	0m35,054s   sys	0m0,600s

Bug 144395 has a larger and thus better sample for regression in 6.4.
This one is showing a continuous degradation from 4.2.
Comment 6 Gabor Kelemen (allotropia) 2023-05-15 15:52:43 UTC
So this went for me down from 

$ time OOO_EXIT_POST_STARTUP=1 isw --norestore Untitled\ 3.docx

real    0m30,390s
user    0m0,000s
sys     0m0,015s

to 

$ time OOO_EXIT_POST_STARTUP=1 isw --norestore Untitled\ 3.docx

real    0m11,590s
user    0m0,000s
sys     0m0,015s

in 7.4.5 with

https://git.libreoffice.org/core/+/3998b98749739b2c499ffc4d83188e1034b66750

author	Miklos Vajna <vmiklos@collabora.com>	Mon Dec 19 08:47:18 2022 +0100
committer	Mike Kaganski <mike.kaganski@collabora.com>	Thu Dec 22 06:12:45 2022 +0000

sw: ODT import/export of DOCX's paragraph marker formatting


For reference the odt loads in:
$ time OOO_EXIT_POST_STARTUP=1 isw --norestore Untitled\ 3\(1\).odt

real    0m5,736s
user    0m0,015s
sys     0m0,000s

But the 4.x load times are back.