Bug 143313 - FILEOPEN Large PDF's take too long to load in LibreOffice Draw
Summary: FILEOPEN Large PDF's take too long to load in LibreOffice Draw
Status: RESOLVED INSUFFICIENTDATA
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Draw (show other bugs)
Version:
(earliest affected)
7.0.3.1 release
Hardware: All All
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: PDF-Import-Draw
  Show dependency treegraph
 
Reported: 2021-07-12 12:47 UTC by maxela5435
Modified: 2022-01-22 20:48 UTC (History)
3 users (show)

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description maxela5435 2021-07-12 12:47:02 UTC
Description:
I tried disabling OpenCL and switching OS'ses and computers but the bug is reproducible in LibreOffice 7.0.3.1 and also the lastest one (7.1.4.2), this issue does not happend when opening small pdf's.

Steps to Reproduce:
1. Open a large PDF file in LibreOffice Draw  (20MB)
2. Find out it consumes your system resources
3. It finnaly opens after 6 minutues

3.

Actual Results:
It takes around 6 minutes to load.

Expected Results:
PDF opens in one second like Adobe PDF.


Reproducible: Always


User Profile Reset: No



Additional Info:
Version: 7.1.4.2 (x64) / LibreOffice Community
Build ID: a529a4fab45b75fefc5b6226684193eb000654f6
CPU threads: 4; OS: Windows 10.0 Build 17763; UI render: Skia/Raster; VCL: win
Locale: es-ES (es_ES); UI: es-ES
Calc: threaded
Comment 1 V Stuart Foote 2021-07-12 14:39:39 UTC
Please split [1] your "large" PDF into individual PDF pages--isolate the page(s) that are slow for filter import to Draw.

Post the extracted slow loading PDF page(s) to this issue.

=-ref-=
[1] you can use Adobe Acrobat if you have access. Or simply the PDFtk 'free' toolkit (https://www.pdflabs.com/tools/pdftk-the-pdf-toolkit/) will work here to split your problem PDF.
Comment 2 Kevin Suo 2021-07-15 01:08:27 UTC
I have submitted several patched related to the sdext.pdfimport. Could you please use a most recent daily build to test if it is faster now. If not, please sent me a test pdf via email.
Comment 3 maxela5435 2021-07-17 11:50:54 UTC
I am afraid the PDF contains some "personal info", however, I can tell you that I isolated a page witch is a floorplan with a lot of geometrical details and lines. I'm starting testing with daily build.
Comment 4 maxela5435 2021-07-17 12:12:03 UTC
Testing with LibreOffice Alpha0 7.3.0.0, reveals issue is still ocurring in the alpha version. It takes more or less the same time.
Comment 5 QA Administrators 2021-07-18 03:30:14 UTC Comment hidden (obsolete)
Comment 6 V Stuart Foote 2021-07-18 15:02:09 UTC
(In reply to maxela5435 from comment #3)
> I am afraid the PDF contains some "personal info", however, I can tell you
> that I isolated a page witch is a floorplan with a lot of geometrical
> details and lines. I'm starting testing with daily build.

So, if you delete that one page with complex floorplan, the import filter brings the PDF into draw reasonably quickly otherwise?

We've seen similar issues importing SVG and WMF/EMF with very complex fill patterns.  We'd need the floorplan page of the PDF to test against. Post it if able--or, "anonymize" it (replace all text)--and if still slow to load please post that.
Comment 7 maxela5435 2021-07-18 18:07:28 UTC
If I isolate the two pages that weight 1MB each, PDF still takes long to load, each 1MB page takes around 4 minutes to import, I am sorry, I can't anonymize the PDF because it contains dozens of text and even editing is slow (bug #108411 and bug #101674), Is a real problem, this is a dealbreaker for me at least. I'm taking a discussion to Ask LibreOffice.
Comment 8 Buovjaga 2021-07-18 18:42:03 UTC
(In reply to maxela5435 from comment #7)
> If I isolate the two pages that weight 1MB each, PDF still takes long to
> load, each 1MB page takes around 4 minutes to import, I am sorry, I can't
> anonymize the PDF because it contains dozens of text and even editing is
> slow (bug #108411 and bug #101674), Is a real problem, this is a dealbreaker
> for me at least. I'm taking a discussion to Ask LibreOffice.

For editing PDFs with thousands of objects, I recommend sK1: https://sk1project.net/sk1/download/

Hopefully you could use it to anonymise somehow.
Comment 9 QA Administrators 2022-01-15 04:02:03 UTC
Dear maxela5435,

This bug has been in NEEDINFO status with no change for at least
6 months. Please provide the requested information as soon as
possible and mark the bug as UNCONFIRMED. Due to regular bug
tracker maintenance, if the bug is still in NEEDINFO status with
no change in 30 days the QA team will close the bug as INSUFFICIENTDATA
due to lack of needed information.

For more information about our NEEDINFO policy please read the
wiki located here:
https://wiki.documentfoundation.org/QA/Bugzilla/Fields/Status/NEEDINFO

If you have already provided the requested information, please
mark the bug as UNCONFIRMED so that the QA team knows that the
bug is ready to be confirmed.
 
Thank you for helping us make LibreOffice even better for everyone!

Warm Regards,
QA Team

MassPing-NeedInfo-Ping