Description:
I found a PDF that cannot be opened correctly; see the linked file.

Steps to Reproduce:
1. Open this file: https://journals.ametsoc.org/doi/pdf/10.1175/1520-0477%281985%29066%3C0505%3APSASC%3E2.0.CO%3B2

Actual Results:
LibreOffice cannot show the first page correctly. I cannot upload a picture on this page.

Expected Results:
LibreOffice shows a PDF with bad characters; please see the link: https://journals.ametsoc.org/doi/pdf/10.1175/1520-0477%281985%29066%3C0505%3APSASC%3E2.0.CO%3B2

Reproducible: Didn't try

User Profile Reset: No

Additional Info:
Version: 6.4.2.2 (x64)
Build ID: 4e471d8c02c9c90f512f7f9ead8875b57fcb1ec3
CPU threads: 4; OS: Windows 6.1 Service Pack 1 Build 7601; UI render: default; VCL: win;
Locale: de-DE (de_DE); UI-Language: de-DE
Calc: threaded
The PDF opens fine; the issue is that it was prepared with OCR of the page images, so each page carries both the scanned image and the OCR text runs.

You can remove the OCR by opening the file in your PDF viewer of choice and then printing the result back to PDF. Only the page images will be output, with none of the OCR text runs.

Alternatively, if you prefer or need the OCR results, you can extract them with LibreOffice Draw. It is a manual process: on each page of the imported PDF, select the source page's image and delete it, leaving the OCR text runs behind.

That said, it would be convenient if the PDF import filter offered a way to strip out either the image or the OCR text when both are present.
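For anyone who wants to check a file for such an OCR layer before importing it, here is a rough Python sketch. It assumes the OCR tool wrote its text in text rendering mode 3 (invisible, the `3 Tr` operator), which is common for searchable scans but has not been confirmed for this particular file. The function name `count_invisible_text_streams` is made up for illustration, and this is not a real PDF parser: it only looks at raw or Flate-compressed streams and ignores every other filter.

```python
import re
import sys
import zlib


def count_invisible_text_streams(pdf_bytes: bytes) -> int:
    """Count streams that set text rendering mode 3 (invisible text).

    Heuristic only: streams are found by a naive byte search, and only
    uncompressed or Flate-compressed data is inspected.
    """
    count = 0
    # Grab everything between a 'stream' keyword and the next 'endstream'.
    pattern = re.compile(rb"(?<!end)stream\r?\n(.*?)endstream", re.DOTALL)
    for match in pattern.finditer(pdf_bytes):
        raw = match.group(1)
        try:
            # decompressobj tolerates the trailing end-of-line bytes
            # that sit between the data and 'endstream'.
            data = zlib.decompressobj().decompress(raw)
        except zlib.error:
            data = raw  # not Flate data; scan the bytes as they are
        # '3 Tr' selects render mode 3: text is neither filled nor stroked.
        if re.search(rb"(?<![\d.])3\s+Tr\b", data):
            count += 1
    return count


if __name__ == "__main__" and len(sys.argv) > 1:
    with open(sys.argv[1], "rb") as handle:
        found = count_invisible_text_streams(handle.read())
    print(f"streams with invisible text: {found}")
```

A non-zero count suggests there is a hidden text layer on top of the page images; a count of zero does not prove there is none, since the OCR text may be stored with a different filter or render mode.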
Created attachment 181270
PDF from the bug description

Let's add the linked PDF as an attachment.
*** This bug has been marked as a duplicate of bug 104770 ***