Bug 151577 - Writer PDF import filter should default to producing paragraphs of text, not drawing objects
Status: RESOLVED DUPLICATE of bug 32249
Product: LibreOffice
Component: filters and storage
Blocks: PDF-Import-Writer
Reported: 2022-10-16 18:59 UTC by Eyal Rozenberg
Modified: 2022-10-17 02:27 UTC
A two-paragraph Writer document exported to PDF (13.09 KB, application/pdf)
2022-10-16 18:59 UTC, Eyal Rozenberg
The original Writer document (28.97 KB, application/vnd.oasis.opendocument.text)
2022-10-16 19:00 UTC, Eyal Rozenberg

Comment 1 Eyal Rozenberg 2022-10-16 18:59:32 UTC
Attachment: A two-paragraph Writer document exported to PDF

When opening a Writer-created PDF document whose text spans several paragraphs, the resulting document should consist of paragraphs of text, very similar or identical to those in the original document that produced the PDF. This holds provided the PDF has not been manipulated in some complex, esoteric way that breaks up its paragraphs internally, i.e. as long as it is not objectively difficult to decide whether such paragraphs exist and where their boundaries are.

We should not be getting a bunch of independently positioned drawing objects, each holding a single word or a single line, except where the content of the PDF document necessitates it. The drawing-object-by-default approach may be fitting for LO Draw or Impress (although there, too, one could consider paragraph-level boxes).
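
To illustrate the kind of grouping meant here, below is a minimal sketch of merging single-line text boxes into paragraphs by looking at the vertical gap between consecutive lines. It is not LibreOffice's import code and makes no claim about how the filter works internally; the `Line` type, its fields, the function name `group_into_paragraphs` and the `gap_factor` threshold are all hypothetical, chosen only for this example.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Line:
    """Hypothetical single-line text box, as an importer might see it."""
    text: str
    top: float     # distance from the top of the page, in points
    height: float  # height of the line box, in points


def group_into_paragraphs(lines: List[Line], gap_factor: float = 0.5) -> List[str]:
    """Merge vertically adjacent lines into paragraphs.

    A new paragraph starts whenever the gap between the bottom of the
    previous line and the top of the current one exceeds
    gap_factor * previous line height (an assumed, untuned threshold).
    """
    paragraphs: List[List[str]] = []
    prev = None
    for line in sorted(lines, key=lambda l: l.top):
        if prev is None:
            starts_new = True
        else:
            gap = line.top - (prev.top + prev.height)
            starts_new = gap > gap_factor * prev.height
        if starts_new:
            paragraphs.append([line.text])
        else:
            paragraphs[-1].append(line.text)
        prev = line
    # Join the lines of each paragraph with a single space.
    return [" ".join(parts) for parts in paragraphs]
```

A real PDF would also need indentation, columns and font changes taken into account; the sketch only shows that, for simple Writer-produced output, paragraph boundaries can plausibly be inferred from line spacing.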

Steps to reproduce:

1. Create a new Writer document.
2. Enter a couple of paragraphs of text; make each of them multi-line.
3. Save the document as a PDF.
4. Open the PDF in Writer (not in Draw! Use the Writer PDF import filter).

Expected result: You get paragraphs of text.

Actual result: You get many single-line text boxes.

The attached PDF lets you skip steps 1-3.
Comment 2 Eyal Rozenberg 2022-10-16 19:00:56 UTC
Attachment: The original Writer document

Opening the PDF should result in a document that is very similar to this one, i.e. the original document from which the PDF was exported.
Comment 3 m.a.riosv 2022-10-17 02:27:50 UTC

*** This bug has been marked as a duplicate of bug 32249 ***