Bug 151577 - Writer PDF import filter should default to producing paragraphs of text, not drawing objects
Summary: Writer PDF import filter should default to producing paragraphs of text, not ...
Status: RESOLVED DUPLICATE of bug 32249
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: filters and storage (show other bugs)
Version:
(earliest affected)
7.5.0.0 alpha0+
Hardware: All All
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: PDF-Import-Writer
  Show dependency treegraph
 
Reported: 2022-10-16 18:59 UTC by Eyal Rozenberg
Modified: 2022-10-17 02:27 UTC (History)
1 user (show)

See Also:
Crash report or crash signature:


Attachments
A two-paragraph Writer document exported to PDF (13.09 KB, application/pdf)
2022-10-16 18:59 UTC, Eyal Rozenberg
Details
The original Writer document (28.97 KB, application/vnd.oasis.opendocument.text)
2022-10-16 19:00 UTC, Eyal Rozenberg
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Eyal Rozenberg 2022-10-16 18:59:32 UTC
Created attachment 183089 [details]
A two-paragraph Writer document exported to PDF

When opening a (Writer-created) PDF document, with text in several paragraphs, the resulting document should be paragraphs of text, very similar or identical to those in the original document which produced the PDF. This, provided that the PDF has not been manipulated in some complex and esoteric way which breaks up its paragraphs internally (i.e. when it is objectively difficult to decide whether such paragraphs exist and what their boundaries are).

We should not be getting a bunch of independently-positioned drawing objects - single word or single line - except for the content in the PDF document which necessitates it. The drawing-object-by-default approach may be fitting for use in LO Draw or Impress (although there as well one could consider paragraph-level boxes).

Reproduction instruction:

1. Create a new Writer document
2. Enter a couple of paragraphs of text; make each of them multi-line.
3. Save the document as a PDF.
4. Open the PDF in Writer (not in Draw! Use the Writer PDF import filter)

Expected result: You get paragraphs of text.

Actual result: You get many single-line textboxes.

The attachment will let you skip steps (1.)-(3.) .
Comment 1 Eyal Rozenberg 2022-10-16 19:00:56 UTC
Created attachment 183090 [details]
The original Writer document

Opening the PDF should result in a document that is very similar to this one (the original document exported to PDF).
Comment 2 m_a_riosv 2022-10-17 02:27:50 UTC

*** This bug has been marked as a duplicate of bug 32249 ***