Bug 162487 - Draw 24.2.5.2 imported PDF text runs split into a single draw object per character
Status: NEEDINFO
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: filters and storage
Version: 24.2.5.2 release (earliest affected)
Hardware: All Windows (All)
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: PDF-Import-Draw
Reported: 2024-08-15 22:53 UTC by Debra James
Modified: 2024-09-13 11:02 UTC (History)
CC List: 3 users

See Also:
Crash report or crash signature:


Attachments

Description Debra James 2024-08-15 22:53:05 UTC
Description:
Text cannot be selected and overwritten with replacement text.  Sentences and words do not appear in edit fields.  Letters in words can be selected and deleted, but a replacement letter cannot be entered.

Steps to Reproduce:
1. Open a PDF document in Draw.
2. Choose the pointer to select text for editing. No cursor appears within the box that contains the selected text. A single letter can be selected, but it can only be deleted.
3. Undoing the deletion is the only way to restore the text.

Actual Results:
When a letter is deleted, the edit box disappears with it. A keystroke made to replace the deleted text has no effect, and nothing can be typed into the empty space.

Expected Results:
A cursor appears within the selected text box, and letters and words can be deleted and replaced with new text in the same font and size as the original.


Reproducible: Always


User Profile Reset: No

Additional Info:
No other information.
Comment 1 m_a_riosv 2024-08-15 23:41:58 UTC
Maybe you need to enable
Menu/Edit/Edit Mode
or the file has read-only permissions.

Please attach a sample file, reduced in size as much as possible and without private information, and paste the information from Menu/Help/About LibreOffice (there is a copy icon).
Comment 2 V Stuart Foote 2024-08-16 02:01:22 UTC
LibreOffice cannot edit a PDF document. Instead, we filter import a PDF and convert its elements into ODF drawing objects. Text is parsed into text runs that fill draw Text boxes, without any lexical context from the source document, simply following the element sequence of the PDF document.

Fidelity to the original layout can vary greatly, and the ability to "touch up" the draw text boxes holding text runs from the PDF depends on whether the PDF included a usable embedded font, or whether your system has the font used in the PDF installed locally.
Comment 3 V Stuart Foote 2024-08-18 00:58:34 UTC
As appended to meta bug 99746

<snip>
Comment 9 -- bug 99746
Debra James 2024-08-17 23:42:10 UTC
Created attachment 195882
Able to edit PDFs on Dell XPS 13 9370 with Draw v7.3.4.2

I am running v7.3.4.2 on my old laptop, a Dell XPS 13 9370 with Win 11 Home, and do not have any additional fonts, editing or graphics applications.  I have been using LibreOffice for about 9 years, and have been able to edit text on PDFs with this version and all prior versions.

Comment 10 -- bug 99746 Debra James 2024-08-18 00:01:12 UTC
Created attachment 195883
Unable to edit PDF text with v24.2.5

I have provided an attachment that displays how I can edit PDF text with Draw on my current laptop.  I just installed v24.2.5 on my other laptop, and am unable to edit PDF text.

Comment 11 -- bug 99746 Debra James 2024-08-18 00:08:01 UTC
I have posted some screenshots to display what I am experiencing when trying to change text with Draw 24.2.5.  I have confirmed that Edit Mode is enabled.  I am an ordinary user and frequently use the "edit" PDF function when sending rental agreements to guests of my vacation rentals.  I have a mini-PC which I use only for streaming, and will install Draw on it to see if I have the same experience.

</snip>

As noted, we do not edit PDFs in LibreOffice. You've been filter importing to a Draw ODF canvas as a sequence of draw 'Text box' objects and editing those. That workflow is sensitive to any font that is subset into the PDF but not available on the system with LibreOffice installed.

Otherwise I cannot confirm. The Draw object text boxes of a PDF filter imported to the Draw canvas can be selected, and the edit cursor provides overwrite, deletion and insertion of alternate text. On Save-as filter export to a new PDF the changes to those text runs are retained as made.

Version: 24.2.5.1 (X86_64) / LibreOffice Community
Build ID: 2ccb78ad6bdfe3f3356a7a7f294ec388775c5816
CPU threads: 8; OS: Windows 10.0 Build 19045; UI render: default; VCL: win
Locale: en-US (en_US); UI: en-US
Calc: CL threaded
Comment 4 Buovjaga 2024-09-11 13:10:17 UTC
Needinfo while we wait for example PDF.
Comment 5 V Stuart Foote 2024-09-11 15:31:45 UTC
@Debra, the NEEDINFO is to you. We need to examine a PDF that you import into Draw and are then unable to edit strings in, within the drawing shape text boxes.

The filter import uses poppler to split out the objects, which are then written into draw objects on the document canvas. Looking at your screen clips, the "Shipping" string is coming in as a sequence of drawing objects, one character per PDF element.

We cannot reproduce that with any PDF, so we need to examine one of yours that you continue to have issues with.

You can use simple substitution in your source document to anonymize any personal info (try C for any letter A-Za-z and N for any digit 0-9) and keep the rest of your document intact, such that a PDF produced from it still shows the issue you see, with the text runs split into multiple draw objects.
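A minimal sketch of that substitution, in case it helps to run it over exported text rather than retyping by hand. The function name `anonymize` is hypothetical, and it assumes only ASCII letters and digits need masking; punctuation, spacing and line breaks are left alone so the layout stays the same.

```python
import re


def anonymize(text):
    """Mask letters as 'C' and digits as 'N', leaving everything else intact."""
    # One pass over the text, so an inserted 'N' or 'C' is never re-substituted.
    return re.sub(
        r"[A-Za-z0-9]",
        lambda m: "N" if m.group().isdigit() else "C",
        text,
    )


if __name__ == "__main__":
    print(anonymize("Unit 12B, Shipping: $45.00"))
```

Because every character is replaced one for one, word lengths and line lengths are preserved, which should keep the text runs in the resulting PDF close to those of the original.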

Some splitting up of PDF text runs is normal, but every character going into its own draw object is not.  We would need to be able to verify and reproduce that.
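One rough way to check for this on the reporter's side, offered as a sketch only: save the imported document as .odg and count how many text boxes hold a single character. The function names below are hypothetical, and it is an assumption that each imported run ends up in its own `draw:text-box` element in `content.xml`.

```python
import xml.etree.ElementTree as ET

# ODF drawing namespace, as declared in content.xml of an .odg file.
DRAW = "urn:oasis:names:tc:opendocument:xmlns:drawing:1.0"


def text_box_lengths(content_xml):
    """Return the character count of each non-empty draw:text-box."""
    root = ET.fromstring(content_xml)
    lengths = []
    for box in root.iter("{%s}text-box" % DRAW):
        text = "".join(box.itertext()).strip()
        if text:
            lengths.append(len(text))
    return lengths


def single_char_ratio(content_xml):
    """Fraction of non-empty text boxes that hold exactly one character."""
    lengths = text_box_lengths(content_xml)
    if not lengths:
        return 0.0
    return sum(1 for n in lengths if n == 1) / len(lengths)
```

The XML can be read from a saved file with `zipfile.ZipFile(path).read("content.xml").decode("utf-8")`. A ratio close to 1.0 would match the one-character-per-object split seen in the screen clips, while a normal import should show mostly longer runs.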

Thanks!
Comment 6 Debra James 2024-09-12 13:50:16 UTC
Thank you for responding.  I am about to go on a month-long trip and will definitely follow up and provide the requested items after I return.