When I import a PDF that contains text, I can only edit the paragraphs separately, one by one. What I need and want (and what all users will expect) is to edit all paragraphs the way I normally do in an office document. Could there be an option to join the paragraphs at the start, so they can be edited easily and without friction? That is, unify all paragraphs on a page so they are editable as one simple block. It is time-consuming to click and edit every paragraph one at a time. I tried the union option from the right-click menu on them, and I got a plain graphic that cannot be edited with the text tool (aka the big T icon).
I hope this small option makes it into the final release.
I have also sometimes wished for such a feature, but I'm afraid it would cost too much manpower. Compared to other needs it is definitely not more than Importance "Medium", and I doubt that it will ever be integrated.
I am a developer of digital library software and, with the aim of supporting digital preservation, I was thinking of using the wonderful PDF import filter of LibreOffice to archive an .odt document next to the original .pdf (as the .odt document should provide more value for future retrieval and use of the document).
Indeed, I also find this a very nice feature to have, and it should be possible to implement it via some heuristic, such as merging consecutive lines that are not too far from each other (e.g. no further apart than the height of a character).
If no one has time to work on it, I'd be glad to give it a try in my spare time, if someone could be so kind as to point me to the most appropriate source code files that would need to be touched.
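To make the idea concrete, here is a minimal sketch of that heuristic in Python. It is only an illustration, not code from the LibreOffice PDF importer: the `TextLine` structure, its `top` and `height` fields, and the `merge_lines` function are all hypothetical names, and it assumes each extracted line comes with a vertical position (growing downwards) and a character height.

```python
from dataclasses import dataclass


@dataclass
class TextLine:
    """One extracted line of text (hypothetical structure, not a LibreOffice type)."""
    text: str
    top: float     # y coordinate of the top of the line, growing downwards
    height: float  # character height of the line


def merge_lines(lines):
    """Group lines into paragraphs when the vertical gap is at most one character height."""
    paragraphs = []
    current = []
    for line in sorted(lines, key=lambda l: l.top):
        if current:
            prev = current[-1]
            # Gap between the bottom of the previous line and the top of this one.
            gap = line.top - (prev.top + prev.height)
            if gap > prev.height:
                # Too far apart: close the current paragraph and start a new one.
                paragraphs.append(" ".join(l.text for l in current))
                current = []
        current.append(line)
    if current:
        paragraphs.append(" ".join(l.text for l in current))
    return paragraphs


lines = [
    TextLine("When I import a PDF", top=0, height=10),
    TextLine("each line is separate.", top=12, height=10),
    TextLine("A new paragraph.", top=40, height=10),
]
print(merge_lines(lines))
```

A real implementation would probably also need to look at horizontal position, font changes and columns; this sketch only covers the vertical distance check mentioned above.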
That would be great.
I'm afraid that won't be easy. I do not know how that works on other OSes, but on WIN I have to install the "Oracle PDF Import Extension" from
<http://extensions.services.openoffice.org/en/search/node/pdf import>, which itself, afaik, uses XPDF <http://foolabs.com/xpdf/about.html> as text extractor.
That's all I can contribute.
BTW: The Version field is for the first version in which the problem was observed!
(In reply to comment #3)
> @Samuele Kaplun:
> I'm afraid that won't be easy. I do not know how that works for other OS, but
> for WIN I have to install the "Oracle PDF Import Extension" from
> <http://extensions.services.openoffice.org/en/search/node/pdf import>, what
> itself afaik uses XPDF <http://foolabs.com/xpdf/about.html> as text extractor.
From <http://www.libreoffice.org/features/extensions/> I understand that finally this extension is part of the core LibreOffice source tree. Is this so?
> That's all I can contribute.
> BTW: Version is for the first version where the problem has been observed!
Sorry for this!! That makes perfect sense!
> From <http://www.libreoffice.org/features/extensions/> I understand that
> finally this extension is part of the core LibreOffice source tree. Is this so?
I thought so, too, but for my 3.4 I definitely had to download the extension. Pls see
[This is an automated message.]
This bug was filed before the changes to Bugzilla on 2011-10-16. Thus it
started right out as NEW without ever being explicitly confirmed. The bug is
changed to state NEEDINFO for this reason. To move this bug from NEEDINFO back
to NEW please check if the bug still persists with the 3.5.0 beta1 or beta2 prereleases.
Details on how to test the 3.5.0 beta1 can be found at:
more detail on this bulk operation: http://nabble.documentfoundation.org/RFC-Operation-Spamzilla-tp3607474p3607474.html
It was NEW for good reasons. But the question is whether there is a realistic chance of getting this enhancement.
*** Bug 38084 has been marked as a duplicate of this bug. ***
*** Bug 84712 has been marked as a duplicate of this bug. ***
Please read this message in its entirety before responding.
Your bug was confirmed at least 1 year ago and has not had any activity on it for over a year. Your bug is still set to NEW which means that it is open and confirmed. It would be nice to have the bug confirmed on a newer version than the version reported in the original report to know that the bug is still present -- sometimes a bug is inadvertently fixed over time and just never closed.
If you have time please do the following:
1) Test to see if the bug is still present on a currently supported version of LibreOffice (preferably 4.2 or newer).
2) If it is present please leave a comment telling us your version of LibreOffice and your operating system.
3) If it is NOT present please set the bug to RESOLVED-WORKSFORME and leave a short comment telling us your version and operating system.
Please DO NOT
1) Update the version field
2) Reply via email (please reply directly on the bug tracker)
3) Set the bug to RESOLVED - FIXED (this status has a particular meaning that is not appropriate in this case)
+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +
LibreOffice is powered by a team of volunteers, every bug is confirmed (triaged) by human beings who mostly give their time for free. We invite you to join our triaging by checking out this link:
There are also other ways to get involved including with marketing, UX, documentation, and of course developing - http://www.libreoffice.org/get-help/mailing-lists/.
Lastly, good bug reports help tremendously in making the process go more smoothly. Please always provide reproducible steps (even if it seems easy) and attach any and all relevant material.
I can confirm this bug in LO 18.104.22.168. on Windows 7
*** Bug 91896 has been marked as a duplicate of this bug. ***
Is there no bug voting? This is a major letdown!
*** Bug 105274 has been marked as a duplicate of this bug. ***
*** Bug 93039 has been marked as a duplicate of this bug. ***
LibreOffice provides functional filter import of PDF into Draw (the default Open action), and into Impress, Writer, or Draw by selecting an import filter.
With each filter, the rendering to the respective document canvas follows the structure of the document as recorded within the PDF, and text elements are rendered into styled text boxes or frames.
The PDF filter(s) do not "reflow" text into paragraph objects. That would require a very complex treatment of the PDF structure to reliably extract syntax and layout, at the expense of fidelity in rendering the PDF document.
Replacing or supplementing the PDF filters to provide "reflow" back into paragraphs is seen as out of scope for the project, as we are not a PDF editor.
The core PDF filters and function are sufficient to our needs of high rendering fidelity.
This is fertile ground for an extension.