Bug 150508 - Emf embeded data does not get converted properly when saving a doc as html
Summary: Emf embeded data does not get converted properly when saving a doc as html
Status: RESOLVED INSUFFICIENTDATA
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: graphics stack (show other bugs)
Version:
(earliest affected)
7.3.4.2 release
Hardware: All All
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2022-08-20 01:57 UTC by plehal
Modified: 2023-09-22 03:16 UTC (History)
2 users (show)

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description plehal 2022-08-20 01:57:36 UTC
Description:
I need to convert document files into html which contain emf images embedded into the doc files. However, when saving as html, the automatic conversion of this emf data into gif files is randomly missing text data from the image. Using inkscape --export-type flag I am able to convert emf data into png or gif without any text loss. Is there any setting in LibreOffice where you can choose which library is used for such conversions? I read some where that inkscape and Libreoffice use the same library to read emf data. If so, why are the results different? I am using Libreoffice 7.3.4.2 on Fedora 36. The html gif results are equally bad on Windows.

Steps to Reproduce:
1. Create a doc which contains embedded emf image containing a few text entries.
2. Convert doc into html page.
3. Resulting gif files generated for emf data will missing text.

Actual Results:
gif images have text missing in them

Expected Results:
Image conversion should not lose text or any entity for that matter.


Reproducible: Always


User Profile Reset: Yes


OpenGL enabled: Yes

Additional Info:
Inkscape is able to properly convert emf files contained in the doc file into png without any loss. Perhaps an easy solution is to use same library for conversion that inkscape uses.Or make it a configurable option in settings to select library and output formats for any input types found in the doc while saving as html.

I found this bug while trying to convert thousands of word files into html/markdown data in Polarion(Siemens ALM).

Tested this  on multiple Computers running linux and windows.
Comment 1 Bartosz 2022-10-17 08:08:15 UTC
Please attach the example doc file (or emf files), on which the problem is visible.
Comment 2 Buovjaga 2023-02-22 12:11:53 UTC
(In reply to Bartosz from comment #1)
> Please attach the example doc file (or emf files), on which the problem is
> visible.

Set to NEEDINFO.
Change back to UNCONFIRMED after you have provided the document.
Comment 3 QA Administrators 2023-08-22 03:06:08 UTC Comment hidden (obsolete)
Comment 4 QA Administrators 2023-09-22 03:16:54 UTC
Dear plehal,

Please read this message in its entirety before proceeding.

Your bug report is being closed as INSUFFICIENTDATA due to inactivity and
a lack of information which is needed in order to accurately
reproduce and confirm the problem. We encourage you to retest
your bug against the latest release. If the issue is still
present in the latest stable release, we need the following
information (please ignore any that you've already provided):

a) Provide details of your system including your operating
   system and the latest version of LibreOffice that you have
   confirmed the bug to be present

b) Provide easy to reproduce steps – the simpler the better

c) Provide any test case(s) which will help us confirm the problem

d) Provide screenshots of the problem if you think it might help

e) Read all comments and provide any requested information

Once all of this is done, please set the bug back to UNCONFIRMED
and we will attempt to reproduce the issue. Please do not:

a) respond via email 

b) update the version field in the bug or any of the other details
   on the top section of our bug tracker

Warm Regards,
QA Team

MassPing-NeedInfo-FollowUp