Bug 90656 - Metadata in Word 6 imported with wrong encoding
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: filters and storage
Version (earliest affected): 4.1.6.2 release
Hardware: Other Linux (All)
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2015-04-16 16:48 UTC by Karl Ove Hufthammer
Modified: 2023-06-14 03:12 UTC
0 users

See Also:
Crash report or crash signature:


Attachments
Sample document, which is rendered incorrectly in LibreOffice (11.00 KB, application/msword)
2015-04-16 16:48 UTC, Karl Ove Hufthammer
Screenshot showing how the document looks in Word 6.0 (9.64 KB, image/png)
2015-04-16 16:49 UTC, Karl Ove Hufthammer
Screenshot showing how the document looks in LibreOffice (38.37 KB, image/png)
2015-04-16 16:49 UTC, Karl Ove Hufthammer

Description Karl Ove Hufthammer 2015-04-16 16:48:32 UTC
Created attachment 114826 [details]
Sample document, which is rendered incorrectly in LibreOffice

The metadata (document properties) of imported Word 6.0 documents seems to be read with the wrong character encoding, so every non-ASCII character is rendered as a black question mark character (the Unicode replacement character).

I’ll attach an example, a screenshot of how it looks in LibreOffice and one of how it should look (i.e., how it looks in Word 6.0).

Basically, the document title is shown using a field (placed in the header). It should say: ‘Men takene mine får du aldri, svar på spørsmål’. But in LibreOffice, it is rendered as ‘Men takene mine f�r du aldri, svar p� sp�rsm�l’. The latter is also what is shown under ‘File → Properties’. So it looks like a problem with *importing* Word 6.0 documents, not with rendering fields.

The characters that are incorrectly rendered are the non-ASCII characters (å and ø, both available in ISO 8859-1).

(The document also has other rendering bugs, but I’ll file separate bug reports for those.)
Comment 1 Karl Ove Hufthammer 2015-04-16 16:49:09 UTC
Created attachment 114827 [details]
Screenshot showing how the document looks in Word 6.0
Comment 2 Karl Ove Hufthammer 2015-04-16 16:49:36 UTC
Created attachment 114828 [details]
Screenshot showing how the document looks in LibreOffice
Comment 3 MM 2015-04-17 11:05:31 UTC
Unconfirmed for v4.3.7.1 under Windows 7 x64.
Confirmed for v4.1.6.2 under Mint 16 x64.
Confirmed for v4.4.2.2 under Mint 17.1 x64.

Looks like Linux only?!
Comment 4 Karl Ove Hufthammer 2015-04-17 14:17:14 UTC
MM, that’s a clue. It looks like it is not OS-specific, but *locale*-specific. If I change my locale from the default UTF-8 locale to an ISO 8859-1 locale, for example with:

export LC_ALL=nn_NO.latin1
(and even ‘export LC_ALL=C’ seems to work)

the characters are correctly rendered.

I guess the Word 6.0 document is encoded as Windows-1252. But LibreOffice incorrectly treats the document properties as being in the user’s locale (while normal body text is correctly interpreted as being in Windows-1252, since non-ASCII characters work fine there).
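
The suspected mechanism can be illustrated with a short Python sketch. This is not LibreOffice code; it only models the hypothesis above, and it assumes that the title is stored as single-byte Windows-1252 text and that the importer decodes it with the locale's charset. The helper name decode_metadata is made up for the illustration.

```python
# Illustration only: a model of the suspected failure mode, not LibreOffice code.

TITLE = "Men takene mine får du aldri, svar på spørsmål"

# Assumption: Word 6.0 stores the title as single-byte Windows-1252 text.
raw_bytes = TITLE.encode("cp1252")


def decode_metadata(data: bytes, charset: str) -> str:
    """Decode property bytes, substituting U+FFFD for undecodable bytes."""
    return data.decode(charset, errors="replace")


# Hypothetical importer behaviour: decode with the charset of the user's locale.
as_utf8 = decode_metadata(raw_bytes, "utf-8")         # UTF-8 locale
as_latin1 = decode_metadata(raw_bytes, "iso-8859-1")  # ISO 8859-1 locale
as_cp1252 = decode_metadata(raw_bytes, "cp1252")      # the document's own charset

print(as_utf8)    # å and ø come out as U+FFFD
print(as_latin1)  # matches the original title
```

Under these assumptions, the UTF-8 result has exactly the four replacement characters seen in the LibreOffice screenshot, while the ISO 8859-1 result is intact because å and ø have the same byte values in ISO 8859-1 and Windows-1252. That would fit the observation that switching to a Latin-1 locale hides the problem.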
Comment 5 QA Administrators 2016-09-20 09:24:34 UTC Comment hidden (obsolete)
Comment 6 Karl Ove Hufthammer 2016-09-22 06:56:18 UTC
I can confirm that this bug is still valid. I still observe the exact same behaviour as in the screenshot in comment 2. This is on LibreOffice 5.2.1.2 on Linux under a UTF-8 locale.
Comment 7 Xisco Faulí 2017-09-29 08:53:14 UTC Comment hidden (obsolete)
Comment 8 Karl Ove Hufthammer 2018-05-23 19:19:55 UTC
I can confirm that this bug is still present in:

Version: 6.0.4.2
Build ID: 00m0(Build:2)
CPU threads: 4; OS: Linux 4.16; UI render: default; VCL: gtk3;
Locale: nn-NO (nn_NO.UTF-8); Calc: CL
Comment 9 QA Administrators 2019-05-24 02:57:26 UTC Comment hidden (obsolete)
Comment 10 QA Administrators 2021-05-24 04:27:00 UTC Comment hidden (obsolete)
Comment 11 Karl Ove Hufthammer 2021-06-13 10:08:49 UTC
I can confirm that this bug is still present in:

Version: 7.1.3.2 / LibreOffice Community
Build ID: 10(Build:2)
CPU threads: 4; OS: Linux 5.12; UI render: default; VCL: kf5
Locale: nn-NO (nn_NO.utf8); UI: nn-NO
Calc: threaded
Comment 12 QA Administrators 2023-06-14 03:12:49 UTC
Dear Karl Ove Hufthammer,

To make sure we're focusing on the bugs that affect our users today, LibreOffice QA is asking bug reporters and confirmers to retest open, confirmed bugs which have not been touched for over a year.

There have been thousands of bug fixes and commits since anyone checked on this bug report. During that time, it's possible that the bug has been fixed, or the details of the problem have changed. We'd really appreciate your help in getting confirmation that the bug is still present.

If you have time, please do the following:

Test to see if the bug is still present with the latest version of LibreOffice from https://www.libreoffice.org/download/

If the bug is present, please leave a comment that includes the information from Help - About LibreOffice.
 
If the bug is NOT present, please set the bug's Status field to RESOLVED-WORKSFORME and leave a comment that includes the information from Help - About LibreOffice.

Please DO NOT

Update the version field
Reply via email (please reply directly on the bug tracker)
Set the bug's Status field to RESOLVED - FIXED (this status has a particular meaning that is not appropriate in this case)


If you want to do more to help you can test to see if your issue is a REGRESSION. To do so:
1. Download and install the oldest version of LibreOffice (usually 3.3 unless your bug pertains to a feature added after 3.3) from https://downloadarchive.documentfoundation.org/libreoffice/old/

2. Test your bug
3. Leave a comment with your results.
4a. If the bug was present in 3.3, set the version to 'inherited from OOo';
4b. If the bug was not present in 3.3, add 'regression' to the keywords.


Feel free to come ask questions or to say hello in our QA chat: https://web.libera.chat/?settings=#libreoffice-qa

Thank you for helping us make LibreOffice even better for everyone!

Warm Regards,
QA Team

MassPing-UntouchedBug