Bug 87191 - Encoding error when importing from old Word for Mac document
Summary: Encoding error when importing from old Word for Mac document
Status: RESOLVED WORKSFORME
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: filters and storage (show other bugs)
Version:
(earliest affected)
4.3.4.1 release
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2014-12-10 12:20 UTC by Olivier Berten
Modified: 2014-12-12 20:52 UTC (History)
0 users

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Olivier Berten 2014-12-10 12:20:43 UTC
I recently received an old document typed on Word for Mac. Accented characters got scrambled because they hadn't been converted from MacRoman to Unicode. I have no idea whether there is anything like an "encoding" or "charset" parameter in these old documents but I guess one can safely assume if it's from Word for Mac, it would be encoded in MacRoman.
Comment 1 Urmas 2014-12-10 20:42:56 UTC
I'm afraid 'old document by Word for Mac' a bit too broad. Could you open your file in any hex viewer and paste the first 16 bytes of it here?
Comment 2 Olivier Berten 2014-12-10 21:22:37 UTC
fe 37 00 23 00 00 00 00 00 00 04 00 00 19 00 00
Comment 3 Urmas 2014-12-10 21:46:12 UTC
Interestingly, I created a file with version 0x23 via export converter, and it is converted properly.

Could you attach the document here? If it is not suitable for the public view, you can send it to me to obfuscate it first.
Comment 4 Urmas 2014-12-12 20:52:58 UTC
Well, no document, no bug.