Bug 85316 - Copy and paste as HTML from within LibreOffice fails to preserve characters outside the Basic Multilingual Plane
Summary: Copy and paste as HTML from within LibreOffice fails to preserve characters o...
Status: RESOLVED DUPLICATE of bug 81129
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: LibreOffice (show other bugs)
Version:
(earliest affected)
Inherited From OOo
Hardware: All All
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2014-10-22 08:13 UTC by Matthew Francis
Modified: 2016-01-06 23:59 UTC (History)
3 users (show)

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Matthew Francis 2014-10-22 08:13:12 UTC
Steps to reproduce:
1. Load the file from https://bugs.freedesktop.org/attachment.cgi?id=108223 (from bug 85109)
2. Select all
3. Copy
4. Paste special - as HTML

Expected results
- Chinese text outside the BMP is preserved

Actual results
- Chinese text is replaced with random characters


This appears to be a separate issue from bug 85315, as it occurs on both OSX and Linux from 3.3.0 to current master, and the characters inserted are random (well, presumably not random, but e.g. "character & 0xFFFF") rather than "?"
Comment 1 Joel Madero 2014-10-24 04:07:56 UTC
How can we reproduce this from a blank document?
Comment 2 Matthew Francis 2014-10-24 04:53:12 UTC
(In reply to Joel Madero from comment #1)
> How can we reproduce this from a blank document?

- By typing such characters using an input method - I don't have an easy example of that, which is why I linked to the above document instead

or

- By copying the characters directly from bug 85109 and pasting as plain text

If you need a hand, give me a shout in #libreoffice-qa any time
Comment 3 Buovjaga 2014-10-24 18:55:26 UTC
I could reproduce using Matthew's suggested steps.

Notes: could not think of an input method either. Copying from this page results in questions marks: http://en.wikipedia.org/wiki/List_of_CJK_Unified_Ideographs_Extension_B_%28Part_1_of_7%29

Version: 4.4.0.0.alpha1+
Build ID: 0a82645c360158f9cc0fdabe2a52f1ff8f981bed
TinderBox: Win-x86@39, Branch:master, Time: 2014-10-24_06:59:23
Comment 4 QA Administrators 2015-12-20 16:07:47 UTC Comment hidden (obsolete)
Comment 5 Mark Hung 2016-01-06 23:59:00 UTC

*** This bug has been marked as a duplicate of bug 81129 ***