Bug 85316

Summary: Copy and paste as HTML from within LibreOffice fails to preserve characters outside the Basic Multilingual Plane
Product: LibreOffice
Reporter: Matthew Francis <fdbugs>
Component: LibreOffice
Assignee: Not Assigned <libreoffice-bugs>
Status: RESOLVED DUPLICATE
Severity: normal
CC: fdbugs, ilmari.lauhakangas, jmadero.dev
Priority: medium
Version: Inherited From OOo
Hardware: All
OS: All
Whiteboard:
Crash report or crash signature:
Regression By:

Description Matthew Francis 2014-10-22 08:13:12 UTC
Steps to reproduce:
1. Load the file from https://bugs.freedesktop.org/attachment.cgi?id=108223 (from bug 85109)
2. Select all
3. Copy
4. Paste special - as HTML

Expected results:
- Chinese text outside the BMP is preserved

Actual results:
- Chinese text is replaced with random characters


This appears to be a separate issue from bug 85315: it occurs on both OS X and Linux, from 3.3.0 through current master, and the inserted characters are seemingly random rather than "?". (Presumably they are not truly random, but the result of something like "character & 0xFFFF".)
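
To make the "character & 0xFFFF" guess concrete, here is a minimal Python sketch. It is only an illustration of that hypothesis, not a confirmed description of what LibreOffice does. The helper names and the sample character U+20BB7 are chosen for the example.

```python
def truncate_to_16_bits(ch: str) -> str:
    """Hypothetical faulty conversion: keep only the low 16 bits of the code point."""
    return chr(ord(ch) & 0xFFFF)


def utf16_code_units(ch: str) -> list:
    """Correct UTF-16 encoding: a character outside the BMP becomes a surrogate pair."""
    data = ch.encode("utf-16-be")
    return [int.from_bytes(data[i:i + 2], "big") for i in range(0, len(data), 2)]


# U+20BB7 lies in CJK Unified Ideographs Extension B, i.e. outside the BMP.
original = "\U00020BB7"
damaged = truncate_to_16_bits(original)

# The truncated value is a valid but unrelated BMP code point, which would
# look like a "random" character rather than "?".
print(f"U+{ord(original):04X} -> U+{ord(damaged):04X}")

# A correct conversion keeps both 16-bit code units instead of dropping bits.
print([f"0x{unit:04X}" for unit in utf16_code_units(original)])
```

If the guess is right, each pasted character would be a different, unrelated BMP character determined by the low 16 bits of the original, which matches the "random but presumably not random" observation.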
Comment 1 Joel Madero 2014-10-24 04:07:56 UTC
How can we reproduce this from a blank document?
Comment 2 Matthew Francis 2014-10-24 04:53:12 UTC
(In reply to Joel Madero from comment #1)
> How can we reproduce this from a blank document?

- By typing such characters using an input method. I don't have an easy example of that, which is why I linked to the above document instead.

or

- By copying the characters directly from bug 85109 and pasting as plain text
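
A third option, sketched here as an assumption rather than a tested procedure, is to generate a few characters outside the BMP with a short Python script and copy them from its output. The function name and the choice of range (the start of CJK Unified Ideographs Extension B, U+20000) are illustrative. Whether the characters survive the copy depends on the terminal and clipboard in use.

```python
def make_non_bmp_sample(start: int = 0x20000, count: int = 5) -> str:
    """Return `count` consecutive characters beginning at code point `start`."""
    return "".join(chr(cp) for cp in range(start, start + count))


sample = make_non_bmp_sample()

# Print the characters themselves, then their code points for reference.
print(sample)
print(" ".join(f"U+{ord(ch):04X}" for ch in sample))
```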

If you need a hand, give me a shout in #libreoffice-qa any time
Comment 3 Buovjaga 2014-10-24 18:55:26 UTC
I could reproduce using Matthew's suggested steps.

Notes: I could not think of an input method either. Copying from this page results in question marks: http://en.wikipedia.org/wiki/List_of_CJK_Unified_Ideographs_Extension_B_%28Part_1_of_7%29

Version: 4.4.0.0.alpha1+
Build ID: 0a82645c360158f9cc0fdabe2a52f1ff8f981bed
TinderBox: Win-x86@39, Branch:master, Time: 2014-10-24_06:59:23
Comment 4 QA Administrators 2015-12-20 16:07:47 UTC Comment hidden (obsolete)
Comment 5 Mark Hung 2016-01-06 23:59:00 UTC

*** This bug has been marked as a duplicate of bug 81129 ***