Bug 85109 - Characters of CJK Unified Ideographs Extension B do not display properly
Summary: Characters of CJK Unified Ideographs Extension B do not display properly
Status: RESOLVED WORKSFORME
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: LibreOffice (show other bugs)
Version:
(earliest affected)
4.3.2.2 release
Hardware: x86-64 (AMD64) Windows (All)
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2014-10-16 19:38 UTC by Anson Ng
Modified: 2018-07-15 05:11 UTC (History)
4 users (show)

See Also:
Crash report or crash signature:


Attachments
Test file (21.23 KB, application/vnd.oasis.opendocument.text)
2014-10-22 07:39 UTC, Matthew Francis
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Anson Ng 2014-10-16 19:38:00 UTC
I have only confirmed this with HKSCS Chinese characters, which belong to CJK Unified Ideographs Extension B, e.g. 𠺝, 𠺢, 𠝹, 𡃁, 𤓓, etc.  I made sure I use fonts that have these glyphs and I change the document's default Asian language to Chinese (Hong Kong).  The characters are either displayed as a big square box or blank space.  This issue is reproducible on Writer, Calc, etc.  The same characters displayed properly with Notepad, Notepad++, or Microsoft Word 2010.
Comment 1 Matthew Francis 2014-10-22 07:39:55 UTC
Created attachment 108223 [details]
Test file

I can't reproduce this on OSX or Linux, so it may be a Windows specific problem

Attached is a document which contains the sample characters as both live text and an image, which may help someone with Windows to reproduce the issue
Comment 2 Anson Ng 2014-10-22 18:52:08 UTC
Somehow it is working fine now.  I'm closing this.
Comment 3 erandur 2015-09-23 12:58:09 UTC
For me the Problem persists (on Windows 10, LO version 5.0.1.2). Interestingly enough, the problem only exists if I start a new file directly or open a file which doesn't contain CJK extension characters as the first file. If the first file I open contains CJK extension characters (such as the file from the attachment), everything works as it should (including in any other files I open or create during the same session).

I also noticed that whenever CJK extensions don't work, so does font preview in the selection list. To be concrete, when CJK extensions don't work (i.e. if I start a new document normally), font names are displayed in a default SansSerif font (Arial or something similar) in the font selection list instead of as a preview of the respective font.
Comment 4 Buovjaga 2015-10-09 18:46:08 UTC
Bug does not meet the criteria for Status 'REOPENED'
https://wiki.documentfoundation.org/QA/Bugzilla/Fields/Status/REOPENED#Criteria
Status -> UNCONFIRMED
Comment 5 V Stuart Foote 2015-12-13 18:42:23 UTC
On Windows 10 Pro 64-bit en-US with

Version: 5.2.0.0.alpha0+
Build ID: 917d59a84124d1022bd1912874e7a53c674784f1
CPU Threads: 8; OS Version: Windows 6.2; UI Render: GL; 
TinderBox: Win-x86@62-merge-TDF, Branch:MASTER, Time: 2015-12-12_12:17:04
Locale: en-US (en_US)

In LibreOffice when font for the paragraph to receive the copied text is correctly set in advance to use SimSun Extended B (SimSun-ExtB) (or another CJK font with glyphs for those code pages defined) then Edit -> Paste Special: Unformatted text correctly handles glyphs.

However, Standard paste and Paste Special HTML will corrupt individual characters and not assign the correct unicode value, but that is bug 81129