Created attachment 176860 [details] An UTF-16BE-with-BOM-encoded HTML, featuring non-BMP emojis Importing the attached HTML to Writer, the result shows question marks for emojis. Importing it as plain text shows them correctly.
https://gerrit.libreoffice.org/c/core/+/126658
Mike Kaganski committed a patch related to this issue. It has been pushed to "master": https://git.libreoffice.org/core/commit/21154ea8c450f9f5568b32123d34a20e498a9290 tdf#146173: combine non-BMP characters' surrogates correctly It will be available in 7.4.0. The patch should be included in the daily builds available at https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: https://wiki.documentfoundation.org/Testing_Daily_Builds Affected users are encouraged to test the fix and report feedback.
Mike Kaganski committed a patch related to this issue. It has been pushed to "libreoffice-7-3": https://git.libreoffice.org/core/commit/409a0e4ed268c06af924696dbdc29a7edd09df41 tdf#146173: combine non-BMP characters' surrogates correctly It will be available in 7.3.0.0.beta2. The patch should be included in the daily builds available at https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More information about daily builds can be found at: https://wiki.documentfoundation.org/Testing_Daily_Builds Affected users are encouraged to test the fix and report feedback.