Bug 132544 - FILEOPEN DOCX: Emphasis mark appearance should vary depending on text language
Summary: FILEOPEN DOCX: Emphasis mark appearance should vary depending on text language
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer (show other bugs)
Version:
(earliest affected)
5.2.7.2 release
Hardware: x86-64 (AMD64) All
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords: filter:docx
Depends on:
Blocks: CJK DOCX
  Show dependency treegraph
 
Reported: 2020-04-30 06:25 UTC by chengming1
Modified: 2024-09-19 17:15 UTC (History)
3 users (show)

See Also:
Crash report or crash signature:


Attachments
the file causing the bug (42.61 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2020-04-30 06:28 UTC, chengming1
Details
Reproducer illustrating language differences (15.70 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2024-09-19 17:00 UTC, Jonathan Clark
Details
Comparison screenshot (34.35 KB, image/png)
2024-09-19 17:01 UTC, Jonathan Clark
Details

Note You need to log in before you can comment on or make changes to this bug.
Description chengming1 2020-04-30 06:25:40 UTC
Description:
Dots UNDER characters become ABOVE characters and disappears when printing.

Steps to Reproduce:
1.Open the .docx file in the attachment
2. Print the file.

Actual Results:
1.There are some dots above characters. .
2.If printed this docx file, all dots disappeared.

Expected Results:
These dots should be under characters in MS Word.



Reproducible: Always


User Profile Reset: No



Additional Info:
版本: 6.4.0.3
Build ID: 6.4.0.3-2.fc32
CPU 线程: 8; 操作系统: Linux 5.7; UI 渲染: 默认; VCL: gtk3; 
区域语言: zh-CN (zh_CN.UTF-8); UI 语言: zh-CN
Calc: threaded
Comment 1 chengming1 2020-04-30 06:28:42 UTC
Created attachment 160106 [details]
the file causing the bug
Comment 2 Dieter 2020-05-04 08:27:03 UTC
I confirm tih with

Version: 7.0.0.0.alpha0+ (x64)Build ID: 8c8b3a4f83f67882b284ddc3b3fe10d3fe6dedf4CPU threads: 4; OS: Windows 10.0 Build 18363; UI render: Skia/Raster; VCL: win; Locale: de-DE (de_DE); UI-Language: en-GBCalc: CL

In the last four lines I can see dots below characters in Word 2016. In LO they're above characters and they disappear in an exported PDF.
Comment 3 Ming Hua 2020-05-04 10:50:28 UTC
Also reproduced in 5.2.7 (the oldest version I have here):
Version: 5.2.7.2 (x64)
Build ID: 2b7f1e640c46ceb28adf43ee075a6e8b8439ed10
CPU Threads: 2; OS Version: Windows 6.19; UI Render: default; 
Locale: zh-CN (zh_CN); Calc: group
Comment 4 QA Administrators 2022-05-05 03:54:01 UTC Comment hidden (obsolete)
Comment 5 Dieter 2022-05-05 12:04:58 UTC
Still present in

Version: 7.4.0.0.alpha0+ (x64) / LibreOffice Community
Build ID: 83d0f2eebae41d431d9a5bfd1a918523977752d0
CPU threads: 4; OS: Windows 10.0 Build 19044; UI render: Skia/Raster; VCL: win
Locale: de-DE (de_DE); UI: en-GB
Calc: CL

Compared with Word 2016
Comment 6 QA Administrators 2024-05-05 03:16:47 UTC Comment hidden (obsolete)
Comment 7 Ming Hua 2024-05-05 06:49:25 UTC
Still reproducible with:

Version: 24.2.2.1 (X86_64) / LibreOffice Community
Build ID: bf759d854b5ab45b6ef0bfd22e51c6dc4fb8b882
CPU threads: 12; OS: Windows 10.0 Build 22631; UI render: Skia/Vulkan; VCL: win
Locale: zh-CN (zh_CN); UI: zh-CN
Calc: CL threaded

and

Version: 24.8.0.0.alpha0+ (X86_64) / LibreOffice Community
Build ID: 6027c219444ade7274841a5ba45e62bdc1ea3238
CPU threads: 12; OS: Windows 10.0 Build 22631; UI render: Skia/Vulkan; VCL: win
Locale: zh-CN (zh_CN); UI: en-US
Calc: CL threaded
Comment 8 Jonathan Clark 2024-09-19 17:00:05 UTC
Created attachment 196548 [details]
Reproducer illustrating language differences

Example .docx demonstrating how the dot emphasis mark is rendered and positioned differently depending on text language, even within the same document.
Comment 9 Jonathan Clark 2024-09-19 17:01:16 UTC
Created attachment 196549 [details]
Comparison screenshot

Screenshot comparing LO to Microsoft Word, while displaying the example document attached previously.
Comment 10 Jonathan Clark 2024-09-19 17:15:25 UTC
OOXML provides the <w:em/> tag to indicate that a section of text should be rendered with am emphasis mark. The w:val="dot" attribute is specified for rendering dots above characters, and w:val="underDot" for rendering dots below characters; LibreOffice currently interprets these attributes accordingly.

However, Word does not.

Word interprets the "dot" value differently depending on text language, even within the same document. This is demonstrated by attachment 196548 [details]. All of the lines in this document are identical, other than the w:lang tags.


I was not able to reproduce the print issue.