Bug 132000 - Digit characters saved with different associated language/font when exporting DOCX
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer
Version (earliest affected): 4.1 all versions
Hardware: All All
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: DOCX DOCX-RTL
Reported: 2020-04-09 12:01 UTC by Eyal Rozenberg
Modified: 2023-05-25 14:41 UTC (History)
CC List: 1 user

See Also:
Crash report or crash signature:


Attachments
Document exhibiting the bug (13.07 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2020-04-09 21:14 UTC, Eyal Rozenberg
A document exhibiting the bug (4.85 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2020-12-27 12:12 UTC, Eyal Rozenberg
LO Writer 7.0.3.1 rendering of testcase document (48.75 KB, image/png)
2020-12-27 12:13 UTC, Eyal Rozenberg
MS Word rendering of testcase document (13.46 KB, image/png)
2020-12-27 12:22 UTC, Eyal Rozenberg
Renderings in LO Writer and in MS Word (181.32 KB, image/jpeg)
2020-12-27 12:40 UTC, Eyal Rozenberg

Description Eyal Rozenberg 2020-04-09 12:01:42 UTC
Description:
I'm working in LO Writer on a document which was originally a DOCX file but has since undergone a lot of edits in LO Writer. In this document there's a paragraph of Hebrew text in the Default Style, with no custom formatting, containing a number. When I view the file in LO (e.g. 6.4.2.2), the digits are rendered with glyphs from the Hebrew font; but if I open it in MS Word, they are rendered with glyphs from the English font.

If I then add text in MS Word, save and reopen, the glyphs of the added digits use the Hebrew font in both LO and MS Office.

Notes:

* This is a rather old bug; it's quite likely it has a duplicate I couldn't find.
* I don't get this behavior if I start with a brand-new document.

Steps to Reproduce:
See above

Actual Results:
See above

Expected Results:
See above


Reproducible: Always


User Profile Reset: No



Additional Info:
Should get the Hebrew glyphs in both applications.
Comment 1 Eyal Rozenberg 2020-04-09 21:14:09 UTC
Created attachment 159456 [details]
Document exhibiting the bug

When you open the attached document, you should see the digit glyphs in Times New Roman font in MS Word, but in David font in LO Writer.
Comment 2 Xisco Faulí 2020-05-13 11:26:48 UTC
Reproduced in

Version: 7.0.0.0.alpha1+
Build ID: 1ffe59ef31186e36ad0aa7bbcdd32e407ee8d26c
CPU threads: 4; OS: Linux 4.19; UI render: default; VCL: gtk3; 
Locale: en-US (en_US.UTF-8); UI: en-US
Calc: threaded

and

Version 4.1.0.0.alpha0+ (Build ID: efca6f15609322f62a35619619a6d5fe5c9bd5a)
Comment 3 Eyal Rozenberg 2020-12-27 12:09:18 UTC
I should stress that even though this bug was only filed this year, it is very fundamental and problematic.

Even though it doesn't prevent working with the saved document, it means that you absolutely cannot get a document created in LO to look, in MS-Word, as though it had been created with MS-Word. 

Moreover, this bug manifests almost universally, since almost any RTL-language document has _some_ digits _somewhere_, which makes it extremely high-visibility.

The above also has a significant negative impact on the willingness of people to switch to LO. When a prospective user sees that using LO "messes up the fonts" - even if, semantically, the difference is subtle - they conclude that weird things will happen to their document in general if they switch.

Out of all the outstanding RTL-LTR bugs, this should be - IMHO - either the most highly-prioritized or one of the few most-prioritized.
Comment 4 Eyal Rozenberg 2020-12-27 12:12:41 UTC
Created attachment 168510 [details]
A document exhibiting the bug

Made the testcase document easier to reproduce with, in that it no longer requires the presence of David CLM (which MS-Word users don't automatically have). Now the RTL text is in Arial font and the LTR text is in Times New Roman, so the difference is clearer. Will also post a couple of screenshots.
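
One way to check what the exported file actually says about the digits is to list the font attributes on every run that contains a digit. The following is a minimal sketch, not a diagnosis of this bug: it assumes the relevant data lives in the `w:rFonts` element (the `ascii`, `hAnsi` and `cs` font slots) and in the `w:rtl` / `w:cs` run properties of `word/document.xml`, and it ignores fonts inherited from styles or document defaults. The function name `digit_runs` and the file name `testcase.docx` are made up for illustration.

```python
import re
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used in word/document.xml.
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}


def _toggle_on(rpr, tag):
    """True if the on/off run property <w:tag> is present and not switched off."""
    el = rpr.find(f"w:{tag}", NS)
    if el is None:
        return False
    return el.get(f"{{{W}}}val", "1") not in ("0", "false", "off")


def digit_runs(docx):
    """Yield (text, fonts, complex_script) for each run containing an ASCII digit.

    `docx` is a path or a binary file-like object. `fonts` maps the w:rFonts
    attribute names found directly on the run (e.g. "ascii", "cs") to font
    names. `complex_script` is True if w:rtl or w:cs is set on the run.
    Style inheritance is NOT resolved (simplifying assumption).
    """
    with zipfile.ZipFile(docx) as zf:
        root = ET.fromstring(zf.read("word/document.xml"))
    for run in root.iter(f"{{{W}}}r"):
        text = "".join(t.text or "" for t in run.findall("w:t", NS))
        if not re.search(r"[0-9]", text):
            continue
        fonts, complex_script = {}, False
        rpr = run.find("w:rPr", NS)
        if rpr is not None:
            rfonts = rpr.find("w:rFonts", NS)
            if rfonts is not None:
                # Strip the "{namespace}" prefix from each attribute name.
                fonts = {k.rsplit("}", 1)[-1]: v for k, v in rfonts.attrib.items()}
            complex_script = _toggle_on(rpr, "rtl") or _toggle_on(rpr, "cs")
        yield text, fonts, complex_script


# Hypothetical usage:
# for text, fonts, cs in digit_runs("testcase.docx"):
#     print(repr(text), fonts, "complex-script" if cs else "not complex-script")
```

Comparing the output for a file saved by LO with the output for the same file after a round trip through MS Word may show which attribute differs for the digit runs; that comparison has not been done here.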
Comment 5 Eyal Rozenberg 2020-12-27 12:13:35 UTC
Created attachment 168513 [details]
LO Writer 7.0.3.1 rendering of testcase document
Comment 6 Eyal Rozenberg 2020-12-27 12:22:46 UTC
Created attachment 168517 [details]
MS Word rendering of testcase document

This is what the document looks like in a recent version of MS Word (Office 365 MSO 16.0.something, the version numbers have been getting weirder I think).
Comment 7 Eyal Rozenberg 2020-12-27 12:40:13 UTC
Created attachment 168518 [details]
Renderings in LO Writer and in MS Word

Sized and somewhat-aligned the renderings to emphasize the difference. Remember - focus on the font of the digits, not the differences in the rendering of fonts overall (which are partly due to my resizing, and partly due to the Linux-vs-Windows font rendering engines).
Comment 8 eladhen2 2021-01-01 21:04:17 UTC
I agree. I think this is pretty bad and looks unprofessional.