Bug 164503 - [META] Multiple bugs manifesting in LO->ODT->MSO->DOCX->LO round-trip of formal letter
Status: UNCONFIRMED
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer
Version (earliest affected): 24.8.4.2 release
Hardware: All All
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords: filter:docx
Depends on: 165193 165199
Blocks: DOCX-RTL RTL
Reported: 2024-12-28 19:45 UTC by Eyal Rozenberg
Modified: 2025-02-11 14:31 UTC

See Also:
Crash report or crash signature:


Attachments
Original letter ODT (34.50 KB, application/vnd.oasis.opendocument.text)
2024-12-28 19:46 UTC, Eyal Rozenberg
DOCX (saved from #198311 opened in MSO Word 16) (16.21 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2024-12-28 19:48 UTC, Eyal Rozenberg
Rendering of both documents in LO 25.8 nightly (201.12 KB, image/png)
2024-12-28 19:50 UTC, Eyal Rozenberg
DOCX (saved from #198311 opened in MSO Word 2408) (16.72 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2025-02-11 11:32 UTC, Eyal Rozenberg
DOCX (saved from #198311 opened in MSO Word 2408 & Saved) (16.33 KB, application/vnd.openxmlformats-officedocument.wordprocessingml.document)
2025-02-11 14:31 UTC, Eyal Rozenberg

Description Eyal Rozenberg 2024-12-28 19:45:47 UTC
Consider the soon-to-be attached documents. They contain a formal letter in Hebrew, one of the most common kinds of documents people use word processors for.

We start with the letter in LO Writer (I've replaced most of the actual text with gibberish). One caveat: this document may, at some point in the past, have been a DOCX that was opened and modified, so it was probably not written from _scratch_ in Writer. Be that as it may, we now:

1. Save the letter as ODT; this is the first attachment.
2. Open the ODT with MS Office Word 16, on Windows.
3. Save the letter as a Word Document, i.e. DOCX, in MSO Word; that DOCX is the second attachment.
4. Open the DOCX file in LibreOffice.

Comparing the two documents in LO Writer, we observe several differences:

1. Some paragraphs - but not all of them - have LTR rather than RTL direction.
2. The first paragraph was originally left-aligned. When we flip its direction to RTL, it doesn't become left-aligned again, but right-aligned; if we then flip the direction again, it _does_ become left-aligned, i.e. it doesn't revert to LTR and right-aligned.
3. The title paragraph is no longer centered
4. The underline width for the "Title Subject Part" text is thinner than it was originally.
5. The second-to-last paragraph, which uses Default Paragraph Style with a DF (direct formatting) paragraph border, loses its bottom border in the DOCX. (That paragraph is not very well formatted and should have its own style, but that's a gaffe on my part in the original document, not a bug.)
6. The "Body Text" paragraphs now have style "Text Body (WW)"
7. The "Body Text" paragraphs are now no longer justified, but right-aligned
8. The first two characters in the first paragraph are rendered as "English, USA", i.e. with the Western LG font, even though Writer _is_ able to figure out that the rest of the paragraph, including the trailing numbers, is "Hebrew".
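
To check items 1, 3, 6 and 7 independently of how Writer renders the file, one could dump the direct paragraph properties that Word wrote into the DOCX. The following is only a rough sketch, not something used to produce this report: the helper names `paragraph_props` and `_is_on` are made up for illustration, and it reads only the direct `w:pPr` of each paragraph in `word/document.xml`, so anything inherited from `styles.xml` is ignored.

```python
import io
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML main namespace used by word/document.xml
W = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"
NS = {"w": W}


def _is_on(elem):
    """Hypothetical helper: an on/off element counts as set when present,
    unless its w:val explicitly switches it off."""
    if elem is None:
        return False
    return elem.get(f"{{{W}}}val", "true") not in ("0", "false", "off")


def paragraph_props(docx):
    """Yield (style id, bidi flag, w:jc value, first 30 chars of text)
    for every paragraph, looking at direct formatting only."""
    with zipfile.ZipFile(docx) as zf:
        root = ET.fromstring(zf.read("word/document.xml"))
    for p in root.iter(f"{{{W}}}p"):
        style = jc = None
        bidi = False
        ppr = p.find("w:pPr", NS)
        if ppr is not None:
            s = ppr.find("w:pStyle", NS)
            j = ppr.find("w:jc", NS)
            style = s.get(f"{{{W}}}val") if s is not None else None
            jc = j.get(f"{{{W}}}val") if j is not None else None
            bidi = _is_on(ppr.find("w:bidi", NS))
        text = "".join(t.text or "" for t in p.iter(f"{{{W}}}t"))
        yield style, bidi, jc, text[:30]
```

Running `for row in paragraph_props("letter.docx"): print(row)` on each attached DOCX (the file name here is a placeholder) would show which paragraphs lack a direct `w:bidi` or carry an unexpected `w:jc`, which may help tell export-side differences from import-side ones.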


Now, some of these issues are likely already-reported bugs. For those, let's mark this bug as blocked by those issues. For yet-unreported bugs, let's add the report and then mark the new bugs as blocking this one.
Comment 1 Eyal Rozenberg 2024-12-28 19:46:47 UTC
Created attachment 198311
Original letter ODT
Comment 2 Eyal Rozenberg 2024-12-28 19:48:24 UTC
Created attachment 198312
DOCX (saved from #198311 opened in MSO Word 16)
Comment 3 Eyal Rozenberg 2024-12-28 19:50:18 UTC
Created attachment 198313
Rendering of both documents in LO 25.8 nightly

Rendered with:

Version: 25.8.0.0.alpha0+ (X86_64) / LibreOffice Community
Build ID: 2305fe302e12c4256e452589e2533772d4213e59
CPU threads: 4; OS: Linux 6.6; UI render: default; VCL: gtk3
Locale: en-IL (en_IL); UI: en-US


but I believe this looks the same with 24.8.3.2 as well.
Comment 4 Eyal Rozenberg 2025-02-11 11:32:26 UTC
Created attachment 199136
DOCX (saved from #198311 opened in MSO Word 2408)

Also attaching the file saved by a more recent version of Word.
Comment 5 Eyal Rozenberg 2025-02-11 14:31:10 UTC
Created attachment 199144
DOCX (saved from #198311 opened in MSO Word 2408 & Saved)


With a more recent MS Word version (2408, Build 16.0.17928.20336, 32-bit), some of the problems don't manifest: (3.) and (4.).