Bug 104605 - direct conversion from rtf to docx (command line --headless or via gui): text missing in output file
Status: RESOLVED DUPLICATE of bug 103544
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: filters and storage
Version: 4.4.0.3 release (earliest affected)
Hardware: All All
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords: bibisected, regression
Depends on:
Blocks:
 
Reported: 2016-12-12 12:42 UTC by Ivan
Modified: 2019-08-13 05:52 UTC
CC List: 1 user

See Also:
Crash report or crash signature:


Attachments
document that has missing text when converted to docx (71.72 KB, text/rtf)
2016-12-12 12:44 UTC, Ivan

Description Ivan 2016-12-12 12:42:54 UTC
Description:
Converting some RTF files to DOCX with
soffice --headless --convert-to docx Bibel.rtf
produces a .docx file that is missing some text when opened in LibreOffice (or in Word).

The odd thing is that when I open the RTF in LibreOffice, the text is there and looks OK, but after saving to DOCX the text is missing.
If I save the opened RTF as .odt, close the document, open the saved .odt and then save it as DOCX, the text isn't missing.
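
A minimal sketch of the ODT detour described above, scripted with Python around the same soffice command line. It assumes that two separate headless conversions behave like the manual save, close, reopen sequence in the GUI, which has not been verified here; the helper names build_commands and convert_via_odt are made up for illustration.

```python
import subprocess
from pathlib import Path


def build_commands(rtf_path, outdir="."):
    """Return the two soffice invocations: RTF -> ODT, then ODT -> DOCX."""
    rtf = Path(rtf_path)
    # soffice names the output after the input file's stem
    odt = Path(outdir) / (rtf.stem + ".odt")
    return [
        ["soffice", "--headless", "--convert-to", "odt",
         "--outdir", str(outdir), str(rtf)],
        ["soffice", "--headless", "--convert-to", "docx",
         "--outdir", str(outdir), str(odt)],
    ]


def convert_via_odt(rtf_path, outdir="."):
    """Run both conversions in order; raises CalledProcessError on failure.

    Assumption: soffice is on PATH and no other LibreOffice instance
    is blocking the headless run.
    """
    for cmd in build_commands(rtf_path, outdir):
        subprocess.run(cmd, check=True)
```

Calling convert_via_odt("Bibel.rtf", "out") would leave out/Bibel.odt and out/Bibel.docx behind if both steps succeed.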


Steps to Reproduce:
1. From a terminal, run soffice --headless --convert-to docx Bibel.rtf
2. Open the converted Bibel.docx file

or

1. start LibreOffice
2. open Bibel.rtf (notice full title is there)
3. save as .docx (doesn't matter if Microsoft Word 2007-2013 XML or Office Open XML Text)
4. close file
5. open saved .docx file

Actual Results:  
The created .docx is missing the text "Das Alte Testament"

Expected Results:
The full title text "Erster Teil Das Alte Testament" is present
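
The actual/expected check can also be done without opening the file in a GUI. The following is a rough sketch that reads word/document.xml from the converted .docx and looks for the full title. The function names are hypothetical, and it assumes the title lives in the main document part (not in a header or a separate part).

```python
import re
import xml.etree.ElementTree as ET
import zipfile

# WordprocessingML main namespace, in ElementTree's {uri}tag notation
W = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def docx_text(path):
    """Concatenate the contents of all w:t elements in word/document.xml."""
    with zipfile.ZipFile(path) as z:
        root = ET.fromstring(z.read("word/document.xml"))
    return "".join(t.text or "" for t in root.iter(W + "t"))


def squash(s):
    """Drop all whitespace so run and paragraph splits do not matter."""
    return re.sub(r"\s+", "", s)


def title_is_complete(path, expected="Erster Teil Das Alte Testament"):
    """True if the expected title appears in the document text."""
    return squash(expected) in squash(docx_text(path))
```

With the attached file, title_is_complete("Bibel.docx") should be False on affected builds and True once the text survives the export.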


Reproducible: Always

User Profile Reset: Yes.
Also tried on OS X and on Windows

Additional Info:
Rtf document has an image that is anchored to the (missing) text



User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.11; rv:50.0) Gecko/20100101 Firefox/50.0
Comment 1 Ivan 2016-12-12 12:44:11 UTC
Created attachment 129526
document that has missing text when converted to docx
Comment 2 Xisco Faulí 2016-12-13 09:29:37 UTC Comment hidden (obsolete)
Comment 3 Xisco Faulí 2016-12-13 09:35:22 UTC
However, I can reproduce it in

Version: 5.2.3.3
Build ID: d54a8868f08a7b39642414cf2c8ef2f228f780cf
CPU Threads: 4; OS Version: Linux 4.8; UI Render: default; 
Locale: ca-ES (ca_ES.UTF-8); Calc: group

so it has been recently fixed. Closing this as RESOLVED WORKSFORME and adding backportRequest:5.2
Comment 4 Ivan 2016-12-13 14:48:38 UTC
I downloaded latest OS X version from https://www.libreoffice.org/download/libreoffice-fresh/ 

Version: 5.2.3.3
Build ID: d54a8868f08a7b39642414cf2c8ef2f228f780cf
CPU Threads: 8; OS Version: Mac OS X 10.11.6; UI Render: default; 
Locale: en-US (en.UTF-8); Calc: group

The bug is still there.


I also tried the latest build from
http://dev-builds.libreoffice.org/pre-releases/mac/x86_64/

Version: 5.3.0.0.alpha1
Build ID: f4ca1573fcf445164c068c1046ab5d084e1b005f
CPU Threads: 8; OS Version: Mac OS X 10.11.6; UI Render: default; 
Locale: en-US (en.UTF-8); Calc: group

and the bug is still there.

I couldn't find a 5.4 OS X build to test.
Comment 5 Justin L 2019-08-13 05:52:55 UTC
fixed in LO 5.3 by commit b927c1f4b334f80d2c2965e5b7327d6b6a685105
Author: Tamás Zolnai Date:   Fri Oct 28 15:24:51 2016 +0000
bug 103544: DOCX exp.: Image loss when have a frame anchored to the same para.
    Regression from: 83d51e5e52688c4c9bc0ad70a511458bb06a242d


It was originally broken in LO 4.4 by commit 1c876f5616522ab695de8c0316cdb0c601081815
Author: Rohit Deshmukh <rohit.deshmukh@synerzip.com>
CommitDate: Thu Jun 19 11:01:17 2014 +0000
  bug 78590: Fix for Corruption of para with framePr & drawing object into para
        Issue:
         - File contains a paragraph with framePr that has a graphic object in it.
         - So LibreOffice converts framePr into a textbox.
         - So after saving the file, the textbox gets exported with the drawing object.
         - MS Office does not allow a drawing object inside a textbox.

*** This bug has been marked as a duplicate of bug 103544 ***