Bug 43066

Summary: FILESAVE conversion from *.doc to *.odt
Product: LibreOffice
Reporter: Sadi <sadiyumusak>
Component: Writer
Assignee: Not Assigned <libreoffice-bugs>
Status: RESOLVED FIXED    
Severity: normal
CC: iamtester8
Priority: low    
Version: 3.4.4 release   
Hardware: x86 (IA32)   
OS: Linux (All)   
Whiteboard:
Crash report or crash signature:
Regression By:
Attachments: specimen - original doc file
specimen - doc file converted to odt by lo332
doc file converted to odt by lo343 (with too many tags)
doc file converted first to docx by msword2007 and then to odt by lo343
doc file converted to odt by devlo350b2 - screenshot
Screenshot of lots of tags in odt converted from doc with 3.5b2

Description Sadi 2011-11-18 07:19:11 UTC
It seems LibreOffice 3.4.3 converts files from *.doc to *.odt format with too many unnecessary tags, which cause problems in computer-assisted translation tools like OmegaT. There is no such problem in LibreOffice 3.3.2, nor when the file is first converted from *.doc to *.docx in MS Word 2007 and then from *.docx to *.odt in LibreOffice 3.4.3.
Comment 1 Sadi 2011-11-18 07:20:10 UTC
Created attachment 53658 [details]
specimen - original doc file
Comment 2 Sadi 2011-11-18 07:21:05 UTC
Created attachment 53659 [details]
specimen - doc file converted to odt by lo332
Comment 3 Sadi 2011-11-18 07:21:52 UTC
Created attachment 53660 [details]
doc file converted to odt by lo343 (with too many tags)
Comment 4 Sadi 2011-11-18 07:23:01 UTC
Created attachment 53661 [details]
doc file converted first to docx by msword2007 and then to odt by lo343
Comment 5 Sadi 2011-11-22 01:24:05 UTC
LibreOffice 3.4.4 has the same problem...
Comment 6 tester8 2012-01-11 11:31:08 UTC
LOdev 3.5.0beta2 
4ca392c-760cc4d-f39cf3d-1b2857e-60db978
Ubuntu 10.04.3 x86
Linux 2.6.32-37-generic Russian UI

The converted file has size 21204, while the 3.4.3 file has size 22837.
Is the bug still there for you with 3.5?
Comment 7 Sadi 2012-01-11 14:06:59 UTC
Created attachment 55468 [details]
doc file converted to odt by devlo350b2 - screenshot

The reported problem persists in LibreOffice 3.4.5 Beta, as can be seen in the screenshot of the file opened in OmegaT.
Comment 8 Sadi 2012-01-12 01:26:51 UTC
Correction: I meant Libre Office 3.5.0 Beta 2...


(In reply to comment #7)
> Created attachment 55468 [details]
> doc file converted to odt by devlo345b2 - screenshot
> 
> The problem reported persists in Libre Office 3.4.5 Beta as can be seen in the
> screenshot of the file opened in OmegaT.
Comment 9 tester8 2012-01-12 05:02:58 UTC
Created attachment 55489 [details]
Screenshot of lots of tags in odt converted from doc with 3.5b2

The <text> and </text> tags repeat over and over.
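
One rough way to compare conversions numerically, rather than by screenshot, is to count the elements in each .odt file's content.xml. The following is only a minimal Python sketch: it assumes the excess tags correspond to text:span elements (not confirmed in this report), and the helper names count_tags and span_count are hypothetical.

```python
import io
import zipfile
import xml.etree.ElementTree as ET
from collections import Counter

# Namespace URI of the ODF "text" prefix (text:p, text:span, ...).
TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0"


def count_tags(odt_source):
    """Return a Counter of element tags found in content.xml of an ODT file.

    odt_source may be a file path or a binary file-like object.
    """
    # An .odt file is a zip archive; the document body lives in content.xml.
    with zipfile.ZipFile(odt_source) as archive:
        root = ET.fromstring(archive.read("content.xml"))
    return Counter(element.tag for element in root.iter())


def span_count(odt_source):
    """Return the number of text:span elements (assumed source of the extra tags)."""
    return count_tags(odt_source)[f"{{{TEXT_NS}}}span"]
```

Calling span_count on each attached conversion (for example with hypothetical local file names such as "converted_lo332.odt" and "converted_lo343.odt") would give one number per file to compare.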
Comment 10 tester8 2012-01-12 05:04:05 UTC
OK, reproduced.
Comment 11 Sadi 2012-03-23 06:23:40 UTC
This bug is no longer present in LibreOffice 3.5