In you save the 3.4 MB file as rtf you obtain a 27 MB file (!!!) that cannot be open. Furthmore if you modify the doc file and try to save it you'll obtain a doc file which will be double in size. The same operation in LibO 3.3.2 shows a 3.7 MB rtf file and a 3.4 MB doc file.
Created attachment 47700 [details] test file packed with rar
LO 3.5.0 beta-1 save the bugdoc in RTF without problem. The file is of the same magnitude as the .doc (~ 3.6 Mio when the .doc is 3.2 Mio). The problem is that LO has big difficulties to open the RTF, it need a lot of RAM and CPU, when Abiword opens this file without problem. I guess there is a memory leak in the RTF filter. Miklos: perhaps you should have a look to this particular file. Feel free to reassign if you can't handle this bug. Best regards. JBF
So the tokenizer itself (aka "rtf import filter") spends 12993ms on importing the rtf doc (as JBF says, the size is 3,6M here as well on master - but that is supposed to be similar in earlier versions as well). I think that is far for such a document of 500 pages for a text-based format (of course a binary format like .doc will be faster, that's not news). What can be improved here is a progressbar like the doc or the odt importer already has, will look into that.
(In previous comment: s/far/fair/.) Progressbar is implemented in master: http://cgit.freedesktop.org/libreoffice/core/commit/?id=92c7b6733e55a6ab62bc231ecf0ffd5c0da7c8d2
Hi Miklos, Is this progress bar solving bug 44157 too ? Best regards. JBF
*** Bug 44157 has been marked as a duplicate of this bug. ***
Hi JBF, I think so - unless the reporter attaches a document which is special in some way. Thanks for the hint, I closed that bug for now as duplicate of this one. Miklos
Fix confirmed with LO 3.5.1. The size of the RTF file produced by LO 3.5.1 is 3.7 MB. Closing. Thank you.