Bug 92064 - LO unusable with Tibetan super long paragraphs
Summary: LO unusable with Tibetan super long paragraphs
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer (show other bugs)
Version:
(earliest affected)
4.4.4.1 rc
Hardware: x86-64 (AMD64) Linux (All)
: medium major
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: CTL-China-Minorities
  Show dependency treegraph
 
Reported: 2015-06-14 11:26 UTC by Elie Roux
Modified: 2019-06-17 14:46 UTC (History)
5 users (show)

See Also:
Crash report or crash signature:


Attachments
Tibetan text (with long paragraph) (187.64 KB, application/x-xz)
2015-06-14 11:26 UTC, Elie Roux
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Elie Roux 2015-06-14 11:26:30 UTC
Created attachment 116527 [details]
Tibetan text (with long paragraph)

Dear All,

In Tibetan the notion of paragraph doesn't exist, and thus texts (even hundreds of pages) are usually in only one paragraph (no line break). MS Word apparently handles that without performance issue, but LibreOffice has huge performance issues when opening or editing this kind of file. See for instance attached (xzipped) file, I didn't really manage to open it, the CPU starts to heat a lot, LO doesn't answer anymore and I have to kill it by hand. This file comes from http://www.dharmadownload.net (the second text from http://www.dharmadownload.net/pages/english/Sungbum/006_mdzod%20bdun/pages/01_mdzod%20bdun%20-%20yid%20zhin%20mdzod.html), and is a normal Tibetan text, not a long paragraph torture test made to test performance...

This bug makes LO unusable in production for Tibetan, which is a pity as many tools are in gestation for spell checking and grammar checking for Tibetan in LO, that would make it much better than Word.

This might be related to https://bugs.documentfoundation.org/show_bug.cgi?id=89666 or https://bugs.documentfoundation.org/show_bug.cgi?id=39372 but it still happens on 4.4.4~rc1 (Debian/sid), so the patches don't seem to be enough.

Thank you very much!
Comment 1 Julien Nabet 2015-06-14 12:40:44 UTC
Elie: I updated tdf#89666 because 4.5.0 won't exist. There have been some patches on 5.0 branch.
Since Michael didn't put tdf#89666 as FIXED, I suppose he thinks there's still work to do.

Anyway, I could reproduce this on pc Debian x86-64 with master sources updated today.
I noticed this on console:
warn:legacy.osl:9871:1:oox/source/helper/graphichelper.cxx:117: GraphicHelper::GraphicHelper - cannot get target frame
Comment 2 Elie Roux 2015-06-14 12:43:37 UTC
Thanks for your consideration! It would be a huge help to the Tibetan community if this could work!
Comment 3 Julien Nabet 2015-09-02 12:35:21 UTC
Version corresponds to "earliest affected" as indicated.

I'll give a try with master sources (future 5.1.0).
Comment 4 Julien Nabet 2015-09-02 18:13:22 UTC
With master sources updated today, I still get the hang when opening.
After 2 minutes, I've got this:
 warn:legacy.osl:11092:1:oox/source/helper/graphichelper.cxx:117: GraphicHelper::GraphicHelper - cannot get target frame
W: Unknown node under /registry/extlang: deprecated
W: Unknown node under /registry/grandfathered: comments
W: Unknown node under /registry/grandfathered: comments

(the 3 last lines aren't specific to this bugtracker)

Miklos: since the xz file contains an rtf, thought you might be interested in this one.
Comment 5 Elie Roux 2015-09-02 18:40:50 UTC
I have the same problem with a docx file... I can convert it into .odt and put it here if you want.
Comment 6 Julien Nabet 2015-09-02 18:57:51 UTC
(In reply to Elie Roux from comment #5)
> I have the same problem with a docx file... I can convert it into .odt and
> put it here if you want.
I suppose it may help to have different formats, so go ahead! :-)
Comment 7 QA Administrators 2016-09-20 10:28:43 UTC Comment hidden (obsolete)
Comment 8 Elie Roux 2016-09-20 16:05:03 UTC
Although there's definitely an improvement compared to 5.0 version, my LO 5.2.0.4 under Debian/Sid is still very slow at opening the file indicated in the initial report, and adding or removing a character takes at least 10s, so I think it's safe to say that LO is still unusable for long Tibetan texts...
Comment 9 Xisco Faulí 2017-09-29 08:51:28 UTC Comment hidden (obsolete)
Comment 10 Elie Roux 2017-09-29 09:22:35 UTC
bug still present, Debian 9, LO 5.4.1.2
Comment 11 Buovjaga 2019-04-24 19:08:00 UTC
Takes 3 minutes to open, unusable perf after that.

Arch Linux 64-bit
Version: 6.3.0.0.alpha0+
Build ID: cfbb223d5666cb803539ac98918ff39b27efc6e7
CPU threads: 8; OS: Linux 5.0; UI render: default; VCL: gtk3; 
Locale: fi-FI (fi_FI.UTF-8); UI-Language: en-US
Calc: threaded
Built on 24 April 2019