I updated to the recent LO 3.6.1 on my distro Arch Linux, and have noticed, that the Calc is quite slow ever since when manipulating with bigger data file.
I have by e.g. about 5MB large ODS spreadsheet, in which I store various data.
Before on LO 3.5.* , it worked fast.
Now, it takes minutes to load or save the file at change. And I'm updating the file quite often per day.
When I was searching for an root cause, I've noticed, that only one CPU core is being in use by CPU process when loading/saving.
I believe, this migt be it, as I have laptop with 4 core CPU (i5) , and htop/top revealed only one working on 100% at the process of load/save.
I've tested it also on one another separate arch virtual pc, having only two CPU cores, and htop/top again revealed only one CPU core in use at 100%.
As might be seen in here: https://bbs.archlinux.org/viewtopic.php?id=148492 , I'm not the only person affected.
I've also found this problem on the Ubuntu distro: https://bugs.launchpad.net/ubuntu/+source/libreoffice/+bug/1034999
Furthermore, I'm not sure, whether this problem is present on Linux systems only, as I haven't tested it yet at other systems like Windows, but it it possible, that it might be related to other OS's as well.
Created attachment 66797 [details]
Saving of these data took 37 minutes, but is only 330K large!
Saving of these data took 37 minutes, but is only 330K large!
We've (people on the Arch forum link I've mentioned before) made some further testing, and simply said:
The more data per one cell, e.g. multiple lines per one cell, having a very lot of such data in the sheet, e.g. 10-20 columns and several thousand rows (20x4000), and the time for load/save operations grows.
But if there is just one line per cell, in such 20x4000 sheet, the speed ain't affected, it's saved instantly.
It worked much faster before each new LO version, till it got this slug speed.
Can this be fixed please?
We have no multithreaded import/export so one core is normal.
(In reply to comment #3)
> We have no multithreaded import/export so one core is normal.
It might be normal for one line cell content.
But in cases as I described - an multiline cell content, the performance went horribly down.
Try it yourself, and you'll see on your own.
I don't know what precisely changed in between of the versions, but the processing performance decreased every time after each major release, 3.4, 3.5, 3.6.
And since such feature (multiline cell content) is available, it would be great if it get's some fix.
thx in advance
Confirmed this performance problem on LO 126.96.36.199 on Windows Vista 32bit.
Opening the attached file took 5minutes+ till I killed LO.
Possibly related - A workbook of mine which recalcs in <1s using 3.5.7 and previous versions now takes ~5min. Sorry I cannot share it as there is too much proprietary data in there. Loading is not noticeably slower. As with these reports, 1 CPU is 100% utilized by scalc the entire time. I see the same results in both 188.8.131.52 and 184.108.40.206. I am using Ubuntu Linux 12.04. Thanks for your work on this great package.
Did some investigation. Basically I haven't really figured out why importing of multiline contents (which are equivalent of rich-text contents) became so slow all of a sudden, but what's clear is that, to solve this, we need to re-write the code that parses and creates rich-text cells (aka edit cells). The current code uses way over-engineered UNO API for rich-text cells, and it's not humanly possible to grok that enough to be able to speed things up. It's just not doable.
The bad news is that this won't make it into 4.0. I'll see if I can tackle this during the 4.1 development cycle.
Note for self for future work:
Start in ScXMLTableRowCellContext::CreateChildContext and write a new handler to parse the <text:p> elements. Use XclImpStringHelper::CreateCell() as a reference, and import rich-text contents directly into ScEditCell.
Jamie - your bug seems un-related to import - can you file a new bug for that - and (if at all possible) generate a callgrind trace with debuginfo installed for it :-)
I'm getting ready to tackle this.
The work is on-going on the feature/ods-edit-cell-import branch.
Just merged the feature/ods-edit-cell-import branch onto master. Here are my numbers for opening the attached document.
3.5: 75.1015 sec
3.6: (too long to measure)
4.0: (too long to measure)
4.1 (master): 18.2441 sec
The numbers suggest that the perf regression is now gone, to say the least. Not only that, the number is 3 times better than 3.5.
I'll call this fixed. The fix will be available in 4.1. As I said in Comment 7, we won't be able to backport this into 4.0 due to its very invasive change.
That's great news. However, can you confirm if the same regression when saving an ods file is also fixed by this?
Opening/Saving an ods we work a lot on takes following times:
LO 3.4: Open 15.5s, Save 7.9s
LO 3.6: Open 37.3s, Save 43.2s
(In reply to comment #13)
> That's great news. However, can you confirm if the same regression when
> saving an ods file is also fixed by this?
> Opening/Saving an ods we work a lot on takes following times:
> LO 3.4: Open 15.5s, Save 7.9s
> LO 3.6: Open 37.3s, Save 43.2s
Could you open a separate bug for it? It's best to separate the import and export parts, since they are two separates code set. Thanks.
> Could you open a separate bug for it? It's best to separate the import and
> export parts, since they are two separates code set. Thanks.
Thank you Kohei.
Anyway, I've thought, that it will be fixed both, import/load AND export/save at once. I also mentioned this in my initial description, so who would have tell, that you will concentrate on one function only (doesn't matter that it is elsewhere in the code set as you said).
If I understood it right, now it means, that it will load faster, but saving will still be affected, and will take hell long.
Ok, so Daniel opened the Bug 60740.
Do you think, that you can also focus on this new bug fix ASAP, to have both fixed available in the upcomming 4.1 ?
And thank you in advance for your effort on both.
Great, the 4.1.* finally got into my distribution's official repository.
Import/load works much much faster than before (even on my older computer).
Perfect work Kohei, thank you very much.
Migrating Whiteboard tags to Keywords: (perf)