1. The original xlsx file from Excel is ten times smaller than after saving in LibreOffice
-rwx------ 1 jmorrison None 150748 Dec 22 14:21 test-original.xlsx
-rwx------ 1 jmorrison None 1439545 Dec 22 14:36 test-saved.xlsx*
2. The original xlsx file is readable by the python script xlsx2csv
git clone https://github.com/dilshod/xlsx2csv
pip install xlsx2csv
stuff from spreadsheet,,,,,,,,,,,,,,,,
Traceback (most recent call last):
File "/usr/bin/xlsx2csv", line 847, in <module>
File "/usr/bin/xlsx2csv", line 178, in convert
File "/usr/bin/xlsx2csv", line 247, in _convert
File "/usr/bin/xlsx2csv", line 558, in to_csv
File "/usr/bin/xlsx2csv", line 660, in handleStartElement
startCol = start.group(1)
AttributeError: 'NoneType' object has no attribute 'group'
Researching the python error, it seems that Unicode is being returned where UTF-8 is expected
Please attach both files.
Created attachment 111238 [details]
Original Excel 2007 file
smaller test input file
Created attachment 111239 [details]
xlsx file saved by LibreOffice
Had to redact the original file. Libreoffice version still has output problem with xlsx2csv.
Created attachment 111240 [details]
Excel 2007 file input test
removed hidden sheets
Created attachment 111241 [details]
Libreoffice xlsx with problems
This libreoffice xlsx file can not be parsed with xlsx2csv while original can be.
I added more lines to the excel file and the saved libreoffice file is 5x larger.
In a large spreadsheet with hundreds of lines the file size difference is noticable. Excel xlxs file of 140k, LibreOffice was 1.4 MB.
The problem is caused by this element on the sheet 1.
As for the file size, there just has to be a duplicate bug somewhere.
@Eike: Do we have a way to produce OOXML range strings without whole column/whole row references?
According to 22.214.171.124 in the spec this element requires that:
The row and column bounds of all cells in this worksheet. Corresponds to the range that would contain all c elements written under sheetData. Does not support whole column or whole row reference notation.
** Please read this message in its entirety before responding **
To make sure we're focusing on the bugs that affect our users today, LibreOffice QA is asking bug reporters and confirmers to retest open, confirmed bugs which have not been touched for over a year.
There have been thousands of bug fixes and commits since anyone checked on this bug report. During that time, it's possible that the bug has been fixed, or the details of the problem have changed. We'd really appreciate your help in getting confirmation that the bug is still present.
If you have time, please do the following:
Test to see if the bug is still present on a currently supported version of LibreOffice
(5.1.5 or 5.2.1 https://www.libreoffice.org/download/
If the bug is present, please leave a comment that includes the version of LibreOffice and
your operating system, and any changes you see in the bug behavior
If the bug is NOT present, please set the bug's Status field to RESOLVED-WORKSFORME and leave
a short comment that includes your version of LibreOffice and Operating System
Please DO NOT
Update the version field
Reply via email (please reply directly on the bug tracker)
Set the bug's Status field to RESOLVED - FIXED (this status has a particular meaning that is not
appropriate in this case)
If you want to do more to help you can test to see if your issue is a REGRESSION. To do so:
1. Download and install oldest version of LibreOffice (usually 3.3 unless your bug pertains to a feature added after 3.3)
2. Test your bug
3. Leave a comment with your results.
4a. If the bug was present with 3.3 - set version to "inherited from OOo";
4b. If the bug was not present in 3.3 - add "regression" to keyword
Feel free to come ask questions or to say hello in our QA chat: http://webchat.freenode.net/?channels=libreoffice-qa
Thank you for helping us make LibreOffice even better for everyone!
The issue was resolved resolved with LibreOffice 5.3
*** Bug 88106 has been marked as a duplicate of this bug. ***