1. The original xlsx file from Excel is ten times smaller than after saving in LibreOffice -rwx------ 1 jmorrison None 150748 Dec 22 14:21 test-original.xlsx -rwx------ 1 jmorrison None 1439545 Dec 22 14:36 test-saved.xlsx* 2. The original xlsx file is readable by the python script xlsx2csv git clone https://github.com/dilshod/xlsx2csv pip install xlsx2csv xlsx2csv test-original.xlsx stuff from spreadsheet,,,,,,,,,,,,,,,, xlsx2csv test-saved.xlsx Traceback (most recent call last): File "/usr/bin/xlsx2csv", line 847, in <module> xlsx2csv.convert(outfile, sheetid) File "/usr/bin/xlsx2csv", line 178, in convert self._convert(sheetid, outfile) File "/usr/bin/xlsx2csv", line 247, in _convert sheet.to_csv(writer) File "/usr/bin/xlsx2csv", line 558, in to_csv self.parser.ParseFile(self.filehandle) File "/usr/bin/xlsx2csv", line 660, in handleStartElement startCol = start.group(1) AttributeError: 'NoneType' object has no attribute 'group' Researching the python error, it seems that Unicode is being returned where UTF-8 is expected https://stackoverflow.com/questions/15232832/python-regex-attributeerror-nonetype-object-has-no-attribute-groups
Please attach both files.
Created attachment 111238 [details] Original Excel 2007 file smaller test input file
Created attachment 111239 [details] xlsx file saved by LibreOffice Had to redact the original file. Libreoffice version still has output problem with xlsx2csv.
Created attachment 111240 [details] Excel 2007 file input test removed hidden sheets
Created attachment 111241 [details] Libreoffice xlsx with problems This libreoffice xlsx file can not be parsed with xlsx2csv while original can be. I added more lines to the excel file and the saved libreoffice file is 5x larger. In a large spreadsheet with hundreds of lines the file size difference is noticable. Excel xlxs file of 140k, LibreOffice was 1.4 MB.
The problem is caused by this element on the sheet 1. <dimension ref="1:15"/> As for the file size, there just has to be a duplicate bug somewhere.
@Eike: Do we have a way to produce OOXML range strings without whole column/whole row references? According to 18.3.1.35 in the spec this element requires that: The row and column bounds of all cells in this worksheet. Corresponds to the range that would contain all c elements written under sheetData. Does not support whole column or whole row reference notation.
** Please read this message in its entirety before responding ** To make sure we're focusing on the bugs that affect our users today, LibreOffice QA is asking bug reporters and confirmers to retest open, confirmed bugs which have not been touched for over a year. There have been thousands of bug fixes and commits since anyone checked on this bug report. During that time, it's possible that the bug has been fixed, or the details of the problem have changed. We'd really appreciate your help in getting confirmation that the bug is still present. If you have time, please do the following: Test to see if the bug is still present on a currently supported version of LibreOffice (5.1.5 or 5.2.1 https://www.libreoffice.org/download/ If the bug is present, please leave a comment that includes the version of LibreOffice and your operating system, and any changes you see in the bug behavior If the bug is NOT present, please set the bug's Status field to RESOLVED-WORKSFORME and leave a short comment that includes your version of LibreOffice and Operating System Please DO NOT Update the version field Reply via email (please reply directly on the bug tracker) Set the bug's Status field to RESOLVED - FIXED (this status has a particular meaning that is not appropriate in this case) If you want to do more to help you can test to see if your issue is a REGRESSION. To do so: 1. Download and install oldest version of LibreOffice (usually 3.3 unless your bug pertains to a feature added after 3.3) http://downloadarchive.documentfoundation.org/libreoffice/old/ 2. Test your bug 3. Leave a comment with your results. 4a. If the bug was present with 3.3 - set version to "inherited from OOo"; 4b. If the bug was not present in 3.3 - add "regression" to keyword Feel free to come ask questions or to say hello in our QA chat: http://webchat.freenode.net/?channels=libreoffice-qa Thank you for helping us make LibreOffice even better for everyone! Warm Regards, QA Team MassPing-UntouchedBug-20160920
The issue was resolved resolved with LibreOffice 5.3
*** Bug 88106 has been marked as a duplicate of this bug. ***