Bug 70631 - Other: LibreOffice is not parsing Excel 2003 XML files when file opened directly by double-click
Status: RESOLVED WORKSFORME
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: LibreOffice
Version (earliest affected): 4.1.1.2 release
Hardware: Other Windows (All)
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard: BSA
Keywords:
Depends on:
Blocks:
 
Reported: 2013-10-18 19:49 UTC by Michal
Modified: 2015-01-24 11:55 UTC

See Also:
Crash report or crash signature:


Attachments
test XML file (122.45 KB, text/xml)
2013-12-05 12:06 UTC, RS

Description Michal 2013-10-18 19:49:37 UTC
Problem description: 
LibreOffice is ignoring Excel 2003 XML files.

Steps to reproduce:
1. Put an Excel 2003 XML file on the desktop
2. Open it (double-click)
3. You will see XML code instead of the document

There is a workaround: start LO, then choose Open -> File type Excel 2003 XML -> Select file. This works well.

Current behavior:
The file is displayed as raw XML code instead of being opened as a spreadsheet.

Expected behavior:
The file should open as a spreadsheet. The Excel 2003 XML format is still in use.
              
Operating System: Windows 7
Version: 4.1.1.2 release
Comment 1 ign_christian 2013-10-19 09:36:02 UTC
Hi Michal, please try again using LO 4.1.2.3 & resetting user profile. If it works, please change status to RESOLVED WORKSFORME.

Otherwise, attach your sample file here & change status to UNCONFIRMED.
Comment 2 m_a_riosv 2013-10-19 12:12:57 UTC
I think this is a duplicate.

*** This bug has been marked as a duplicate of bug 70100 ***
Comment 3 Maxim Monastirsky 2013-10-27 14:34:37 UTC
It can't be a duplicate of bug 70100, as the XML detection code is not related in any way to the Excel detection code.

I can't reproduce this bug on Windows 7 (32-bit) using 4.1.1.2 and 4.1.3.1 with a valid XML file, only with an invalid one. So I'll mark this bug as WORKSFORME.
Comment 4 RS 2013-12-05 10:00:38 UTC
still an issue in Version 4.1.3.2 on Xenapp 6.5HRP2 (Windows 2k8R2)
and Windows 8.1 Desktop.

Thanks Michal for your workaround.

But in my opinion this is a regression from the old behavior.
The parser doesn't recognize the file correctly.

Bug 70100 looks similar but is not the same: it is about Excel 2003 XLS files, not Excel 2003 XML files.
Comment 5 Maxim Monastirsky 2013-12-05 10:07:41 UTC
(In reply to comment #4)
> still an issue in Version 4.1.3.2 on Xenapp 6.5HRP2 (Windows 2k8R2)
> and Windows 8.1 Desktop.
@RS: Hi, please attach the affected file.
Comment 6 RS 2013-12-05 12:06:10 UTC
Created attachment 90300
test XML file

This is an anonymized test file.
Comment 7 RS 2013-12-05 12:12:07 UTC
Here is some interesting additional information.
First I edited the original file with the TextPad editor (changing only field contents, to anonymize it) and saved it.
After that I could open the file by selecting "Open with -> LibreOffice Calc" in the context menu.

Then I anonymized the original file using the Windows editor instead. That worked, but LibreOffice Calc still can't open the resulting file from Windows Explorer (same as with the original file).

What is wrong with MS Excel 2003 XML Output? 

Thank you for investigating. =)
Comment 8 Maxim Monastirsky 2013-12-05 13:30:58 UTC
@RS: The attached file is UTF-8 with a BOM. Removing the BOM makes LO detect the file correctly. It's the same observation as in Bug 46262 comment 9.
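
To illustrate the BOM removal mentioned above, here is a minimal Python sketch of a user-side workaround. It is not part of LibreOffice; the function names `strip_utf8_bom` and `strip_bom_in_file` are hypothetical, and it assumes the file starts with the standard UTF-8 BOM bytes (EF BB BF).

```python
import codecs
from pathlib import Path


def strip_utf8_bom(data: bytes) -> bytes:
    """Return data without a leading UTF-8 BOM (EF BB BF), if one is present."""
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):]
    return data


def strip_bom_in_file(path: str) -> bool:
    """Rewrite the file in place without its BOM; return True if it was changed.

    Hypothetical helper for illustration only; keep a backup of the original.
    """
    p = Path(path)
    original = p.read_bytes()
    cleaned = strip_utf8_bom(original)
    if cleaned != original:
        p.write_bytes(cleaned)
        return True
    return False
```

Running `strip_bom_in_file` on a copy of the affected file should leave the XML content itself untouched and drop only the three leading bytes.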

The detection fails in the isXMLStream function in filter/source/xmlfilterdetect/filterdetect.cxx, which does not handle a BOM.
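
The effect of a missing BOM check can be sketched in Python as follows. This is not the actual isXMLStream code, and it is not the submitted patch; `looks_like_xml` is a hypothetical stand-in that assumes the check amounts to looking for a leading `<?xml` declaration at the start of the stream.

```python
import codecs


def looks_like_xml(head: bytes) -> bool:
    """Hypothetical sniffing check: does the stream start with an XML declaration?

    Assumption: a leading UTF-8 BOM is skipped before the comparison, so files
    saved with a BOM are treated the same as files saved without one.
    """
    if head.startswith(codecs.BOM_UTF8):
        head = head[len(codecs.BOM_UTF8):]
    # Tolerate leading whitespace, then look for the XML declaration.
    return head.lstrip().startswith(b"<?xml")
```

Without the first `if`, a file beginning with the BOM bytes would fail the `<?xml` comparison even though its content is otherwise identical, which matches the behavior described for the attached file.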

*** This bug has been marked as a duplicate of bug 46262 ***
Comment 9 RS 2013-12-05 13:48:03 UTC
Maxim, please not this fast, the other thread is dead since 2012.

This bug is new; LO 4.0.4 opened the file correctly.

We got those files from a big marketing agency in Germany, and I'm sure we are not the only company that has to deal with UTF-8 XML files with a BOM.

Regards,
Ralf
Comment 10 Maxim Monastirsky 2013-12-05 16:39:57 UTC
(In reply to comment #9)
> Maxim, please not this fast, the other thread is dead since 2012.
Sorry, but those are the rules of this bug tracker. A new bug should be closed as a duplicate of the older one. See https://wiki.documentfoundation.org/QA/Bugzilla/FAQ#How_to_Mark_a_Bug_as_Duplicate

Anyway, I submitted a patch for Bug 46262 which should also fix detection of your file:
https://gerrit.libreoffice.org/#/c/6942/