Download it now!
Bug 96499 - FILEOPEN: HTML format .xls file shows NUMERIC cell value while TEXT type is expected
Summary: FILEOPEN: HTML format .xls file shows NUMERIC cell value while TEXT type is e...
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Calc (show other bugs)
Version:
(earliest affected)
5.0.4.2 release
Hardware: All All
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: HTML-Import
  Show dependency treegraph
 
Reported: 2015-12-15 05:38 UTC by Kevin Suo
Modified: 2019-05-15 10:46 UTC (History)
3 users (show)

See Also:
Crash report or crash signature:


Attachments
test file (21.19 KB, application/vnd.ms-excel)
2015-12-15 05:38 UTC, Kevin Suo
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Kevin Suo 2015-12-15 05:38:48 UTC
Created attachment 121310 [details]
test file

Attached is an .xls file in html format. The file contains two columns:Bank Account Number and ID Card Number. We expect these two fields to be "text" type. 
However, when open with LibreOffice Calc, the cells are showing float point numeric values. In Microsoft Office and WPS Office the cells are showing as "text" types as expected.

Steps to reproduce:
1. Open the attached xls file in Calc;
2. Observe the cell values. 
--> They are showing as numeric. We expect the cells to be "text" values.

Version: 5.0.4.2
Build ID: 2b9802c1994aa0b7dc6079e128979269cf95bc78
Locale: zh-CN (zh_CN)
Win10 x64

PS. This issue was initially reported by libreoffice_xf in the LibreOffice Chinese Forum:
http://www.libreofficechina.org/thread-1390-1-1.html
Comment 1 Buovjaga 2015-12-21 18:02:13 UTC
Confirmed.

Win 7 Pro 64-bit Version: 5.2.0.0.alpha0+
Build ID: 014633f83e44ae8ba33087b6f38e8e253e281969
CPU Threads: 4; OS Version: Windows 6.1; UI Render: default; 
TinderBox: Win-x86@62-merge-TDF, Branch:MASTER, Time: 2015-12-15_06:21:44
Locale: fi-FI (fi_FI)
Comment 2 QA Administrators 2017-03-06 14:21:29 UTC Comment hidden (obsolete)
Comment 3 Kevin Suo 2017-03-08 08:45:33 UTC
Bug still exists in the latest master.


---------------

MORE INFO:
With a debug run, I get the follow warning when open the attached test document in Calc:

warn:legacy.tools:19891:1:editeng/source/editeng/eehtml.cxx:54: EditHTMLParser::EditHTMLParser: Where does the encoding come from?
warn:svtools:19891:1:svtools/source/svhtml/parhtml.cxx:1427: GetOption: unknown HTML option
warn:svtools:19891:1:svtools/source/svhtml/parhtml.cxx:1427: GetOption: unknown HTML option
<same line repeated 20 times>
Comment 4 QA Administrators 2018-08-22 02:35:38 UTC Comment hidden (obsolete)
Comment 5 Kevin Suo 2019-05-15 10:46:43 UTC
(In reply to QA Administrators from comment #4)

Still reproducible in

Version: 6.3.0.0.alpha1+
Build ID:d2fa9c0d657877c967e41fdd0091f81d1b7ca048
CPU 线程:4; 操作系统:Linux 4.18; UI 渲染:默认; VCL: gtk3; 
Locale: zh-CN (zh_CN.UTF-8); UI-Language: zh-CN
Calc: threaded
Ubuntu 18.04 LTS X64