Bug 101975 - Problem with utf8 html-xls to csv conversion
Summary: Problem with utf8 html-xls to csv conversion
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Calc (show other bugs)
(earliest affected) release
Hardware: x86-64 (AMD64) Linux (All)
: medium normal
Assignee: Not Assigned
Depends on:
Blocks: CSV-Export
  Show dependency treegraph
Reported: 2016-09-07 19:38 UTC by viacheslav.sychov
Modified: 2018-07-21 02:39 UTC (History)
2 users (show)

See Also:
Crash report or crash signature:

input file (283 bytes, application/vnd.ms-excel)
2016-09-07 19:38 UTC, viacheslav.sychov
output file (134 bytes, application/vnd.ms-excel)
2016-09-07 19:39 UTC, viacheslav.sychov

Note You need to log in before you can comment on or make changes to this bug.
Description viacheslav.sychov 2016-09-07 19:38:22 UTC
Created attachment 127205 [details]
input file


I got problem with file conversion from html-xls format to csv, when I try convert file, using command:

# libreoffice --headless --convert-to csv --outdir . test.xls

convert /root/pc/test.xls -> /root/pc//test.csv using filter : Text - txt - csv (StarCalc)

Output file have strange chars (0xd1 0x3f), instead chars from origin files.

(original char) -> (replaced char)
0xd1 0x81 -> 0xd1 0x3f
0xd1 0x8d -> 0xd1 0x3f
0xd1 0x8f -> 0xd1 0x3f

0xd0 0x90 -> 0xd0 0x3f 
0xd0 0x81 -> 0xd0 0x3f 
0xd0 0xd9 -> 0xd0 0x3f 

Output file (base64):
Comment 1 viacheslav.sychov 2016-09-07 19:39:16 UTC
Created attachment 127206 [details]
output file
Comment 2 viacheslav.sychov 2016-09-07 19:42:10 UTC
Version: LibreOffice 066b007f5ebcc236395c7d282ba488bca6720265
Kernel: 3.10.0-327.28.3.el7.x86_64
Comment 3 Buovjaga 2016-09-30 11:26:29 UTC

Arch Linux 64-bit, KDE Plasma 5
Build ID: 8e812b87ff7f8c5bf2c6f8858646c55effd2eea3
CPU Threads: 8; OS Version: Linux 4.7; UI Render: default; 
Locale: fi-FI (fi_FI.UTF-8); Calc: group
Built on September 30th 2016

Arch Linux 64-bit
Version (Build ID: e183d5b)
Comment 4 QA Administrators 2018-07-21 02:39:23 UTC
** Please read this message in its entirety before responding **

To make sure we're focusing on the bugs that affect our users today, LibreOffice QA is asking bug reporters and confirmers to retest open, confirmed bugs which have not been touched for over a year.

There have been thousands of bug fixes and commits since anyone checked on this bug report. During that time, it's possible that the bug has been fixed, or the details of the problem have changed. We'd really appreciate your help in getting confirmation that the bug is still present.

If you have time, please do the following:

Test to see if the bug is still present with the latest version of LibreOffice from https://www.libreoffice.org/download/

If the bug is present, please leave a comment that includes the information from Help - About LibreOffice.
If the bug is NOT present, please set the bug's Status field to RESOLVED-WORKSFORME and leave a comment that includes the information from Help - About LibreOffice.

Please DO NOT

Update the version field
Reply via email (please reply directly on the bug tracker)
Set the bug's Status field to RESOLVED - FIXED (this status has a particular meaning that is not 
appropriate in this case)

If you want to do more to help you can test to see if your issue is a REGRESSION. To do so:
1. Download and install oldest version of LibreOffice (usually 3.3 unless your bug pertains to a feature added after 3.3) from http://downloadarchive.documentfoundation.org/libreoffice/old/

2. Test your bug
3. Leave a comment with your results.
4a. If the bug was present with 3.3 - set version to 'inherited from OOo';
4b. If the bug was not present in 3.3 - add 'regression' to keyword

Feel free to come ask questions or to say hello in our QA chat: https://kiwiirc.com/nextclient/irc.freenode.net/#libreoffice-qa

Thank you for helping us make LibreOffice even better for everyone!

Warm Regards,
QA Team