Bug 90210 - Turkish character problem opening old Excel 4.0 xls fie
Summary: Turkish character problem opening old Excel 4.0 xls fie
Status: RESOLVED DUPLICATE of bug 132796
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Calc (show other bugs)
Version:
(earliest affected)
4.4.0.3 release
Hardware: Other Windows (All)
: medium major
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2015-03-24 21:56 UTC by Burak Ural
Modified: 2021-09-03 07:27 UTC (History)
4 users (show)

See Also:
Crash report or crash signature:


Attachments
Sample File produced in Excel 4.0 format. (25.50 KB, application/vnd.ms-excel)
2015-03-24 21:56 UTC, Burak Ural
Details
View of file opened by LibreOffice (125.35 KB, image/jpeg)
2015-03-25 07:54 UTC, Burak Ural
Details
View of file opened by Excel 2003 (113.71 KB, image/jpeg)
2015-03-25 07:54 UTC, Burak Ural
Details
File opened by Excel 2003 and saved in Excel 2003 xls format. Can be opened by LO without any character problems. (17.50 KB, application/vnd.ms-excel)
2015-03-25 07:55 UTC, Burak Ural
Details
printscreen from excel2010 (50.00 KB, image/png)
2015-03-25 09:00 UTC, raal
Details
View of 1st sample file by Office Starter 2010. Fonts displayed correctly. (132.19 KB, image/jpeg)
2015-03-25 09:34 UTC, Burak Ural
Details

Note You need to log in before you can comment on or make changes to this bug.
Description Burak Ural 2015-03-24 21:56:13 UTC
Created attachment 114314 [details]
Sample File produced in Excel 4.0 format.

Hi,

We are using a crm software that uses a converter producing xls reports in Excel 4.0 format (At least that is what Excel 2003 says while trying to save it.)

The file is opened by Libreoffice with a bug that some Turkish characters are not being displayed correctly.
In Excel 2003 , the file opens without any character problems.

If need anymore info, please ask me.

Thanks,
Burak
Comment 1 raal 2015-03-24 22:01:58 UTC
Please attach a screenshot from excel and LO to see where is difference. Thank you.
Comment 2 Burak Ural 2015-03-25 07:54:17 UTC
Created attachment 114323 [details]
View of file opened by LibreOffice
Comment 3 Burak Ural 2015-03-25 07:54:53 UTC
Created attachment 114324 [details]
View of file opened by Excel 2003
Comment 4 Burak Ural 2015-03-25 07:55:52 UTC
Created attachment 114325 [details]
File opened by Excel 2003 and saved in Excel 2003 xls format. Can be opened by LO without any character problems.
Comment 5 raal 2015-03-25 09:00:28 UTC
Created attachment 114327 [details]
printscreen from excel2010

In excel 2010 are characters also not correct.

https://en.wikipedia.org/wiki/Microsoft_Excel#Excel_4.0_.281992.29
Excel 4.0 (1992)

Leaving unconfirmed,  I have not excel2003.
Comment 6 Burak Ural 2015-03-25 09:29:10 UTC
I can see characters are correct on windows 2010. ??

Maybe it is because the fonts or font substitutions are not installed for Turkish language in your computer.
Can you open the 4th in the attachment list correctly ?
If works fine in LO because it is Excel 2003 format. Can you see letters fine using that file?

I also suspect this is because of non-unicode Turkish font characters.
In Excel 4.0, there probably was no Unicode support.
So when opening the file , if there is no non-unicode Turkish fonts installed in OS , you might not see them.
There is probably some non-unicode conversion being made wrong so that Turkish characters are displayed wrongly.
Comment 7 Burak Ural 2015-03-25 09:34:04 UTC
Created attachment 114329 [details]
View of 1st sample file by Office Starter 2010. Fonts displayed correctly.
Comment 8 raal 2015-03-25 09:52:48 UTC
(In reply to Burak Ural from comment #6)
Hello,

> I can see characters are correct on windows 2010. ??
> 
> Maybe it is because the fonts or font substitutions are not installed for
> Turkish language in your computer.
My language is not Turkish

> Can you open the 4th in the attachment list correctly ?

I can see file "Turkish character 2003.xls" correct in excel2010 and LO 4.4.1.2 (letter Ş)

> If works fine in LO because it is Excel 2003 format. Can you see letters
> fine using that file?

Yes
Comment 9 Burak Ural 2015-03-25 11:25:45 UTC
In this case, my best guess is that there is a non-unicode conversion problem in LO , concerning non-unicode Excel documents.

I remember that using old formats like excel 95 was causing troubles while I was using openoffice.org , I guess this bug still comes from the past.
I had a macro that was converting letters automatically when the documents opened...
After Excel 97/2000/2003 format such problems did not existed anymore due to unicode support.

Well it would be nice if this conversion bug can be fixed...

Regards,
Burak
Comment 10 Julien Nabet 2015-03-25 21:35:13 UTC
On pc Debian x86-64 with master sources updated yesterday, I noticed this console log (5 times)
sc/source/filter/excel/xistream.cxx:736: read less bytes than requested
Comment 11 Robinson Tryon (qubit) 2015-04-17 21:36:55 UTC
TESTING with 4.4.2.2 + Ubuntu 14.04

(In reply to raal from comment #5)
> Created attachment 114327 [details]
> printscreen from excel2010
> 
> In excel 2010 are characters also not correct.

Maybe there's an issue in Excel 2010 as well? Ideally, we'd be able to open the file in LibreOffice either way.

The screenshots of the file in LibreOffice match up with my testing. If it's not a matter of missing/substituted fonts, then it appears to be an implementation issue.

Status -> NEW
Comment 12 Maxim Monastirsky 2015-12-09 22:45:07 UTC
This file seems to have CODEPAGE record which equals to 1252 (Latin I), and LO honors it, so NOTOURBUG I believe.
Comment 13 Maxim Monastirsky 2015-12-09 22:53:08 UTC
(In reply to Maxim Monastirsky from comment #12)
> This file seems to have CODEPAGE record which equals to 1252 (Latin I), and
> LO honors it, so NOTOURBUG I believe.
(And adding the ability to choose the encoding is already covered by Bug 35208, so the same way I could close it as a duplicate of Bug 35208.)
Comment 14 Maxim Monastirsky 2016-05-18 21:22:27 UTC
The workaround in Bug 35208 solves this one too.

*** This bug has been marked as a duplicate of bug 35208 ***
Comment 15 Maxim Monastirsky 2016-05-19 10:22:43 UTC

*** This bug has been marked as a duplicate of bug 37408 ***
Comment 16 Mike Kaganski 2021-09-03 07:27:33 UTC
This should follow the default document language setting now.

*** This bug has been marked as a duplicate of bug 132796 ***