Bug 35592

Summary: Sort unexpected for characters with diacritic and following letters
Product: LibreOffice
Reporter: Nikolay Morozov <nmorozov>
Component: Calc
Assignee: Andras Timar <timar74>
Status: RESOLVED NOTABUG
Severity: normal
CC: LibreOffice, lists, sophi
Priority: medium
Version: 3.3.2 release
Hardware: x86 (IA32)
OS: Windows (All)
Attachments: Sample document for Sort
another example
Sample documents, pls. see Comment 12

Description Nikolay Morozov 2011-03-23 06:33:40 UTC
Calc treats the Russian letters "И" and "Й" as the same letter ("И" is a vowel, but "Й" is a consonant). The same happens with "Е" and "Ё" (this causes only minor problems, because "Е" and "Ё" are similar letters).
If you change the sorting language to English (USA), the sort is correct. (Sometimes natural sort also helps.)
Tested with LibreOffice 3.3.2, 3.3.1 and OpenOffice 3.2.1.
But LibreOffice from the Ubuntu PPA works fine :)
Comment 1 Hidden 2011-05-02 02:02:06 UTC
I attest this behaviour as true crap.
Comment 2 Nikolay Morozov 2011-05-03 11:10:46 UTC
Why?
Comment 3 tester8 2011-06-19 13:52:26 UTC
LO 3.4.1 (OOO340m1 (Build:101))
Ubuntu 10.04.2 x86
Linux 2.6.32-32-generic Russian UI

Reproduced with Е and Ё.
NOT reproduced with И and Й.
Comment 4 Nikolay Morozov 2011-07-21 23:03:41 UTC
The bug with И and Й depends on the OS main language and the selected sorting language; sometimes it works fine, sometimes not.
For example, on Windows XP RUS + LO with the Russian interface, the bug appears with Russian sorting, but sorting is OK if you select English as the sorting language. On Ubuntu Linux the bug doesn't appear. It also doesn't appear on Windows XP Eng + LO Eng.
Comment 5 Rainer Bielefeld Retired 2011-07-27 04:47:10 UTC
Created attachment 49611 [details]
Sample document for Sort

Works fine for me with attached "sample3" and "LibreOffice 3.4.1 RC3 - WIN7  Home Premium (64bit) German UI [OOO340m1 (Build:203)]" with German UI and all language settings "somehow German".

Can you please try again with current version?
Comment 6 Nikolay Morozov 2011-07-28 00:12:12 UTC
Created attachment 49651 [details]
another example

This example reproduces the bug with Ё and ё and doesn't reproduce it with й and Й.
Tested with LibO 3.4.1 English on Win7 English.
Later I will try with the Russian interface for Windows and LibO.
Comment 7 Nikolay Morozov 2011-07-28 00:13:27 UTC
(In reply to comment #5)
> Created an attachment (id=49611) [details]
> Sample document for Sort
Your sample document isn't good, because you have only one letter per cell.
Comment 8 Nikolay Morozov 2011-07-28 00:29:38 UTC
Tested: WinXP Rus, LibO with Rus interface: the same. Е and Ё reproduced; И and Й appear fixed.
Comment 9 Rainer Bielefeld Retired 2011-07-28 00:33:38 UTC
So we can close this one as WORKSFORME for now?
Comment 10 Nikolay Morozov 2011-07-28 00:36:05 UTC
(In reply to comment #9)
> So we can close this one as WORKSFORME for now?

No, the bug is fixed only for Й and И,
but for Е and Ё it remains.

Partially fixed :)
Comment 11 Nikolay Morozov 2011-07-28 00:36:46 UTC
Just look at my example, not yours.
Comment 12 Rainer Bielefeld Retired 2011-07-28 01:43:11 UTC
[Reproducible] with "LibreOffice 3.4.1 RC3 - WIN7  Home Premium (64bit) German UI [OOO340m1 (Build:203)]" and the reporter's second sample: ё and е are not separated consistently by the sort if different content follows.

I created my own, more reduced sample showing the problem:
First column: letters with a following blank
Second column: with an additional following number
For all tests please first select A1:D27

a) When you sort by the first column, the beginning letters are separated consistently.
b) When you sort by the second column, the beginning letters alternate; the following characters have priority over the ё - е difference:

ё 23
е 24
ё 25
е 26

I see the same for other letters with diacritics and German umlauts: the following letter has sort priority over the with/without-diacritic difference.
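
The alternating order in b) is what a multi-level comparison produces: base letters of the whole string are compared first, and the diacritic only breaks ties. The following is a minimal Python sketch of that idea, not LibreOffice's actual collation code; `primary_key` and `collation_key` are hypothetical helper names, and stripping combining marks via NFD is an assumption made only for illustration. Note that this simplification also folds й into и, which is the other behaviour the reporter objected to.

```python
import unicodedata


def primary_key(text):
    """Strip combining marks so 'ё' compares like 'е' and 'é' like 'e'."""
    decomposed = unicodedata.normalize("NFD", text)
    # unicodedata.combining() is non-zero for combining marks such as U+0308
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def collation_key(text):
    # Level 1: base letters of the whole string.
    # Level 2: the original text, used only when level 1 ties.
    return (primary_key(text), text)


cells = ["ё 25", "е 24", "ё 23", "е 26"]

by_code_point = sorted(cells)                  # plain code point order
by_levels = sorted(cells, key=collation_key)   # two-level order

print(by_code_point)  # ['е 24', 'е 26', 'ё 23', 'ё 25']
print(by_levels)      # ['ё 23', 'е 24', 'ё 25', 'е 26']
```

With the two-level key, the trailing numbers decide the order before the ё/е difference is ever consulted, which reproduces the four rows shown above.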

Bug or feature? I can't tell.
All old LibO and OOo versions do it in the same way. 

@Sophie:
Can you help, or do you know someone who can?
Comment 13 Rainer Bielefeld Retired 2011-07-28 01:44:58 UTC
Created attachment 49658 [details]
Sample documents, pls. see Comment 12
Comment 14 Nikolay Morozov 2011-07-28 01:51:04 UTC
The letter ё is not a letter with a diacritic; it is an independent letter of the Russian alphabet. The same goes for the letter Й. Moreover, "И" is a vowel but "Й" is a consonant!!!
Comment 15 Rainer Bielefeld Retired 2011-07-28 02:14:53 UTC
I doubt that LibO has any idea concerning difference between vowels and consonants ;-)

But indeed, if LibO handles difference between ё and e in the same way as between é and e, that might be a completely separate problem.

Another effect is that LibO does not seem to separate characters "with some additional thingy" consistently.

I also asked for help on the documentation mailing list.

The problem is not limited to CALC, Writer tables seem to sort the same way.
Comment 16 David Nelson 2011-07-28 05:38:57 UTC
I have added myself to the CC list as a docs team custodian for this bug, as a placeholder for future action, as said in the docs ML thread:

http://nabble.documentfoundation.org/CALC-sort-order-priorities-tp3206125p3206125.html
Comment 17 Nikolay Morozov 2011-11-10 00:35:12 UTC
Reproduced with ё and е on 3.4.4, Russian Windows.
й and и seem fixed.
Comment 18 Björn Michaelsen 2011-12-23 11:52:02 UTC
[This is an automated message.]
This bug was filed before the changes to Bugzilla on 2011-10-16. Thus it
started right out as NEW without ever being explicitly confirmed. The bug is
changed to state NEEDINFO for this reason. To move this bug from NEEDINFO back
to NEW please check if the bug still persists with the 3.5.0 beta1 or beta2 prereleases.
Details on how to test the 3.5.0 beta1 can be found at:
http://wiki.documentfoundation.org/QA/BugHunting_Session_3.5.0.-1

more detail on this bulk operation: http://nabble.documentfoundation.org/RFC-Operation-Spamzilla-tp3607474p3607474.html
Comment 19 Björn Michaelsen 2011-12-23 17:01:20 UTC
needinfo keyword redundant by needinfo status.
Comment 20 Nikolay Morozov 2011-12-26 04:41:33 UTC
Reproduced with ё and е 3.5.0 Beta 1 Windows
Comment 21 Nikolay Morozov 2011-12-26 04:52:31 UTC
Beta 2 the same
Comment 22 Rainer Bielefeld Retired 2011-12-26 06:42:37 UTC
Problem as described in Comment 12 is still reproducible with a parallel Dev-Installation of "LibreOffice 3.5.0 Beta2 - WIN7 Home Premium (64bit) German UI [Build-ID : 8589e48-760cc4d-f39cf3d-1b2857e-60db978]", German and Russian locale settings.

I'm a little clueless how we can get some progress here.

@András:
Do you know how to or who can help?
Please feel free to reassign (or reset Assignee to default) if it’s not your area or if provided information is not sufficient. Please set Status to ASSIGNED if you accept this Bug.
Comment 23 Rainer Bielefeld Retired 2011-12-26 07:02:45 UTC
<http://wiki.documentfoundation.org/BugReport_Details#Version>
Comment 24 Urmas 2012-04-09 09:55:50 UTC
According to the 2006 version of rules, Ё is collated with Е, which reflects the current behaviour.
Comment 25 Nikolay Morozov 2012-04-09 10:12:11 UTC
According to the Russian language, Ё is not collated with Е. That simplification can be used elsewhere, but not in sorting.
Comment 26 Rainer Bielefeld Retired 2012-04-09 10:45:10 UTC
@Nikolay Morozov, @Urmas:
Some links concerning those rules would be useful.
Comment 27 Urmas 2012-04-16 03:27:44 UTC
"Правила русской орфографии и пунктуации. Полный академический справочник." — М., Эксмо, 2006

§6. [...] In dictionaries, words with the letter Ё are placed in the common alphabetical order together with words with the letter Е, e.g. еле, елейный, ёлка, еловый, елозить, ёлочка, ёлочный, ель; веселеть, веселить(ся), весёлость, весёлый, веселье [...]
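
The dictionary ordering quoted in §6 can be checked with a small Python sketch. It assumes lowercase input and relies on the fact that, apart from ё, the code points of the lowercase Russian letters follow alphabetical order; `YO_TO_YE` and `dictionary_key` are hypothetical names, and this is not how LibreOffice implements its collation. Unlike a general diacritic-stripping approach, it folds only ё into е and leaves й distinct from и.

```python
# Map ё/Ё to е/Е for the first comparison level only.
YO_TO_YE = str.maketrans({"ё": "е", "Ё": "Е"})


def dictionary_key(word):
    # Level 1: the word with ё treated as е.
    # Level 2: the original word, so that е sorts before ё on a tie.
    return (word.translate(YO_TO_YE), word)


words = ["ёлка", "ель", "еле", "ёлочный", "еловый", "елейный", "ёлочка", "елозить"]
ordered = sorted(words, key=dictionary_key)

print(ordered)
# ['еле', 'елейный', 'ёлка', 'еловый', 'елозить', 'ёлочка', 'ёлочный', 'ель']
```

The result matches the first example list in §6, which is the behaviour Comment 24 describes as current.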