Bug 118601 - word in Greek with all letters capitalized is correct with or without accent (diacritical marks)
Summary: word in Greek with all letters capitalized is correct with or without accent ...
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Localization (show other bugs)
Version:
(earliest affected)
Inherited From OOo
Hardware: All All
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: Dictionaries Diacritics
  Show dependency treegraph
 
Reported: 2018-07-07 05:25 UTC by Ηλίας Ηλιάδης
Modified: 2024-05-24 14:06 UTC (History)
3 users (show)

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Ηλίας Ηλιάδης 2018-07-07 05:25:17 UTC
Description:
In Greek, words are considered orthographically correct even if they do not have accents (diacritical marks). LibreOffice reports them as non orthographically correct.

Steps to Reproduce:
1.Write any accented word in Greek: διακριτικός
2.Write the same word with all caps with accented letter: ΔΙΑΚΡΙΤΙΚΌΣ
3.Write the same word with all caps with non accented letter: ΔΙΑΚΡΙΤΙΚΟΣ

Actual Results:
LibreOffice report the non accented word with all caps (ΔΙΑΚΡΙΤΙΚΟΣ) as orthographic error.

Expected Results:
A word should not be reported as orthographic error in both cases (with or without accent)


Reproducible: Always


User Profile Reset: Yes



Additional Info:
Deactivating check for capitalized words is not a solution since the orthographic check is needed. An algorithm that will convert accented letters to non accented and will consider both words (with or without accent) as correct, needs to be implemented. I do not know about other languages, but it is possible that still exist some languages that the capitalized version of words is not considered as error with or without accent.
The problem is also bound to the "all caps" option. Most people want these "temporary,just for print, transformations" to drop the accent, since they prefer capitalized words without accents.
Comment 1 Jacques Guilleron 2018-07-07 11:14:25 UTC
Hi Ηλίας Ηλιάδης,

The unaccented form of the word is not recognized by the dictionary. You can add it yourselfto the dictionairy. Be sure, however, there will be no ombiguity.
Comment 2 Ηλίας Ηλιάδης 2018-07-08 04:12:47 UTC
Hi  Jacques Guilleron,
Please consider the fact that the report is not about my personal dictionary. If there is another place that this bug should be reported I will be glad to report if informed.
Comment 3 Ηλίας Ηλιάδης 2018-07-08 04:17:57 UTC
Also this was not reproduced at almost all MS Word versions I was used in the past, thus many converted docs are now invalidated. (I do not use it anymore so I have no info about it's current behavior)
Comment 4 Xisco Faulí 2018-07-09 11:02:14 UTC
You can't confirm your own bugs. Moving it back to UNCONFIRMED until someone
else confirms it.You can't confirm your own bugs. Moving it back to UNCONFIRMED until someone
else confirms it.
Comment 5 Ηλίας Ηλιάδης 2018-07-12 03:50:13 UTC
Sorry if I was misunderstood, because of the limitation of spoken language. I confirm the bug. My last comment (comment 3 2018-07-08 04:17:57 UTC) was about the existence of this "feature" in Microsoft's products.
Comment 6 Buovjaga 2018-07-18 13:14:14 UTC
(In reply to Ηλίας Ηλιάδης from comment #0)
> Steps to Reproduce:
> 1.Write any accented word in Greek: διακριτικός
> 2.Write the same word with all caps with accented letter: ΔΙΑΚΡΙΤΙΚΌΣ
> 3.Write the same word with all caps with non accented letter: ΔΙΑΚΡΙΤΙΚΟΣ

I installed hunspell-el. LibreOffice reports both all caps versions as incorrect.

Anyway, you are a native Greek speaking individual, so I believe you -> NEW.

Arch Linux 64-bit
Version: 6.2.0.0.alpha0+
Build ID: 860a9daf2b45942a4b10ff22d36aa3fe29be19f4
CPU threads: 8; OS: Linux 4.17; UI render: default; VCL: gtk3; 
Locale: fi-FI (fi_FI.UTF-8); Calc: group threaded
Built on July 14th 2018
Comment 7 Ηλίας Ηλιάδης 2018-07-22 05:39:08 UTC
I think a small script that will remove the diacritics from dictionary words before comparing them with the capitalized version can do the trick.

For removing of diacritics (accent) there is a working LUA table in https://el.wiktionary.org/wiki/Module:Kleida-el used for correct dictionary sorting.

These "maps" can easily be extracted and converted to upper case for the final "check".

The idea is that all words in dictionary be capitalized and then apply to them a "remove accent" script. The "full" capitalized word in LibreOffice can then be checked against these "transformed" words.

Although in the dictionary sorting the case is irrelevant, for correct "orthographic checking" is better to use a "to upper" approach, since final "ς" will be correctly handled in the capitalized version.
Comment 8 QA Administrators 2019-07-30 03:13:58 UTC Comment hidden (obsolete)
Comment 9 alex 2021-07-08 10:36:55 UTC
Hi!
This problem still exists. This is my first time reporting a problem in an open source project, so let me know if you would like more information.


Version: 7.1.3.2 / LibreOffice Community
Build ID: 10(Build:2)
CPU threads: 12; OS: Linux 5.12; UI render: default; VCL: gtk3
Locale: el-GR (el_GR.UTF-8); UI: el-GR
7.1.3-2
Calc: CL
Comment 10 QA Administrators 2023-07-09 03:16:14 UTC
Dear Ηλίας Ηλιάδης,

To make sure we're focusing on the bugs that affect our users today, LibreOffice QA is asking bug reporters and confirmers to retest open, confirmed bugs which have not been touched for over a year.

There have been thousands of bug fixes and commits since anyone checked on this bug report. During that time, it's possible that the bug has been fixed, or the details of the problem have changed. We'd really appreciate your help in getting confirmation that the bug is still present.

If you have time, please do the following:

Test to see if the bug is still present with the latest version of LibreOffice from https://www.libreoffice.org/download/

If the bug is present, please leave a comment that includes the information from Help - About LibreOffice.
 
If the bug is NOT present, please set the bug's Status field to RESOLVED-WORKSFORME and leave a comment that includes the information from Help - About LibreOffice.

Please DO NOT

Update the version field
Reply via email (please reply directly on the bug tracker)
Set the bug's Status field to RESOLVED - FIXED (this status has a particular meaning that is not 
appropriate in this case)


If you want to do more to help you can test to see if your issue is a REGRESSION. To do so:
1. Download and install oldest version of LibreOffice (usually 3.3 unless your bug pertains to a feature added after 3.3) from https://downloadarchive.documentfoundation.org/libreoffice/old/

2. Test your bug
3. Leave a comment with your results.
4a. If the bug was present with 3.3 - set version to 'inherited from OOo';
4b. If the bug was not present in 3.3 - add 'regression' to keyword


Feel free to come ask questions or to say hello in our QA chat: https://web.libera.chat/?settings=#libreoffice-qa

Thank you for helping us make LibreOffice even better for everyone!

Warm Regards,
QA Team

MassPing-UntouchedBug