Bug 162514 - Spellchecker fails to recognize words ending in period (typically abbreviation)
Summary: Spellchecker fails to recognize words ending in period (typically abbreviation)
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer (show other bugs)
Version:
(earliest affected)
24.8.0.0 alpha1+
Hardware: All All
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: Spell-Checking
  Show dependency treegraph
 
Reported: 2024-08-18 18:32 UTC by Lars Jødal
Modified: 2024-09-17 04:27 UTC (History)
2 users (show)

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Lars Jødal 2024-08-18 18:32:03 UTC
Description:
In LO Writer 24.8.0.3, the spellchecker seems to have stopped recognizing words ending with a period. In a language like Danish, an ending period is a necessary part of many abbreviations. I have found it more difficult to find English examples (abbreviations like "i.e." and "e.g." does not seem to be included in the English standard dictionary).

As an example, "dr." is an abbreviation in Danish for "doktor" (same word as English "doctor"). This abbreviation has been recognized in LO Writer version 24.2 and earlier, but not with 24.8.0, using the same dictionary.

Interestingly, the word IS recognized in Calc and Impress, so it seems to be a problem specifically for Writer.

Steps to Reproduce:
1. Open LO Writer.
2. To use my example: Type "dr." (or "etc.") and change spell-check language to Danish

Actual Results:
The word "dr." (or the word "etc.") is underlined in red as a spellchecking error.

Expected Results:
The word should be recognized by the spellchecker, as it is in the spellchecking dictionary. This is corroborated by right-click on the word for suggestions: "dr." (or "etc.") is among the suggestions.


Reproducible: Always


User Profile Reset: Yes

Additional Info:
Tested with LO 24.8.0.3 and 24.8.0.0.Alpha1:

Version: 24.8.0.3 (X86_64) / LibreOffice Community
Build ID: 0bdf1299c94fe897b119f97f3c613e9dca6be583
CPU threads: 4; OS: Windows 10 X86_64 (10.0 build 19045); UI render: Skia/Raster; VCL: win
Locale: da-DK (da_DK); UI: da-DK
Calc: threaded

Version: 24.8.0.0.alpha1 (X86_64) / LibreOffice Community
Build ID: a17e39caaf73108bee692d6f64a44c62f4066f1d
CPU threads: 4; OS: Windows 10 X86_64 (10.0 build 19045); UI render: Skia/Raster; VCL: win
Locale: da-DK (da_DK); UI: en-GB
Calc: threaded
Comment 1 Jeppe Bundsgaard 2024-09-12 21:56:19 UTC
I can confirm on: 
Version: 24.8.0.3 (X86_64) / LibreOffice Community
Build ID: 480(Build:3)
CPU threads: 8; OS: Linux 6.8; UI render: default; VCL: gtk3
Locale: da-DK (da_DK.UTF-8); UI: da-DK
Ubuntu package version: 4:24.8.0~rc3-0ubuntu0.24.04.1~lo2
Calc: threaded

It is a really serious regression.
Comment 2 Jeppe Bundsgaard 2024-09-14 11:29:53 UTC
I downloaded the latest Danish spellchecker here: https://stavekontrolden.dk/?dictionaries=1

I installed it in LibreOffice 24.8.1.2.
The correct abbreviation "osv." is considered a spelling mistake. 

I extracted da_DK.dic and .aff and used it in hunspell:

> jeppe@d46703:~$ hunspell -d /home/jeppe/Downloads/da_DK
> Hunspell 1.7.2
> osv.
> *

As you see, it is considered correct. This points to LibreOffice 24.8.1.2 having a bug.
Comment 3 Shantanu 2024-09-17 04:27:31 UTC
The abbreviation 'मा.' stands for the Marathi word 'माननीय' (respected). While it is included in the spell check, it has not been supported by any version I have tested so far.

Version: 24.8.1.2 (X86_64) / LibreOffice Community
Build ID: 87fa9aec1a63e70835390b81c40bb8993f1d4ff6
CPU threads: 1; OS: Windows 10 X86_64 (10.0 build 14393); UI render: Skia/Raster; VCL: win
Locale: en-US (en_US); UI: en-US
Calc: threaded

Version: 24.2.1.2 (AARCH64) / LibreOffice Community
Build ID: 420(Build:2)
CPU threads: 2; OS: Linux 6.5; UI render: default; VCL: gtk3
Locale: en-US (C.UTF-8); UI: en-US
Ubuntu package version: 4:24.2.1~rc2-0ubuntu0.22.04.1~lo1
Calc: threaded

Version: 7.6.7.2 (X86_64) / LibreOffice Community
Build ID: dd47e4b30cb7dab30588d6c79c651f218165e3c5
CPU threads: 1; OS: Windows 10.0 Build 14393; UI render: Skia/Raster; VCL: win
Locale: en-US (en_US); UI: en-US
Calc: threaded

There are only a few such words in Marathi, which is why I had not reported it earlier.