Bug 152941 - Some synonyms are not presented
Summary: Some synonyms are not presented
Status: RESOLVED NOTOURBUG
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer
Version (earliest affected): 7.4.2.3 release
Hardware: All All
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: Thesaurus
Reported: 2023-01-09 08:30 UTC by Jeppe Bundsgaard
Modified: 2023-01-09 13:00 UTC

See Also:
Crash report or crash signature:


Description Jeppe Bundsgaard 2023-01-09 08:30:04 UTC
Description:
I have created a new thesaurus for Danish based on an open-source, WordNet-like resource: DanNet. My scripts are here: https://github.com/jeppebundsgaard/DanNet2HunspellThesaurus, including a .oxt file that can be imported into LibreOffice.

The .dat and .idx files look right, and many of the words in my test document are given synonyms. But some are not, and I don't see any pattern in which words work and which do not.

Steps to Reproduce:
1. Install the extension https://github.com/jeppebundsgaard/DanNet2HunspellThesaurus/blob/main/da_DK%20incl%20dannet%20thesaurus.oxt
2. Create a text document with this content:
Words given synonyms
Stol
hustru
hus
kærlighed 
kage 
magelig
arm
folkeaktie
y-akse
gryde
klaver
45-knallert

Words not given synonyms
akse
aktie
abe
abc
ankomme

3. Right-click each word and see that the words in the first list have synonyms, while the ones in the second list don't.

Actual Results:
Words from the first list have synonyms, words from the second don't.

Expected Results:
All words are in the .idx and .dat files and should have synonyms.


Reproducible: Always


User Profile Reset: Yes

Additional Info:
Version: 7.4.2.3 / LibreOffice Community
Build ID: 40(Build:3)
CPU threads: 8; OS: Linux 5.19; UI render: default; VCL: gtk3
Locale: da-DK (da_DK.UTF-8); UI: da-DK
Ubuntu package version: 1:7.4.2~rc3-0ubuntu1
Calc: threaded
Comment 1 Stéphane Guillou (stragu) 2023-01-09 09:09:40 UTC
Reproduced with:

Version: 7.6.0.0.alpha0+ (X86_64) / LibreOffice Community
Build ID: ec2f1d73936c9d8cee83c0887170e9ecb8f044ba
CPU threads: 8; OS: Linux 5.15; UI render: default; VCL: gtk3
Locale: en-AU (en_AU.UTF-8); UI: en-US
Calc: threaded

Not sure if the issue lies in LO or the extension.
Comment 2 Jeppe Bundsgaard 2023-01-09 10:09:04 UTC
It turned out that there was actually a pattern in the words not given synonyms: they all come before the word Antananarivo, which was written with a capitalized first letter (it's the name of the capital of Madagascar, which is also called Tananarive, according to my now-functioning thesaurus).

Making all words lowercase seems to have solved the problem. I should have tested better before bothering you; I am sorry.
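
For anyone hitting the same symptom, here is a rough sketch of how a stray capitalized entry could be spotted in an index. It assumes, without this thread confirming it, that the thesaurus lookup does a binary search over the .idx entries compared as raw bytes, so one entry out of byte order can hide its neighbours. The layout assumed is: encoding on line 1, entry count on line 2, then one "word|offset" entry per line. The function names and the offsets in the sample are made up for illustration.

```python
def parse_idx_words(lines):
    """Return the headwords from the lines of a thesaurus .idx file.

    Assumed layout (not confirmed in this thread): line 1 is the
    encoding, line 2 the entry count, then "word|byte_offset" lines.
    """
    words = []
    for line in lines[2:]:
        line = line.rstrip("\n")
        if line:
            # Split on the last "|" so the offset is dropped.
            words.append(line.rsplit("|", 1)[0])
    return words


def out_of_order(words, encoding="utf-8"):
    """Return (previous, current) pairs that break ascending byte order."""
    keys = [w.encode(encoding) for w in words]
    return [
        (words[i - 1], words[i])
        for i in range(1, len(keys))
        if keys[i - 1] > keys[i]
    ]


# Hypothetical sample: the offsets are invented for illustration.
sample = ["UTF-8\n", "4\n", "abe|10\n", "akse|50\n",
          "Antananarivo|90\n", "arm|130\n"]
print(out_of_order(parse_idx_words(sample)))
```

In byte order an uppercase "A" sorts before every lowercase letter, so "Antananarivo" placed among the lowercase "a" words is reported as out of place. With every headword lowercased, the same check returns an empty list.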
Comment 3 Stéphane Guillou (stragu) 2023-01-09 13:00:15 UTC
No problem, glad it is resolved! Thanks for reporting back :)