Bug 132272 - strange hunspell behavior
Summary: strange hunspell behavior
Status: RESOLVED INVALID
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Linguistic (show other bugs)
Version:
(earliest affected)
6.4.3.2 release
Hardware: All Windows (All)
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2020-04-20 10:30 UTC by Mauro Trevisan
Modified: 2020-05-02 08:02 UTC (History)
2 users (show)

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Mauro Trevisan 2020-04-20 10:30:57 UTC
Description:
I have this affix file

SFX A Y 1
SFX A e o/C e

SFX B Y 1
SFX B e i/C e

SFX C Y 3
SFX C o l o
SFX C i n i
SFX C e r e
And this dictionary file

ble/CAB

Now, I expect Writer to recognize these words: ble, blr, blo, bll, bli, bln

But something strange happens, the recognized words are only: ble, blr, blo, bll, as the third flag, B, is not taken into consideration.

Moreover, if I switch the last two flags (having ble/CBA), then only these words are recognized: ble, blr, bli, bln, as if A is not taken into consideration.

I have the last LibreOffice package (but the same error occurs also with 5.3):

Version: 6.4.3.2 (x64)
Build ID: 747b5d0ebf89f41c860ec2a39efd7cb15b54f2d8
CPU threads: 4; OS: Windows 10.0 Build 18363; UI render: default; VCL: win; 
Locale: en-US (it_IT); UI-Language: en-US
Calc: threaded
Am I missing some functions of hunspell that prevents successive application of a flag if the one that precedes it is twofolded?

This error occurs even without the C flag (bla/AB and bla/BA)

Also by putting the C flag in the last position (bla/ABC and bla/BAC) the error remains, and the inflections produced by the C rule are correctly recognized.

Thank you

Steps to Reproduce:
Load the affix and dictionary file.
Write the words ble, blr, blo, bll, bli, bln

Actual Results:
Recognized words are ble, blr, blo, bll

Expected Results:
Recognized words are ble, blr, blo, bll, bli, bln


Reproducible: Always


User Profile Reset: No



Additional Info:
[Information automatically included from LibreOffice]
Locale: en-US
Module: StartModule
[Information guessed from browser]
OS: Windows (All)
OS is 64bit: no
Comment 1 Mauro Trevisan 2020-05-01 11:23:23 UTC
I think it has something to do with the way I indicate the flags...
This one is not working (dictionary: aver/ẋ1):

  SET UTF-8
  
  FLAG long
  
  SFX ẋ1 Y 1
  SFX ẋ1 er aré/gāĪ4I4 aver
  
  PFX gā Y 1
  PFX gā 0 g .
  
  SFX Ī4 Y 1
  SFX Ī4 0 -u .
  
  SFX I4 Y 1
  SFX I4 0 u .

Instead this one works (dictionary: aver/x):

  SET UTF-8
  
  SFX x Y 1
  SFX x er aré/gIi aver
  
  PFX g Y 1
  PFX g 0 g .
  
  SFX I Y 1
  SFX I 0 -u .
  
  SFX i Y 1
  SFX i 0 u .
Comment 2 Mauro Trevisan 2020-05-02 08:02:44 UTC
Invalid bug, since the hunspell specification declares the long flags to be composed by a double ASCII character.