Bug 151464

Summary: The find function doesn't distinguish between the vowels in នាម់ and នើម
Product: LibreOffice Reporter: todd_bequette
Component: LibreOfficeAssignee: Not Assigned <libreoffice-bugs>
Status: RESOLVED NOTABUG    
Severity: normal CC: ilmari.lauhakangas, vsfoote
Priority: medium    
Version: 7.3.2.2 release   
Hardware: All   
OS: All   
Whiteboard:
Crash report or crash signature: Regression By:

Description todd_bequette 2022-10-11 03:58:36 UTC
Description:
The 'Find' function in LibreOffice 7.3 doesn't distinguish between នាម់ and នើម. 

Steps to Reproduce:
1. Create a new text document in LibreOffice Writer
2. Copy this sample text for testing:
រានោក​កាដាច់​អាហាឆោរុស ម៝ត់​អ្វាញ់​លែៈ​គែស​ពែ​នាម់ ពាង់​នទ្រាប់​នាវ​ញេត​ឆោង​ក្វ៝ង់ អ៝ន់​មា​ទឹង​លែៈ​កោរាញ ទឹង​លែៈ​ឞូនុយ្ស​នទ្រ៝ង់​ក្វ៝ង់​អី​ឆើម​កើល​ជាន​ការ​មា​ពាង់​នើម នអ៝ប់​មា​មពោល​កោរាញ​តាហែន​ផូង​ពើឆី នទ្រែល​មា​ផូង​មេដី នអ៝ប់​មា​មពោល​នទ្រ៝ង់​ក្វ៝ង់ នទ្រែល​មា​មពោល​កោរាញ​ក្វ៝ង់​តាៈ​ឞើស​ទឹង​លែៈ​ឞ្រី​អី​ពាង់​ម៝ត់​អ្វាញ់​នៃ គុ​ឞើស​នាប់​ពាង់​ទាទេ​ដ៝ង់។  
3. Do separate searches for the terms នាម់ and នើម. 

Actual Results:
Searching for នាម់ finds នាម់ and នើម (2 hits). 
Searching for នើម finds នាម់ and នើម (2 hits).

Expected Results:
Searching for នាម់ should only find នាម់ (1 hit). 
Searching for នើម should only find នើម (1 hit).


Reproducible: Always


User Profile Reset: No



Additional Info:
None
Comment 1 Mike Kaganski 2022-10-11 04:34:44 UTC
For me, it only happens when "diacritic-sensitive" is unchecked in the Find & Replace dialog, in which case, it tries to find also Unicode combining marks. I don't know if the characters in question form such marks, though. But still, test with "diacritic-sensitive" checked.
Comment 2 V Stuart Foote 2022-10-12 03:06:43 UTC
Confirm with the STR when using the Find bar, i.e. by <Ctrl>+F launch.

However, the Find bar <ctrl>+F does not set the 'diacritic sensitive' mode for searches.

Using the full "Find and Replace" dialog <ctrl>+H where 'diacritic sensitive' is set by default in the "Other options" section--it in general works best for CTL and CJK search use. 

IHMO => NAB as the Khmer script's vowel signs are handled similar to combining diacritics requiring the use of the more functional Find and Replace dialog.

=-testing-=
Version: 7.5.0.0.alpha0+ (x64) / LibreOffice Community
Build ID: a7683a16e98def4198e1f74a93fecdaacbf639e7
CPU threads: 8; OS: Windows 10.0 Build 19044; UI render: Skia/Vulkan; VCL: win
Locale: en-US (en_US); UI: en-US
Calc: CL threaded

using Windows "Leelawadee UI" font for the Khmer script of the sample text.
Comment 3 Buovjaga 2023-02-21 09:07:36 UTC
I don't see any objections, so let's close