Bug 163346 - AutoCorrect Options; when the word "บรเพ็ชร" is entered, it becomes "บรเพชร"
Summary: AutoCorrect Options; when the word "บรเพ็ชร" is entered, it becomes "บรเพชร"
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Calc (show other bugs)
Version:
(earliest affected)
24.8.1.2 release
Hardware: x86-64 (AMD64) All
: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2024-10-08 10:25 UTC by jteera5
Modified: 2024-10-12 17:30 UTC (History)
4 users (show)

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description jteera5 2024-10-08 10:25:02 UTC
When the word บรเพ็ชร is entered (typed) into a cell, it becomes บรเพชร.

บรเพ็ชร can be copied from another cell correctly. The original บรเพ็ชร was an old entry entered with an older version of LibreOffice.

The text บรเพ็ชร can be copied into a cell that's not in edit mode. If it's in edit mode บรเพ็ชร becomes บรเพชร after pressing Enter, Tab or space bar.
Comment 1 jteera5 2024-10-08 10:27:53 UTC
The problem occurs with or without Hunspell Spellchecker enabled.
No Thai Dictionary was loaded.
Comment 2 jteera5 2024-10-08 10:29:45 UTC
Same problem with the word เพ็ชร.
Comment 3 Julien Nabet 2024-10-08 18:44:47 UTC
Could you try to rename your LO directory profile? (see https://wiki.documentfoundation.org/QA/FirstSteps#Corrupted_user_profile)
Comment 4 ady 2024-10-08 20:32:08 UTC
Which font are you using in the specific cell where this problem is seen?

Please open the Fonts folder of your OS; is the specific font listed as installed in that Font folder? Please note that the fact that LO lists the font within its own UI does not necessarily imply that the specific font is actually installed in your OS.

What happens if you change the font of a specific cell and start the procedure again (i.e. typing-in the intended word from scratch on a new cell with a different font)? Is the problem present with _every_ font (i.e. on each test, start the original procedure again, on a new cell with a different font before you start typing the intended text in that cell)?

This should also be tested with a recent daily master alpha (LO 25.2, ATM).
Comment 5 jteera5 2024-10-10 10:01:53 UTC
(In reply to Julien Nabet from comment #3)
> Could you try to rename your LO directory profile? (see
> https://wiki.documentfoundation.org/QA/FirstSteps#Corrupted_user_profile)

I renamed the C:\Users\Teerapon\AppData\Roaming\LibreOffice\4 and restarted LO, it created a new profile folder. The problem still exists.
Comment 6 jteera5 2024-10-10 10:04:29 UTC
(In reply to ady from comment #4)
> Which font are you using in the specific cell where this problem is seen?
> 
> Please open the Fonts folder of your OS; is the specific font listed as
> installed in that Font folder? Please note that the fact that LO lists the
> font within its own UI does not necessarily imply that the specific font is
> actually installed in your OS.

The font I used is Angsana New which is a standard Windows Thai font. The font is present in C:\Windows\Fonts as angsana.ttc.

> 
> What happens if you change the font of a specific cell and start the
> procedure again (i.e. typing-in the intended word from scratch on a new cell
> with a different font)? Is the problem present with _every_ font (i.e. on
> each test, start the original procedure again, on a new cell with a
> different font before you start typing the intended text in that cell)?
> 

I changed the font of a specific cell to Tahoma and typed เพ็ชร and the word changed to เพชร as soon as I type space or Enter.
I started over on a new cell with some other fonts. The problem is present with Microsoft Sans Serif, Arial and Browallia New (another standard Windows Thai font). All font files are in C:\Windows\Fonts as micross.ttf, arial.ttf and browalia.ttc.

> This should also be tested with a recent daily master alpha (LO 25.2, ATM).
I couldn't find LO 25.2, so I tested with LO 24.8.2.1 (x86_64) which is the latest on LO download page.
Comment 7 ady 2024-10-10 18:40:19 UTC
(In reply to jteera5 from comment #6)

> I couldn't find LO 25.2, so I tested with LO 24.8.2.1 (x86_64) which is the
> latest on LO download page.

Please see <https://wiki.documentfoundation.org/Testing_Daily_Builds>


Additional points to test would be under menu Tools:

* Autoinput;
* Autocorrect Options (e.g. some wrongly-introduced "Replace");
* Auto-redact.

It might be worth testing what happens if _no_ dictionary at all is enabled (not even a custom dictionary).

Perhaps someone in the Localized users mailing list might be able to replicate the problem (or report that they do not reproduce it), or in <https://ask.libreoffice.org> (for the adequate language).
Comment 8 ady 2024-10-10 18:42:54 UTC
JIC, please try LO > menu Help > Restart in Safe mode > Continue in safe mode. Then repeat the original procedure, introducing the intended text in a cell and checking whether the problem is still reproduced.
Comment 9 jteera5 2024-10-12 09:14:38 UTC
(In reply to ady from comment #7)
> (In reply to jteera5 from comment #6)
> 
> > I couldn't find LO 25.2, so I tested with LO 24.8.2.1 (x86_64) which is the
> > latest on LO download page.
> 
> Please see <https://wiki.documentfoundation.org/Testing_Daily_Builds>
> 
> 
> Additional points to test would be under menu Tools:
> 
> * Autoinput;
> * Autocorrect Options (e.g. some wrongly-introduced "Replace");
> * Auto-redact.
> 
> It might be worth testing what happens if _no_ dictionary at all is enabled
> (not even a custom dictionary).
> 
> Perhaps someone in the Localized users mailing list might be able to
> replicate the problem (or report that they do not reproduce it), or in
> <https://ask.libreoffice.org> (for the adequate language).

I installed LO 25.2 and in Autocorrect Options, I found and entry to replace .*เพ็ชร.* with เพชร. After I deleted the entry the problem disapppeared.

The words เพชร and เพ็ชร are both used in Thai language. While เพชร is the correct spelling. เพ็ชร is also used in many names. I think the entry should be removed from future LO installation file. In the meantimes I will remove the entry from my LO installations.
Thank you very much.
Comment 10 Julien Nabet 2024-10-12 10:00:17 UTC
Ok I found the replacement for เพ็ชร here:
https://opengrok.libreoffice.org/xref/core/extras/source/autocorr/lang/th/DocumentList.xml?r=4c91770d#428

I haven't found one for "บรเพ็ชร".

Autocorrect entries added with 4c91770dde40d248af736436716994dc367a54a2
Add Thai AutoCorrect data
As part of tdf#158454

Ilmari: I put you in cc because I couldn't cc Theppitak Karoonboonyanan
Comment 11 Buovjaga 2024-10-12 10:33:09 UTC
(In reply to Julien Nabet from comment #10)
> Ok I found the replacement for เพ็ชร here:
> https://opengrok.libreoffice.org/xref/core/extras/source/autocorr/lang/th/
> DocumentList.xml?r=4c91770d#428
> 
> I haven't found one for "บรเพ็ชร".
> 
> Autocorrect entries added with 4c91770dde40d248af736436716994dc367a54a2
> Add Thai AutoCorrect data
> As part of tdf#158454
> 
> Ilmari: I put you in cc because I couldn't cc Theppitak Karoonboonyanan

I found Theppitak.
Comment 12 Theppitak Karoonboonyanan 2024-10-12 15:56:00 UTC
Indeed, proper names can be exceptions to many typos. Here's why:

For people who are not speaking Thai, even "บรเพ็ชร" itself is considered a typo
of "บอระเพ็ด". But I guess this must be some surname with old-style spelling.
"เพ็ชร" and even "เพ็ชร์" are also common typos. And many proper names are based on
common typos, which could potentially introduce more removals of AutoCorrect
patterns in the future.

While I'm OK with the removal in this case, I'd say do expect more to come.