121779 – Autocorrect: an option to avoid auto-capitalization of some words

Bug 121779 - Autocorrect: an option to avoid auto-capitalization of some words

Summary: Autocorrect: an option to avoid auto-capitalization of some words

Status:	RESOLVED FIXED

Alias:	None

Product:	LibreOffice
Classification:	Unclassified
Component:	Writer (show other bugs)
Version: (earliest affected)	Inherited From OOo
Hardware:	All All

Importance:	medium enhancement
Assignee:	Mike Kaganski

URL:
Whiteboard:	target:6.3.0
Keywords:

Depends on:
Blocks:	AutoCorrect-Complete
	Show dependency tree / graph

Reported:	2018-11-29 06:39 UTC by Mike Kaganski
Modified:	2022-08-19 08:53 UTC (History)
CC List:	4 users (show)

See Also:	127644 121913
Crash report or crash signature:

Attachments
Add an attachment (proposed patch, testcase, etc.)

Note You need to log in before you can comment on or make changes to this bug.

Description Mike Kaganski 2018-11-29 06:39:33 UTC

As per https://ask.libreoffice.org/en/question/174175:

There are some words, that explicitly must start with a lowercase character, and may have some or all other characters uppercase: mRNA, tRNA, iPhone, fMRI. They may get auto-capitalized as the result of applying either "Capitalize first letter of every sentence", or "Correct accidental use of cAPS LOCK key". There is no exception rule for such cases (currently only "Abbreviations (no Subsequent Capital)" and "Words With TWo INitial CApitals" exist, that don't fit to the case).

The proposal is to add such an exclusion rule; it might be either the third list in addition to the two existing, or an option to ignore words from dictionary which start with a lowercase, but contain uppercase characters in the middle.

Comment 1 Jim 2018-11-29 08:30:20 UTC

Bug confirmed.

Version: 6.3.0.0.alpha0+ (x64)
Build ID: 0f25a3c36f27fd51453b9a9115f236b83c143684
CPU threads: 4; OS: Windows 10.0; UI render: GL; VCL: win; 
TinderBox: Win-x86_64@42, Branch:master
Locale: zh-TW (zh_TW); UI-Language: en-US
Calc: threaded

Bug confirmed on this version.

Comment 2 Chiang Han 2018-11-29 08:33:11 UTC Comment hidden (no-value)

Bug not reproducible in version

Although it will automatically adjust the prefix for you, but you can still adjust the case by manual method.

Version: 6.3.0.0.alpha0+ (x64)
Build ID: 0f25a3c36f27fd51453b9a9115f236b83c143684
CPU threads: 4; OS: Windows 10.0; UI render: default; VCL: win; 
TinderBox: Win-x86_64@42, Branch:master, Time: 2018-11-27_20:06:55
Locale: zh-TW (zh_TW); UI-Language: en-US

Comment 3 Heiko Tietze 2018-11-30 10:21:35 UTC

Can we merge the two lists and have only one with exceptions for all autoformatting? Three lists sounds like a suboptimal solution, even worse than the two current.

Comment 4 Cor Nouws 2018-12-04 19:52:17 UTC

(In reply to Heiko Tietze from comment #3)
> Can we merge the two lists and have only one with exceptions for all
> autoformatting? Three lists sounds like a suboptimal solution, even worse
> than the two current.

I think so - unless it give trouble to some languages/language teams?

Comment 5 tommy27 2018-12-05 09:48:31 UTC

makes sense. valid enhancement request.

in the meantime a workaround is to put some specific autocorrect items such as "IPhone -> iPhone" and "FMRI -> fMRI" etc. etc.

Comment 6 Heiko Tietze 2018-12-06 13:52:08 UTC

Having only one list that blocks auto correction sounds like the KISS solution. needsDevAdvice for feasibility

Comment 7 tommy27 2018-12-07 05:21:57 UTC

do you mean a "never autocorrect these words" list? sounds nice.

Comment 8 Mike Kaganski 2018-12-07 06:24:13 UTC

The advise should be from developers familiar with linguistics, so that they could see if it's possible to use such a single list, or it would make some scenarios for some languages difficult/wrong.

Comment 9 László Németh 2018-12-07 08:06:43 UTC

Mike, Heiko: Indeed, the list "Words With TWo INitial CApitals" seems perfect for this extension: Words With TWo INitial CApitals or sMALL iNITIAL

Comment 10 Mike Kaganski 2018-12-07 10:18:45 UTC

https://gerrit.libreoffice.org/64759

@Heiko: could you please check if UI is corrected properly and sufficiently. Thanks!

Comment 11 Heiko Tietze 2018-12-07 12:47:31 UTC

(In reply to László Németh from comment #9)
> Mike, Heiko: Indeed, the list "Words With TWo INitial CApitals" seems
> perfect for this extension: Words With TWo INitial CApitals or sMALL iNITIAL

Would have been nicer to have one list of exceptions but this works too.

(In reply to Mike Kaganski from comment #10)
> @Heiko: could you please check if UI is corrected properly and sufficiently.

Like a charm, as always. We should add a few examples to the list like iPhone and fMRI.

Comment 12 Commit Notification 2018-12-07 12:58:05 UTC

Mike Kaganski committed a patch related to this issue.
It has been pushed to "master":

https://git.libreoffice.org/core/+/7bf6d0d9d8e3e2e07726ca6df246619ee1ead3a3%5E%21

tdf#121779: extend TWo CApitals list to also ignore sMALL iNITIALS

It will be available in 6.3.0.

The patch should be included in the daily builds available at
https://dev-builds.libreoffice.org/daily/ in the next 24-48 hours. More
information about daily builds can be found at:
https://wiki.documentfoundation.org/Testing_Daily_Builds

Affected users are encouraged to test the fix and report feedback.