Bug 158174 - EDITING: AutoCorrect locale options should have an exception for single smart quotes to be substituted back to apostrophes when in the middle of a word
Summary: EDITING: AutoCorrect locale options should have an exception for single smart...
Status: RESOLVED NOTABUG
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer (show other bugs)
Version:
(earliest affected)
7.6.2.1 release
Hardware: All All
: medium enhancement
Assignee: Not Assigned
URL:
Whiteboard:
Keywords: needsUXEval
Depends on:
Blocks: AutoCorrect-Complete
  Show dependency treegraph
 
Reported: 2023-11-11 06:31 UTC by João Paulo
Modified: 2024-02-13 12:20 UTC (History)
6 users (show)

See Also:
Crash report or crash signature:


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description João Paulo 2023-11-11 06:31:59 UTC
This is an enhancement request for AutoCorrect.

The AutoCorrect locale options should have an exception for converting back the curved single quote mark back as an apostrophe mark when inside a composite word, such as: d’água -> d'água.

The reason is that (at least in the English, French and Portuguese languages) the apostrophe mark is used to indicate the omission of letters in compound words (and other use cases, as the English genitive case, marking of plurals etc.), such as these examples:

* Arraial d'Ajuda
* Caixa d'água
* Copo d'água
* Estrela d'alva
* Galinha d'Angola
* Pau d'arco
* Santa Bárbara d'Oeste
* Vozes d'África

* CD's
* Oakland A's
* VIP's

See https://en.wikipedia.org/wiki/Apostrophe, https://en.wikipedia.org/wiki/Apostrophe#Non-English_use and https://pt.wikipedia.org/wiki/Ap%C3%B3strofo for uses of apostrophe.
Comment 1 Dieter 2023-11-27 20:16:11 UTC
Joao, thank you for reporting that idea. Just to make sure, that i understand it in correct way:

Actual result
AutoCorrect changes apostroph to a curved single quote mark in the middle of a word

Desired result
AutoCorrect shouldn't change apostroph in the middle of a word

Correct?

cc: Design-Team
Comment 2 Heiko Tietze 2023-11-28 08:03:57 UTC
Not sure it's the auto correction Localized Options that kicks in here. If I enter CD's I get CD's, CD`s , CD´s, CD’s, CD`s - depending on the character I type (German keyboard layout). But 'CD' is being converted to ‚CD‘ (in fact this function replaces start/end quotes).

Is the string maybe listed under Replacement (I get typographical ' (Right Single Quotation Mark, U+2019) when typing the simple apostrophe ' (U+0027), defined explicitly for "wird's".

=> NEEDINFO
Comment 3 João Paulo 2023-11-28 18:08:25 UTC
(In reply to Dieter from comment #1)
> Joao, thank you for reporting that idea. Just to make sure, that i
> understand it in correct way:
> 
> Actual result
> AutoCorrect changes apostroph to a curved single quote mark in the middle of
> a word
> 
> Desired result
> AutoCorrect shouldn't change apostroph in the middle of a word
> 
> Correct?
> 
> cc: Design-Team

Yes, Dieter, you are correct, that's what I meant.
Comment 4 QA Administrators 2023-11-29 03:14:36 UTC Comment hidden (obsolete)
Comment 5 Heiko Tietze 2024-01-31 09:11:55 UTC
László, can you clarify what exactly is needed?
Comment 6 László Németh 2024-02-13 12:01:24 UTC
(U+2019) RIGHT SINGLE QUOTATION MARK is the preferred character to use also for apostrophe, so this is the real apostrophe (i.e. the typographical or traditional one, which is used for books etc. in printing: sometimes it's not curved, but more straight, depending on the design of the font face):

Unicode standard: https://unicode.org/charts/PDF/U2000.pdf

While the (ASCII/typewriter) APOSTROPHE is not the preferred character for apostrophe, despite its Unicode name:

APOSTROPHE
= apostrophe-quote (1.0)
= single quote
= APL quote
• neutral (vertical) glyph with mixed usage
• 2019 ’ is preferred for apostrophe
• preferred characters in English for paired
quotation marks are 2018 ‘ & 2019 ’
• 05F3 ׳ is preferred for geresh when writing
Hebrew
→ 02B9 ʹ modifier letter prime
→ 02BC ʼ modifier letter apostrophe
→ 02C8 ˈ modifier letter vertical line
→ 0301 $́ combining acute accent
→ 030D $̍ combining vertical line above
→ 05F3 ׳ hebrew punctuation geresh
→ 2018 ‘ left single quotation mark
→ 2019 ’ right single quotation mark
→ 2032 ′ prime
→ A78C ꞌ latin small letter saltillo

Source: https://unicode.org/charts/PDF/U0000.pdf
Comment 7 László Németh 2024-02-13 12:13:54 UTC
@João: thanks for your bug report! It's worth to check the apostrophes in books. For example, see the book cover of The accidental apostrophe (and the apostrophes inside in the book):

https://www.google.hu/books/edition/The_Accidental_Apostrophe/7Zo6DwAAQBAJ?hl=hu&gbpv=1&dq=portuguese+apostrophe&pg=PT11&printsec=frontcover  

(A similar issue: I was very surprised to notice 20 years ago, that there are no "fi" letter sequences in the books, but special printing marks instead of them, the ligatures: fi (U+FB01). Advanced fonts uses them automatically, in LibreOffice, too, see Linux Libertine G or OpenType fonts. Font feature "quot" replaces the ASCII double quotation marks to the correct one, depending on the text of the document in LibreOffice: http://numbertext.org/linux/fontfeatures.pdf, e.g. "quot" -> “quot” in English, and „quot” in Hungarian.)
Comment 8 László Németh 2024-02-13 12:20:13 UTC
@Heiko: thanks for your question!