Bug 150907 - Capitalize not working after tamil content
Summary: Capitalize not working after tamil content
Status: NEW
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Writer
Version (earliest affected): 3.5.0 release
Hardware: x86-64 (AMD64) Linux (All)
Importance: medium minor
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: AutoCorrect-Complete
Reported: 2022-09-11 10:36 UTC by surya R
Modified: 2024-09-29 17:19 UTC
CC List: 3 users

See Also:
Crash report or crash signature:


Description surya R 2022-09-11 10:36:22 UTC
Description:
> Capitalization works as expected in an empty document.  
> Capitalization does not work as expected when English text is typed after Tamil text.  


Steps to Reproduce:
1. Open an empty Writer document in LibreOffice.
2. Type the word "chennai" followed by a space. The first letter of the word is capitalized.

Then:

1. Open an empty Writer document in LibreOffice.
2. Type some Tamil text, then the word "tamilnadu" followed by a space (நான் என்பது நான் மட்டும் அல்ல உன் நினைவுகலும் தான் tamilnadu  )

Actual Results:
> Chennai

> நான் என்பது நான் மட்டும் அல்ல உன் நினைவுகலும் தான் tamilnadu 

Expected Results:


> Chennai

> நான் என்பது நான் மட்டும் அல்ல உன் நினைவுகலும் தான் Tamilnadu 


Reproducible: Always


User Profile Reset: No



Additional Info:
In "Tamilnadu", the "t" should be capitalized, but it stays lowercase.
Comment 1 Michael Warner 2022-09-11 18:20:02 UTC
How is the end of a sentence indicated in Tamil? Is there a punctuation character that is used? Is 'Tamilnadu' the beginning of a new sentence where தான் was the end of the previous sentence? 


For languages that don't use a clear punctuation mark to indicate the end of a sentence, ISTR reading that either ICU or CLDR tries to guess based on certain heuristic rules in these cases, but it isn't always correct.
Comment 2 surya R 2022-09-26 07:36:01 UTC
I tried again using the correct punctuation, as you suggested, but the expected output still did not appear.

(In reply to Michael Warner from comment #1)
> How is the end of a sentence indicated in Tamil? Is there a punctuation
> character that is used? Is 'Tamilnadu' the beginning of a new sentence where
> தான் was the end of the previous sentence? 
> 
> 
> For languages that don't use a clear punctuation mark to indicate the end of
> a sentence, ISTR reading that either ICU or CLDR tries to guess based on
> certain heuristic rules in these cases, but it isn't always correct.
Comment 3 QA Administrators 2022-09-27 03:33:14 UTC Comment hidden (obsolete)
Comment 4 ⁨خالد حسني⁩ 2023-02-15 14:19:33 UTC
Confirmed. Even when the Tamil text ends with a period, the next English word is not capitalized. When the Tamil text ends with an English word followed by a period, the next English word is capitalized.
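
For reference, the expected behavior described in this report can be written down as a small script-independent rule. The following is only a minimal sketch of that expectation, not LibreOffice's autocorrect code and not a diagnosis of the bug; the function name `capitalize_sentence_starts` and the regular expression are hypothetical and exist only for illustration.

```python
import re

# Hypothetical illustration only, not LibreOffice code.
# Matches the first word character at the start of the text, or the first
# word character after sentence-ending punctuation plus whitespace.
# The pattern does not look at the script of the preceding word.
_SENTENCE_START = re.compile(r"(^|[.!?]\s+)(\w)")


def capitalize_sentence_starts(text: str) -> str:
    """Uppercase the first letter of each sentence, whatever script precedes it."""
    # str.upper() leaves caseless letters (such as Tamil) unchanged.
    return _SENTENCE_START.sub(
        lambda m: m.group(1) + m.group(2).upper(), text
    )


if __name__ == "__main__":
    print(capitalize_sentence_starts("chennai "))
    print(capitalize_sentence_starts("நான் தான். tamilnadu "))
```

Under this rule, "tamilnadu" after a Tamil sentence ending in a period becomes "Tamilnadu", which is the result this report expects from Writer.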
Comment 5 Buovjaga 2024-09-29 17:19:15 UTC
Still reproducible. Already present in 3.5.

Arch Linux 64-bit
Version: 25.2.0.0.alpha0+ (X86_64) / LibreOffice Community
Build ID: 59fc2b5ca9678855feb30345761dad739b546edc
CPU threads: 8; OS: Linux 6.10; UI render: default; VCL: kf6 (cairo+wayland)
Locale: fi-FI (fi_FI.UTF-8); UI: en-US
Calc: CL threaded
Built on 27 September 2024