Bug 137722 - [HYPHENATION] PT-BR language does not break a word
Status: UNCONFIRMED
Alias: None
Product: LibreOffice
Classification: Unclassified
Component: Linguistic
Version (earliest affected): 6.4.6.2 release
Hardware: All All
Importance: medium normal
Assignee: Not Assigned
URL:
Whiteboard:
Keywords:
Depends on:
Blocks: Hyphenation
Reported: 2020-10-24 18:50 UTC by João Paulo
Modified: 2021-06-14 03:41 UTC (History)

See Also:
Crash report or crash signature:
Regression By:


Attachments
Sample ODT showing the hyphenation error (11.62 KB, application/vnd.oasis.opendocument.text)
2020-10-24 19:06 UTC, João Paulo
Sample PDF showing the hyphenation error (25.14 KB, application/pdf)
2020-10-24 19:07 UTC, João Paulo

Description João Paulo 2020-10-24 18:50:27 UTC
Description:
PT-BR hyphenation does not break the word "justifiquem" / "jus-ti-fi-quem".

Steps to Reproduce:
Tested on LibreOffice 6.4.6 (64-bit) and on LibreOffice Portable 7.0.1.2:

Version: 7.0.1.2 (x86)
Build ID: 7cbcfc562f6eb6708b5ff7d7397325de9e764452
CPU threads: 8; OS: Windows 10.0 Build 19041; UI render: Skia/Raster; VCL: win
Locale: pt-BR (pt_BR); UI: en-US
Calc: CL

1. Create a new document.
2. Change the Default Paragraph Style, Text Flow, to enable hyphenation.  Change Font, Language, to Portuguese (Brazil).
3. Type any sequence of words followed by "justifiquem" (without quotation marks), long enough that the word falls at the end of the line, where the hyphenator is expected to break it.

Actual Results:
LibreOffice Writer breaks the line before the word "justifiquem" instead of breaking it on the following hyphen marks: jus-ti-fi-quem.

Expected Results:
LibreOffice Writer should break the word "justifiquem" on any of the following hyphen marks: jus-ti-fi-quem.
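
For anyone scripting a check of these cases, the hyphen marks above can be converted into character offsets within the plain word. This is only a sketch in Python; the helper name `break_offsets` is hypothetical and is not part of LibreOffice or its hyphenation dictionaries.

```python
from typing import List, Tuple


def break_offsets(marked: str, mark: str = "-") -> Tuple[str, List[int]]:
    """Split a word written with hyphen marks into the plain word
    and the list of offsets where a break is expected."""
    parts = marked.split(mark)
    offsets = []
    pos = 0
    # Every part except the last one ends at an expected break point.
    for part in parts[:-1]:
        pos += len(part)
        offsets.append(pos)
    return "".join(parts), offsets


word, offsets = break_offsets("jus-ti-fi-quem")
print(word, offsets)
```

Note that the sketch treats every "-" as a hyphen mark, so it is not suitable for words that contain a real hyphen.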


Reproducible: Always


User Profile Reset: No



Additional Info:
LibreOffice Portable ships with Brazilian Portuguese installed by default, so it allows a quick test without installing anything.

https://www.libreoffice.org/download/portable-versions
Comment 1 João Paulo 2020-10-24 19:06:23 UTC
Created attachment 166679 [details]
Sample ODT showing the hyphenation error
Comment 2 João Paulo 2020-10-24 19:07:13 UTC
Created attachment 166680 [details]
Sample PDF showing the hyphenation error
Comment 3 João Paulo 2020-10-24 19:12:17 UTC
I think the error happens only when something other than the plain SPACE (U+0020) is used: when I tested with a plain space, the error did not appear.  I had used the EM SPACE (U+2003) after "justifiquem."  Maybe the error will also appear with other horizontal whitespace characters (https://en.wikipedia.org/wiki/Whitespace_character#Unicode).
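
To try the other horizontal whitespace characters systematically, a candidate list could be generated from the Unicode database. A minimal sketch, assuming Python's standard `unicodedata` module and taking the general category Zs (space separator) as the set of interest; the function names `space_separators` and `test_lines` are hypothetical. Characters outside Zs, such as TAB, are not covered.

```python
import sys
import unicodedata


def space_separators():
    """Return every character whose Unicode general category is Zs."""
    return [
        chr(cp)
        for cp in range(sys.maxunicode + 1)
        if unicodedata.category(chr(cp)) == "Zs"
    ]


def test_lines(word="justifiquem"):
    """Pair each space separator's name with the word followed by it."""
    return [(unicodedata.name(ch), word + ch) for ch in space_separators()]


# List the candidates; each generated string can be pasted at a line end in Writer.
for name, text in test_lines():
    print(f"U+{ord(text[-1]):04X} {name}")
```

Each generated string could then be placed at the end of a line in a test document to see which characters suppress hyphenation.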
Comment 4 João Paulo 2020-12-24 18:13:00 UTC
Also the words "próxima" and "surpreendido" are not hyphenated correctly:

pró-xi-ma
sur-pre-en-di-do
Comment 5 João Paulo 2020-12-25 19:42:22 UTC
"atacá-lo" (word and suffix) should be hyphenated as "a-ta-cá-lo".

"como" should be "co-mo".

"desestruturá-lo" should be "de-ses-tru-tu-rá-lo".

"redação.[EM SPACE]" also is not hyphenated correctly (re-da-ção), but only because of the [EM SPACE] (Unicode U+2003) following the period.
Comment 6 Julien Nabet 2021-01-10 13:21:24 UTC
João: just to be sure, you got no hyphenation pb when using plain space (U+0020)?

Indeed, could it be expected that hyphenation is disabled with em space?

Raimundo: noticing you're the author of pt-BR hyphenation according to README_hyph_pt_BR.txt, thought you might be interested in this one.
Comment 7 João Paulo 2021-06-13 08:17:51 UTC
(In reply to Julien Nabet from comment #6)
> João: just to be sure, you got no hyphenation pb when using plain space
> (U+0020)?
> 
> Indeed, could it be expected that hyphenation is disabled with em space?
> 
> Raimundo: noticing you're the author of pt-BR hyphenation according to
> README_hyph_pt_BR.txt, thought you might be interested in this one.

Sorry for the late reply, Julien. I have no hyphenation problem in PT-BR when using a plain space, but the word is not hyphenated when it is followed by other whitespace such as the EM SPACE (U+2003).